Fraunhofer-Gesellschaft
2004
Conference Paper
Title

Language modeling for effective construction of domain specific thesauri

Abstract
This paper presents an approach for the effective construction of domain-specific thesauri. We assume that the collection is partitioned into document categories. By taking advantage of these pre-defined categories, we formulate a new topical language model that weights term topicality more accurately. With the help of information theory, interesting relationships among thesaurus elements are discovered deductively. Using the "Layer-Seeds" clustering algorithm, topical terms from documents in a given category are organized according to their relationships in a tree-like hierarchical structure, that is, a thesaurus. Experimental results show that the thesaurus contains satisfactory structures, although it differs to some extent from a manually created thesaurus. A first evaluation of the thesaurus in a query expansion task yields evidence that recall can be increased without loss of precision.
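The abstract does not state the formula of the topical language model, so the following is only an illustrative sketch of the general idea of using pre-defined categories to weight term topicality. It assumes, purely for illustration, that a term's topicality in a category is its pointwise Kullback-Leibler contribution: the term's probability under the category's unigram model times the log ratio of that probability to its probability under the whole collection. The function name `topicality_scores`, the toy documents, and the scoring rule are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def topicality_scores(docs_by_category, category):
    """Score terms of one category against the whole collection.

    Assumed scoring rule (not the paper's): the pointwise KL contribution
    P(t | category) * log(P(t | category) / P(t | collection)).
    """
    # Unigram counts for the chosen category.
    cat_counts = Counter(
        term for doc in docs_by_category[category] for term in doc
    )
    # Unigram counts for the full collection (all categories).
    all_counts = Counter(
        term
        for docs in docs_by_category.values()
        for doc in docs
        for term in doc
    )
    cat_total = sum(cat_counts.values())
    all_total = sum(all_counts.values())

    scores = {}
    for term, count in cat_counts.items():
        p_cat = count / cat_total
        p_all = all_counts[term] / all_total  # never zero: term occurs in category
        scores[term] = p_cat * math.log(p_cat / p_all)
    return scores


# Toy collection: tokenized documents grouped by a pre-defined category.
docs = {
    "databases": [["query", "index", "the"], ["query", "join", "the"]],
    "networks": [["packet", "router", "the"], ["packet", "the", "latency"]],
}
scores = topicality_scores(docs, "databases")
ranked = sorted(scores, key=scores.get, reverse=True)
```

Under this assumed rule, a term that is equally frequent inside and outside the category (here "the") scores zero, while terms concentrated in the category rank higher. Relationship discovery and the "Layer-Seeds" clustering step are not sketched because the abstract gives no details about them.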
Author(s)
Chen, L.B.
Thiel, U.
Host publication
Natural language processing and information systems
Conference
International Conference on Applications of Natural Language to Information Systems (NLDB) 2004
DOI
10.1007/b104284
Language
English
IPSI