• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Documents as Intelligent Agents
 
  • Details
  • Full
Options
January 1, 2023
Conference Paper
Title

Documents as Intelligent Agents

Title Supplement
An Approach to Optimize Document Representations in Semantic Search
Abstract
Finding good representations for documents in the context of semantic search is a relevant problem with applications in domains like medicine, research or data search. In this paper we propose to represent each document in a search index by a number of different contextual embeddings. We define and evaluate eight different strategies to combine embeddings of document title, document passages and relevant user queries by means of linear combinations, averaging, and clustering. In addition we apply an agent-based approach to search whereby each data item is modeled as an agent that tries to optimize its metadata and presentation over time by incorporating information received via the users' interactions with the search system. We validate the document representation strategies and the agent-based approach in the context of a medical information retrieval dataset and find that a linear combination of the title embedding, mean passage embedding and the mean over the clustered embeddings of relevant queries offers the best trade-off between search-performance and index size. We further find, that incorporating embeddings of relevant user queries can significantly improve the performance of representation strategies based on semantic embeddings. The agent-based system performs slightly better than the other representation strategies but comes with a larger index size.
Author(s)
Strauß, Oliver  
Fraunhofer-Institut für Arbeitswirtschaft und Organisation IAO  
Kett, Holger Joachim
Fraunhofer-Institut für Arbeitswirtschaft und Organisation IAO  
Mainwork
WEBIST 2023, 19th International Conference on Web Information Systems and Technologies. Proceedings  
Project(s)
Incentives and Economics of Data Sharing  
Funder
Bundesministerium für Bildung und Forschung -BMBF-  
Conference
International Conference on Web Information Systems and Technologies 2023  
Open Access
File(s)
Download (474.18 KB)
Rights
CC BY-NC-ND 4.0: Creative Commons Attribution-NonCommercial-NoDerivatives
DOI
10.5220/0012239200003584
10.24406/publica-2469
Language
English
Fraunhofer-Institut für Arbeitswirtschaft und Organisation IAO  
Keyword(s)
  • Agent-Based Retrieval

  • Dataset Research

  • Semantic Search

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024