2017
Conference Paper
Title

Towards a multi-way similarity join operator

Abstract
Increasing volumes of data consumed and managed by enterprises demand effective and efficient data integration approaches. Additionally, the amount and variety of data sources impose further challenges for query engines. However, the majority of existing query engines rely on binary join-based query planners and execution methods with complexity that depends on the number of involved data sources. Moreover, traditional binary join operators are not able to distinguish between similar and different tuples, treating every incoming tuple as an independent object. Thus, if tuples are represented differently but refer to the same real-world entity, they are still considered as non-related objects. We propose MSimJoin, an approach towards a multi-way similarity join operator. MSimJoin accepts more than two inputs and is able to identify duplicates that correspond to similar entities from incoming tuples using Semantic Web technologies. Therefore, MSimJoin allows for the reduction of both the height of tree query plans and duplicated results.
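The idea of joining more than two inputs in one operator while merging tuples that refer to the same entity can be illustrated with a minimal sketch. This is not the authors' MSimJoin implementation: the abstract says similarity is determined using Semantic Web technologies, whereas the sketch below substitutes a plain token-based Jaccard similarity. The function names (`tokens`, `jaccard`, `multiway_similarity_join`), the `label` field, the sample data, and the threshold of 0.5 are all hypothetical.

```python
def tokens(record):
    """Lower-cased word tokens taken from all values of a record."""
    words = set()
    for value in record.values():
        words.update(str(value).lower().replace(",", " ").split())
    return words


def jaccard(a, b):
    """Jaccard similarity of two token sets (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def multiway_similarity_join(sources, threshold=0.5):
    """Join any number of sources in one pass instead of a tree of binary joins.

    Each incoming record is greedily assigned to the first cluster whose
    tokens are similar enough; otherwise it starts a new cluster. Only
    clusters with at least one record from every source are returned.
    Assumption: Jaccard stands in for the paper's similarity measure.
    """
    clusters = []  # each cluster: {"tokens": set, "records": [(source_index, record)]}
    for index, source in enumerate(sources):
        for record in source:
            record_tokens = tokens(record)
            for cluster in clusters:
                if jaccard(record_tokens, cluster["tokens"]) >= threshold:
                    cluster["tokens"] |= record_tokens
                    cluster["records"].append((index, record))
                    break
            else:
                clusters.append({"tokens": set(record_tokens),
                                 "records": [(index, record)]})
    all_sources = set(range(len(sources)))
    return [c["records"] for c in clusters
            if {i for i, _ in c["records"]} == all_sources]


# Hypothetical sample data: three sources describing overlapping entities.
source_a = [{"label": "Bonn Germany"}, {"label": "Paris France"}]
source_b = [{"label": "Germany, Bonn"}, {"label": "Rome Italy"}]
source_c = [{"label": "bonn germany city"}]

result = multiway_similarity_join([source_a, source_b, source_c])
for matched in result:
    print(matched)
```

In this sample, the three differently written "Bonn" records are grouped into a single joined result, while records that appear in only one source are dropped. A binary-join plan would need two operators stacked in a tree to combine the same three inputs.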
Author(s)
Galkin, Mikhail
Vidal, Maria-Esther
Auer, Sören
Published in
New trends in databases and information systems
Conference
European Conference on Advances in Databases and Information Systems (ADBIS) 2017
International Workshop on Data Science - Methodologies and Use-Cases (DaS) 2017
DOI
10.1007/978-3-319-67162-8_26
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS