• English
  • Deutsch
  • Log In
    or
  • Research Outputs
  • Projects
  • Researchers
  • Institutes
  • Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. GEMMS: A Generic and Extensible Metadata Management System for data lakes
 
  • Details
  • Full
Options
2016
  • Konferenzbeitrag

Titel

GEMMS: A Generic and Extensible Metadata Management System for data lakes

Abstract
The heterogeneity of sources in Big Data systems requires new integration approaches which can handle the large volume of the data as well as its variety. Data lakes have been proposed to reduce the upfront integration costs and to provide more flexibility in integrating and analyzing information. In data lakes, data from the sources is copied in its original structure to a repository; only a syntactic integration is done as data is stored in a common semi-structured format. Metadata plays an important role, as the source data is not loaded into an integrated repository with a unified schema; the data has to come with its own metadata. This paper presents GEMMS, a Generic and Extensible Metadata Management System for data lakes which extracts metadata from the sources and manages the structural and semantical information in an extensible metamodel. The system has been developed with a focus on scientific data management in the life sciences which is often only file-based with limited query functiona
Author(s)
Quix, C.
Hai, R.
Vatov, I.
Hauptwerk
CAiSE Forum, CAiSE-Forum 2016, at the 28th International Conference on Advanced Information Systems Engineering, CAiSE 2016
Konferenz
International Conference on Advanced Information Systems Engineering (CAiSE) 2016
Thumbnail Image
Externer Link
Externer Link
Language
Englisch
google-scholar
FIT
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Send Feedback
© 2022