2023
Journal Article
Title

Data Lakes: A Survey of Functions and Systems

Abstract
Data lakes are becoming increasingly prevalent for big data management and data analytics. In contrast to traditional schema-on-write approaches such as data warehouses, data lakes are repositories that store raw data in its original formats and provide a common access interface. Despite strong interest from both academia and industry, considerable ambiguity remains regarding the definition, functions, and available technologies of data lakes. A complete, coherent picture of data lake challenges and solutions is still missing. This survey reviews the development, architectures, and systems of data lakes. We provide a comprehensive overview of research questions for designing and building data lakes. We classify existing approaches and systems by the functions they provide for data lakes, which makes this survey a useful technical reference for designing, implementing, and deploying data lakes. We hope that the thorough comparison of existing solutions and the discussion of open research challenges in this survey will motivate future development of data lake research and practice.
Author(s)
Hai, Rihan
Koutras, Christos
Quix, Christoph  
Fraunhofer-Institut für Angewandte Informationstechnik FIT  
Jarke, Matthias  
Fraunhofer-Institut für Angewandte Informationstechnik FIT  
Journal
IEEE Transactions on Knowledge and Data Engineering
DOI
10.1109/TKDE.2023.3270101
Language
English
Keyword(s)
  • Big Data applications
  • Data discovery
  • Data lake
  • Lakes
  • Maintenance engineering
  • Memory
  • Metadata
  • Metadata management
  • Proposals
  • Semantics
