2020
Book Article
Title

Data Lake

Abstract
Data lakes (DLs) have been proposed as a new concept for centralized data repositories. In contrast to data warehouses (DWs), which usually require a complex and fine-tuned Extract-Transform-Load (ETL) process, DLs use a simpler model that aims only at loading the complete source data in its raw format into the DL. While a more complex ETL process with data transformation and aggregation increases data quality, it can also cause information loss, because irregular or unstructured data that does not fit the integrated DW schema will not be loaded into the DW. Moreover, some data silos might never be connected to integrated data repositories at all because of the complexity of the data integration process. DLs address these problems: they should provide access to the source data in its original format without requiring an elaborate ETL process to ingest the data into the lake.
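The contrast described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the article: the sample records, the `DW_COLUMNS` schema, and the functions `etl_load` and `lake_ingest` are hypothetical names chosen for illustration, and real warehouse and lake pipelines are considerably more involved.

```python
import json

# Hypothetical source records; the third one does not fit the warehouse schema.
SOURCE = [
    {"id": 1, "name": "sensor-a", "value": 3.5},
    {"id": 2, "name": "sensor-b", "value": 4.0},
    {"id": 3, "note": "free-text maintenance log"},
]

# Assumed integrated warehouse schema (illustrative only).
DW_COLUMNS = ("id", "name", "value")


def etl_load(records):
    """Warehouse-style load: keep only records that fit the schema."""
    loaded = []
    for rec in records:
        if all(col in rec for col in DW_COLUMNS):
            loaded.append({col: rec[col] for col in DW_COLUMNS})
    return loaded


def lake_ingest(records):
    """Lake-style ingest: store every record unchanged in serialized form."""
    return [json.dumps(rec, sort_keys=True) for rec in records]


warehouse = etl_load(SOURCE)
lake = lake_ingest(SOURCE)

# The irregular record is missing from the warehouse but kept in the lake.
print(len(warehouse), len(lake))
```

In this sketch the schema check stands in for the transformation step of an ETL process: the record without `name` and `value` is silently dropped from the warehouse, while the lake keeps all three records in their original form.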
Author(s)
Quix, Christoph  
Fraunhofer-Institut für Angewandte Informationstechnik FIT  
Geisler, Sandra  
Fraunhofer-Institut für Angewandte Informationstechnik FIT  
Hai, Rihan
RWTH Aachen University
Mainwork
Encyclopedia of Big Data. Online resource  
DOI
10.1007/978-3-319-32001-4_309-1
Language
English