• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Mapping large scale research metadata to linked data: A performance comparison of HBase, CSV and XML
 
  • Details
  • Full
Options
2015
Conference Paper
Title

Mapping large scale research metadata to linked data: A performance comparison of HBase, CSV and XML

Abstract
OpenAIRE, the Open Access Infrastructure for Research in Europe, comprises a database of all EC FP7 and H2020 funded research projects, including metadata of their results (publications and datasets). These data are stored in an HBase NoSQL database, post-processed, and exposed as HTML for human consumption, and as XML through a web service interface. As an intermediate format to facilitate statistical computations, CSV is generated internally. To interlink the OpenAIRE data with related data on the Web, we aim at exporting them as Linked Open Data (LOD). The LOD export is required to integrate into the overall data processing workflow, where derived data are regenerated from the base data every day. We thus faced the challenge of identifying the best-performing conversion approach. We evaluated the performances of creating LOD by a MapReduce job on top of HBase, by mapping the intermediate CSV files, and by mapping the XML output.
Author(s)
Vahdati, Sahar
Karim, Farah
Huang, J.-Y.
Lange, Christoph  orcid-logo
Mainwork
Metadata and semantics research. 9th research conference, MTSR 2015  
Conference
Metadata and Semantics Research Conference (MTSR) 2015  
DOI
10.1007/978-3-319-24129-6_23
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024