2020
Conference Paper
Title

OWLStats: Distributed computation of OWL dataset statistics

Abstract
Ontologies are now used in many application areas, including Artificial Intelligence, Natural Language Processing, Data Integration, and Knowledge Management. Knowing the internal structure, distribution, and coherence of published datasets makes them easier to reuse, interlink, integrate, reason over, or query. As OWL datasets have become more prevalent, there is a pressing need to obtain a clear view of them. In this paper, we present OWLStats, a software component for computing statistical information about large-scale OWL datasets in a distributed manner. We present the primary distributed in-memory approach for computing 32 different statistical criteria for OWL datasets using Apache Spark, which can scale horizontally across a cluster of machines. OWLStats has been integrated into the SANSA framework. Preliminary results indicate that OWLStats scales linearly with the size of the data.
Author(s)
Mohamed, Heba
Universität Bonn
Fathalla, Said M.
Universität Bonn
Lehmann, Jens  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Jabeen, Hajira
Universität zu Köln
Mainwork
IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2020. Proceedings  
Project(s)
Learning, Applying, Multiplying Big Data Analytics  
Digital PLAtform and analytic TOOls for eNergy  
Funder
European Commission  
European Commission  
Conference
International Joint Conference on Web Intelligence and Intelligent Agent Technology 2020  
DOI
10.1109/WIIAT50758.2020.00055
Language
English
Keyword(s)
  • Apache Spark
  • Distributed Processing
  • Large-scale datasets
  • OWL Statistics
  • SANSA Framework