• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Artikel
  4. An ontology-based text mining dataset for extraction of process-structure-property entities
 
  • Details
  • Full
Options
October 10, 2024
Journal Article
Title

An ontology-based text mining dataset for extraction of process-structure-property entities

Abstract
While large language models learn sound statistical representations of the language and information therein, ontologies are symbolic knowledge representations that can complement the former ideally. Research at this critical intersection relies on datasets that intertwine ontologies and text corpora to enable training and comprehensive benchmarking of neurosymbolic models. We present the MaterioMiner dataset and the linked materials mechanics ontology where ontological concepts from the mechanics of materials domain are associated with textual entities within the literature corpus. Another distinctive feature of the dataset is its eminently fine-grained annotation. Specifically, 179 distinct classes are manually annotated by three raters within four publications, amounting to 2191 entities that were annotated and curated. Conceptual work is presented for the symbolic representation of causal composition-process-microstructure-property relationships. We explore the annotation consistency between the three raters and perform fine-tuning of pre-trained language models to showcase the feasibility of training named entity recognition models. Reusing the dataset can foster training and benchmarking of materials language models, automated ontology construction, and knowledge graph generation from textual data.
Author(s)
Durmaz, Ali Riza  
Fraunhofer-Institut für Werkstoffmechanik IWM  
Thomas, Akhil
Fraunhofer-Institut für Werkstoffmechanik IWM  
Mishra, Lokesh
IBM Research
Niranjan Murthy, Rachana
Fraunhofer-Institut für Werkstoffmechanik IWM  
Straub, Thomas  
Fraunhofer-Institut für Werkstoffmechanik IWM  
Journal
Scientific data  
Project(s)
Intelligent-datengeführtes Prozessdesign für ermüdungsresistente Stahlbauteile am Beispiel bainitischer Mikrostruktur iBain  
Funder
Bundesministerium für Bildung und Forschung -BMBF-  
Open Access
DOI
10.1038/s41597-024-03926-5
Additional full text version
Landing Page
Language
English
Fraunhofer-Institut für Werkstoffmechanik IWM  
Keyword(s)
  • ontology

  • materials mechanics

  • text mining

  • named entity recognition

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024