• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Enhancing retrieval-augmented generation for interoperable industrial knowledge representation and inference toward cognitive digital twins
 
  • Details
  • Full
Options
2025
Journal Article
Title

Enhancing retrieval-augmented generation for interoperable industrial knowledge representation and inference toward cognitive digital twins

Abstract
The escalating volume and complexity of digital data within the manufacturing sector highlight an urgent need for an efficient knowledge representation and inference solution. Traditional approaches, which often rely on ontologies, knowledge graphs, or digital twins (DTs) for knowledge representation, and rule-based algorithms for inference, are becoming insufficient. The emergence of generative AI, particularly large language models (LLM) and retrieval-augmented generation (RAG), offers a more efficient and intelligent alternative. However, the performance of an RAG system is heavily dependent on the quality of retrieval results, which can be compromised by domain-specific knowledge and retrieval distractors. To address this challenge, we propose to enhance RAG systems tailored for the manufacturing industry in two aspects. First, we utilize the Asset Administration Shell (AAS), which represents the German industrial perspective on cognitive DTs, to create a representation of assets and knowledge in standardized information models. This establishes a robust foundation for the retrieval sources. Second, we propose a contrastive selection loss (CSL) to fine-tune an open-source LLM to refine the retrieval results. Fine-tuned LLMs possess higher efficiency and accuracy on task- and domain-specific datasets, while the CSL further enhances the model's ability to distinguish true positives from similar distractors. The enhanced RAG system is demonstrated in a robotic work cell integration use case and evaluated through a novel evaluation protocol. Additionally, the retrieval effectiveness of the RAG system, specifically the LLM fine-tuned with CSL, is extensively validated through statistical experiments. The results confirm its superior performance over state-of-the-art methods, including GPT-4 with in-context learning prompts and other fine-tuned models.
Author(s)
Shi, Dachuan
Fraunhofer-Institut für Produktionstechnik und Automatisierung IPA  
Li, Jianzhang
Northwestern Polytechnical University
Meyer, Olga  
Fraunhofer-Institut für Produktionstechnik und Automatisierung IPA  
Bauernhansl, Thomas  
Fraunhofer-Institut für Produktionstechnik und Automatisierung IPA  
Journal
Computers in industry  
Open Access
DOI
10.1016/j.compind.2025.104330
Additional link
Full text
Language
English
Fraunhofer-Institut für Produktionstechnik und Automatisierung IPA  
Keyword(s)
  • Asset administration shell

  • Digital twin

  • Entity matching

  • Large language model

  • Retrieval-augmented generation

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024