• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. SKIE-SRL: Structured Key Information Extraction from Business Documents Using Statistical Relational Learning
 
  • Details
  • Full
Options
2025
Conference Paper
Title

SKIE-SRL: Structured Key Information Extraction from Business Documents Using Statistical Relational Learning

Abstract
Seamless automation of business processes requires the automatic analysis of complex business documents like invoices or contracts. Extracting key information in these semi-structured documents poses a significant challenge. While multimodal language models have demonstrated state-of-the-art results in this field, their application to document types with high complexity is still challenging. They neglect the underlying document structure and existing dependencies between information types. On complex document types commonly encountered in industry, this results in preventable errors such as missing predictions for mandatory elements or wrong extractions of interdependent elements. In this paper, we present SKIE-SRL, a Statistical Relational Learning model for Structured Key Information Extraction. This hybrid approach unifies symbolic reasoning with multimodal language models. It extends the capabilities of pretrained language models with expert-provided knowledge about the document structure and application domain. We evaluate our model on a multilingual invoice data set from industry and compare it to three state-of-the-art language models as well as a stacked ensemble. Our approach outperforms all benchmarks by 4% F1-score.
Author(s)
Kirsch, Birgit  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Yu, Xiang
SAP SE
Chakraborty, Nilesh  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Doll, Niclas
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Giesselbach, Sven  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Rüping, Stefan  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Mainwork
Machine Learning, Optimization, and Data Science. 10th International Conference, LOD 2024. Pt.II  
Conference
International Conference on Machine Learning, Optimization, and Data 2024  
DOI
10.1007/978-3-031-82484-5_7
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • Information Extraction

  • Natural Language Processing

  • Statistical Relational Learning

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024