December 2022
Conference Paper
Title

KPI-EDGAR: A Novel Dataset and Accompanying Metric for Relation Extraction from Financial Documents

Abstract
We introduce KPI-EDGAR, a novel dataset for Joint Named Entity Recognition and Relation Extraction building on financial reports uploaded to the Electronic Data Gathering, Analysis, and Retrieval (EDGAR) system, where the main objective is to extract Key Performance Indicators (KPIs) from financial documents and link them to their numerical values and other attributes. We further provide four accompanying baselines for benchmarking potential future research. Additionally, we propose a new way of measuring the success of said extraction process by incorporating a word-level weighting scheme into the conventional F1 score to better model the inherently fuzzy borders of the entity pairs of a relation in this domain.
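The abstract does not spell out the weighting scheme, so the following is only a rough sketch of the general idea of giving partial credit for fuzzy entity borders. It is not the metric defined in the paper. It assumes that entities are token spans, that partial credit is the Jaccard overlap of tokens, and that a relation's credit is the product of the overlaps of its two entities. The names `token_overlap`, `relation_score`, and `soft_f1` are hypothetical.

```python
from typing import List, Tuple

Span = Tuple[int, int]        # (start, end) token indices, end exclusive
Relation = Tuple[Span, Span]  # e.g. (KPI span, value span)


def token_overlap(a: Span, b: Span) -> float:
    """Jaccard overlap of two token spans, in [0, 1] (assumed weighting)."""
    inter = max(0, min(a[1], b[1]) - max(a[0], b[0]))
    union = (a[1] - a[0]) + (b[1] - b[0]) - inter
    return inter / union if union else 0.0


def relation_score(pred: Relation, gold: Relation) -> float:
    """Partial credit for one relation: product of both entity overlaps."""
    return token_overlap(pred[0], gold[0]) * token_overlap(pred[1], gold[1])


def soft_f1(preds: List[Relation], golds: List[Relation]) -> float:
    """F1 in which each relation contributes its best partial-credit match."""
    if not preds or not golds:
        return 0.0
    precision = sum(max(relation_score(p, g) for g in golds) for p in preds) / len(preds)
    recall = sum(max(relation_score(p, g) for p in preds) for g in golds) / len(golds)
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Gold KPI spans tokens 0..2, the prediction misses the first token.
gold = [((0, 3), (5, 6))]
pred = [((1, 3), (5, 6))]
score = soft_f1(pred, gold)
```

Under a strict exact-match F1 the prediction above would count as a complete miss. With the assumed overlap weighting it keeps two thirds of the credit, which is the kind of tolerance to border disagreements that the abstract motivates.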
Author(s)
Deußer, Tobias
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Ali, Syed Musharraf
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Hillebrand, Lars Patrick  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Nurchalifah, Desiana Dien
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Jacob, Basil
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Bauckhage, Christian  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Sifa, Rafet  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Mainwork
21st IEEE International Conference on Machine Learning and Applications, ICMLA 2022. Proceedings  
Conference
International Conference on Machine Learning and Applications 2022  
Open Access
DOI
10.1109/ICMLA55696.2022.00254
10.24406/publica-1319
File(s)
KPI-EDGAR A Novel Dataset and Accompanying Metric for Relation Extraction from Financial Documents.pdf (255.27 KB)
Rights
Under Copyright
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • text mining
  • natural language processing
  • relation extraction
  • named entity recognition
  • machine learning
