Bauckhage, Christian

Prof. Dr.-Ing.

Bauckhage, Christian

0000-0001-6615-2128

Now showing 1 - 4 of 4

Uncovering Inconsistencies and Contradictions in Financial Reports using Large Language Models

( 2023-12)
Deußer, Tobias
;
Leonhard, David
;
Hillebrand, Lars Patrick
;
Berger, Armin
;
Khaled, Mohamed
;
Heiden, Sarah
;
Dilmaghani, Tim
;
Kliem, Bernd
;
Loitz, Rüdiger
;
Bauckhage, Christian
;
Sifa, Rafet

Correct identification and correction of contradictions and inconsistencies within financial reports constitute a fundamental component of the audit process. To streamline and automate this critical task, we introduce a novel approach leveraging large language models and an embedding-based paragraph clustering methodology. This paper assesses our approach across three distinct datasets, including two annotated datasets and one unannotated dataset, all within a zero-shot framework. Our findings reveal highly promising results that significantly enhance the effectiveness and efficiency of the auditing process, ultimately reducing the time required for a thorough and reliable financial report audit.
Informed Machine Learning - A Taxonomy and Survey of Integrating Prior Knowledge into Learning Systems

( 2023)
Rueden, Laura von
;
Mayer, Sebastian
;
Beckh, Katharina
;
Georgiev, Bogdan
;
Giesselbach, Sven
;
Heese, Raoul
;
Kirsch, Birgit
;
Walczak, Michal
;
Pfrommer, Julius
;
Pick, Annika
;
Ramamurthy, Rajkumar
;
Garcke, Jochen
;
Bauckhage, Christian
;
Schuecker, Jannis

Despite its great success, machine learning can have its limits when dealing with insufficient training data. A potential solution is the additional integration of prior knowledge into the training process which leads to the notion of informed machine learning. In this paper, we present a structured overview of various approaches in this field. We provide a definition and propose a concept for informed machine learning which illustrates its building blocks and distinguishes it from conventional machine learning. We introduce a taxonomy that serves as a classification framework for informed machine learning approaches. It considers the source of knowledge, its representation, and its integration into the machine learning pipeline. Based on this taxonomy, we survey related research and describe how different knowledge representations such as algebraic equations, logic rules, or simulation results can be used in learning systems. This evaluation of numerous papers on the basis of our taxonomy uncovers key methods in the field of informed machine learning.
KPI-EDGAR: A Novel Dataset and Accompanying Metric for Relation Extraction from Financial Documents

( 2022-12)
Deußer, Tobias
;
Ali, Syed Musharraf
;
Hillebrand, Lars Patrick
;
Nurchalifah, Desiana Dien
;
Jacob, Basil
;
Bauckhage, Christian
;
Sifa, Rafet

We introduce KPI-EDGAR, a novel dataset for Joint Named Entity Recognition and Relation Extraction building on financial reports uploaded to the Electronic Data Gathering, Analysis, and Retrieval (EDGAR) system, where the main objective is to extract Key Performance Indicators (KPIs) from financial documents and link them to their numerical values and other attributes. We further provide four accompanying baselines for benchmarking potential future research. Additionally, we propose a new way of measuring the success of said extraction process by incorporating a word-level weighting scheme into the conventional F 1 score to better model the inherently fuzzy borders of the entity pairs of a relation in this domain.
Combining Machine Learning and Simulation to a Hybrid Modelling Approach: Current and Future Directions

( 2020)
Rüden, Laura von
;
Mayer, Sebastian
;
Sifa, Rafet
;
Bauckhage, Christian
;
Garcke, Jochen

In this paper, we describe the combination of machine learning and simulation towards a hybrid modelling approach. Such a combination of data-based and knowledge-based modelling is motivated by applications that are partly based on causal relationships, while other effects result from hidden dependencies that are represented in huge amounts of data. Our aim is to bridge the knowledge gap between the two individual communities from machine learning and simulation to promote the development of hybrid systems. We present a conceptual framework that helps to identify potential combined approaches and employ it to give a structured overview of different types of combinations using exemplary approaches of simulation-assisted machine learning and machine-learning assisted simulation. We also discuss an advanced pairing in the context of Industry 4.0 where we see particular further potential for hybrid systems. In this paper, we describe the combination of machine learning and simulation towards a hybrid modelling approach. Such a combination of data-based and knowledge-based modelling is motivated by applications that are partly based on causal relationships, while other effects result from hidden dependencies that are represented in huge amounts of data. Our aim is to bridge the knowledge gap between the two individual communities from machine learning and simulation to promote the development of hybrid systems. We present a conceptual framework that helps to identify potential combined approaches and employ it to give a structured overview of different types of combinations using exemplary approaches of simulation-assisted machine learning and machine-learning assisted simulation. We also discuss an advanced pairing in the context of Industry 4.0 where we see particular further potential for hybrid systems.

Bauckhage, Christian

Filters

Author

Organization

Subject

Has files

Type

Settings

Sort By

Results per page

Options

Bauckhage, Christian

Filters

Author

Organization

Subject

Has files

Type

Settings

Sort By

Results per page