August 16, 2021
Conference Paper
Title

ALiBERT: Improved automated list inspection (ALI) with BERT

Abstract
We consider Automated List Inspection (ALI), a content-based text recommendation system that assists auditors in matching relevant text passages from the notes of financial statements to specific legal requirements. ALI follows a ranking paradigm in which a fixed number of requirements per text passage is shown to the user. Although ALI achieves impressive ranking performance, the user experience can still be improved by showing a dynamic number of recommendations. In addition, existing models rely on a feature-based language model that needs to be pre-trained on a large domain-specific corpus. Moreover, they cannot be trained end-to-end, that is, by optimizing them jointly with the language model parameters. In this work, we alleviate these concerns by considering a multi-label classification approach that predicts dynamic requirement sequences. We base our model on a pre-trained BERT, which allows us to fine-tune the whole model end-to-end and thereby avoids the need to train a language representation model. We conclude by presenting a detailed evaluation of the proposed model on two German financial datasets.
Author(s)
Ramamurthy, Rajkumar  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Pielka, Maren  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Stenzel, Marc Robin
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Bauckhage, Christian  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Sifa, Rafet  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Khameneh, Tim Dilmaghani
PricewaterhouseCoopers GmbH
Warning, Ulrich
PricewaterhouseCoopers GmbH
Kliem, Bernd
PricewaterhouseCoopers GmbH
Loitz, Rüdiger
PricewaterhouseCoopers GmbH
Mainwork
DocEng 2021, ACM Symposium on Document Engineering. Proceedings  
Conference
Symposium on Document Engineering (DocEng) 2021  
DOI
10.1145/3469096.3474928
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • Neural Networks
  • Text Classification
  • Natural Language Processing
