• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Pointer-Guided Pre-training: Infusing Large Language Models with Paragraph-Level Contextual Awareness
 
  • Details
  • Full
Options
August 22, 2024
Conference Paper
Title

Pointer-Guided Pre-training: Infusing Large Language Models with Paragraph-Level Contextual Awareness

Abstract
We introduce “pointer-guided segment ordering” (SO), a novel pre-training technique aimed at enhancing the contextual understanding of paragraph-level text representations in large language models. Our methodology leverages a self-attention-driven pointer network to restore the original sequence of shuffled text segments, addressing the challenge of capturing the structural coherence and contextual dependencies within documents. This pre-training approach is complemented by a fine-tuning methodology that incorporates dynamic sampling, augmenting the diversity of training instances and improving sample efficiency for various downstream applications. We evaluate our method on a diverse set of datasets, demonstrating its efficacy in tasks requiring sequential text classification across scientific literature and financial reporting domains. Our experiments show that pointer-guided pre-training significantly enhances the model’s ability to understand complex document structures, leading to state-of-the-art performance in downstream classification tasks.
Author(s)
Hillebrand, Lars Patrick  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Pradhan, Prabhupad
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Sifa, Rafet  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Bauckhage, Christian  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Mainwork
Machine Learning and Knowledge Discovery in Databases. Research Track. European Conference, ECML PKDD 2024. Proceedings, Part IV  
Conference
European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases 2024  
DOI
10.1007/978-3-031-70359-1_23
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024