• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. From Keyterms to Context: Exploring Topic Description Generation in Scientific Corpora
 
  • Details
  • Full
Options
2025
Conference Paper
Title

From Keyterms to Context: Exploring Topic Description Generation in Scientific Corpora

Abstract
Topic models represent topics as ranked term lists, which are often hard to interpret in scientific domains. We explore Topic Description for Scientific Corpora, an approach to generating structured summaries for topic-specific document sets. We propose and investigate two LLM-based pipelines: Selective Context Summarisation (SCS), which uses maximum marginal relevance to select representative documents; and Compressed Context Summarisation (CCS), a hierarchical approach that compresses document sets through iterative summarisation. We evaluate both methods using SUPERT and multi-model LLM-as-a-Judge across three topic modeling backbones and three scientific corpora. Our preliminary results suggest that SCS tends to outperform CCS in quality and robustness, while CCS shows potential advantages on larger topics. Our findings highlight interesting trade-offs between selective and compressed strategies for topic-level summarisation in scientific domains. We release code and data for two of the three datasets.
Author(s)
Achkar, Pierre  orcid-logo
Fraunhofer-Institut für System- und Innovationsforschung ISI  
Murugaboopathy, Satiyabooshan
Fraunhofer-Institut für System- und Innovationsforschung ISI  
Kreuter, Anne
Fraunhofer-Institut für System- und Innovationsforschung ISI  
Campbell, Yuri
Fraunhofer-Institut für System- und Innovationsforschung ISI  
Gollub, Tim
Bauhaus-Universität Weimar
Potthast, Martin
Universität Kassel  
Mainwork
Proceedings of The 5th New Frontiers in Summarization Workshop  
Conference
Workshop on New Frontiers in Summarization 2025  
Conference on Empirical Methods in Natural Language Processing 2025  
DOI
10.18653/v1/2025.newsum-main.8
Language
English
Fraunhofer-Institut für System- und Innovationsforschung ISI  
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024