• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. CC-Top: Constrained Clustering for Dynamic Topic Discovery
 
  • Details
  • Full
Options
2022
Conference Paper
Title

CC-Top: Constrained Clustering for Dynamic Topic Discovery

Abstract
Research on multi-class text classification of short texts mainly focuses on supervised (transfer) learning approaches, requiring a finite set of pre-defined classes which is constant over time. This work explores deep constrained clustering (CC) as an alternative to supervised learning approaches in a setting with a dynamically changing number of classes, a task we introduce as dynamic topic discovery (DTD). We do so by using pairwise similarity constraints instead of instance-level class labels which allow for a flexible number of classes while exhibiting a competitive performance compared to supervised approaches. First, we substantiate this through a series of experiments and show that CC algorithms exhibit a predictive performance similar to state-of-the-art supervised learning algorithms while requiring less annotation effort. Second, we demonstrate the overclustering capabilities of deep CC for detecting topics in short text data sets in the absence of the ground truth class cardinality during model training. Third, we showcase how these capabilities can be leveraged for the DTD setting as a step towards dynamic learning over time. Finally, we release our codebase to nurture further research in this area.
Author(s)
Goschenhofer, Jann
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Ragupathy, Pranav
Ludwig-Maximilians-Universität München
Heumann, Christian
Ludwig-Maximilians-Universität München
Bischl, Bernd
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Assenmacher, Matthias
Ludwig-Maximilians-Universität München
Mainwork
Evonlp 2022 1st Workshop on Ever Evolving Nlp Proceedings of the Workshop
Funder
Deutsche Forschungsgemeinschaft  
Conference
1st Workshop on Ever Evolving NLP, EvoNLP 2022, co-located with EMNLP 2022
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024