• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Exploring Curriculum Learning for Languages: Lessons from Regular Language Tasks
 
  • Details
  • Full
Options
2025
Conference Paper
Title

Exploring Curriculum Learning for Languages: Lessons from Regular Language Tasks

Abstract
Despite its intuitive appeal, the effectiveness of data-level curriculum learning (CL) remains debated, mainly due to the absence of unambiguous notions of sample difficulty in real-world tasks. As a step towards a better understanding of the effective use of different curriculum strategies in natural language learning, we study CL in the context of regular languages, where both ground truth and sample difficulty can be precisely defined using deterministic finite automata. We consider two natural measures of difficulty: a data-driven metric based on input length and a task-specific metric derived from the automaton’s structure. Training RNNs and LSTMs across ten regular language classification tasks, we find that CL is not just beneficial but, in some cases, essential for generalisation. Surprisingly, straightforward data-driven curricula outperform more complex task-specific strategies, with the most successful approaches oversampling the shorter lengths early in training.
Author(s)
Toborek, Vanessa
Universität Bonn
Seiffarth, Florian
Universität Bonn
Müller, Sebastian
Universität Bonn
Horvath, Tamas  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Bauckhage, Christian  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Mainwork
Discovery Science. 28th International Conference, DS 2025. Proceedings  
Conference
International Conference on Discovery Science 2025  
DOI
10.1007/978-3-032-05461-6_37
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • curriculum learning

  • data difficulty

  • regular languages

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024