June 20, 2022
Conference Paper
Title

Word Class Based Language Modeling: A Case of Upper Sorbian

Abstract
In this paper, we show how word-class-based language modeling can support the integration of a small language into modern speech technology applications. The methods described can be applied to any language; we demonstrate them on Upper Sorbian. The word classes model the semantic expressions of numerals, dates, and times of day. The grammars were implemented as finite-state transducers (FSTs) and minimalist grammars (MGs). We demonstrate the FSTs in practice in a simple smart-home speech application that can set wake-up alarms and appointments expressed in a variety of spontaneous, natural sentences. Although the MGs have not yet been integrated into an application for practical use, they provide evidence that MGs could work more efficiently than FSTs in applications built on them. In particular, MGs can work with a significantly smaller lexicon, since their more complex structure lets them generate more expressions with fewer items while still avoiding incorrect expressions.
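The idea that a structured word class can cover many expressions with few lexical items, while still rejecting ill-formed ones, can be illustrated with a toy example. The sketch below is not the authors' FST or MG implementation, and it uses English numerals from 0 to 99 rather than Upper Sorbian. The tables `UNITS`, `TEENS`, `TENS` and the functions `numeral_to_value` and `covered_values` are hypothetical names introduced here for illustration only.

```python
from itertools import product

# Toy lexicon for a "numeral" word class (English, illustration only).
UNITS = dict(zip("one two three four five six seven eight nine".split(),
                 range(1, 10)))
TEENS = dict(zip(("ten eleven twelve thirteen fourteen fifteen sixteen "
                  "seventeen eighteen nineteen").split(), range(10, 20)))
TENS = dict(zip("twenty thirty forty fifty sixty seventy eighty ninety".split(),
                range(20, 100, 10)))

VOCAB = ["zero", *UNITS, *TEENS, *TENS]
LEXICON_SIZE = len(VOCAB)  # 28 lexical items


def numeral_to_value(words):
    """Map a token sequence to its integer meaning, or None if rejected."""
    if len(words) == 1:
        word = words[0]
        if word == "zero":
            return 0
        for table in (UNITS, TEENS, TENS):
            if word in table:
                return table[word]
        return None
    # The only accepted two-word pattern is TENS followed by UNITS.
    if len(words) == 2 and words[0] in TENS and words[1] in UNITS:
        return TENS[words[0]] + UNITS[words[1]]
    return None


def covered_values():
    """Collect the meanings of all accepted one- and two-word sequences."""
    values = set()
    for length in (1, 2):
        for seq in product(VOCAB, repeat=length):
            value = numeral_to_value(list(seq))
            if value is not None:
                values.add(value)
    return values


if __name__ == "__main__":
    print(LEXICON_SIZE)                            # 28
    print(len(covered_values()))                   # 100
    print(numeral_to_value(["twenty", "three"]))   # 23
    print(numeral_to_value(["three", "twenty"]))   # None
```

In this toy setting, 28 lexical items plus one combination rule cover all 100 values, whereas a flat list would need one entry per expression. Word order such as "three twenty" is rejected rather than assigned a meaning.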
Author(s)
Maier, Isidor
BTU Cottbus-Senftenberg, Chair of Communications Engineering
Kuhn, Johannes Ferdinand Joachim
BTU Cottbus-Senftenberg, Chair of Communications Engineering
Duckhorn, Frank
Fraunhofer-Institut für Keramische Technologien und Systeme IKTS  
Kraljevski, Ivan  
Fraunhofer-Institut für Keramische Technologien und Systeme IKTS  
Sobe, Daniel
Foundation for the Sorbian People, Bautzen, Germany
Wolff, Matthias
BTU Cottbus-Senftenberg, Chair of Communications Engineering
Tschöpe, Constanze  
Fraunhofer-Institut für Keramische Technologien und Systeme IKTS  
Mainwork
Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia, EURALI 2022. Proceedings  
Conference
Workshop on Resources and Technologies for Indigenous, Endangered and Lesser-resourced Languages in Eurasia 2022  
Language Resources and Evaluation Conference 2022  
Language
English
Keyword(s)
  • word classes
  • minimalist grammar
  • language modeling
  • speech recognition
  • Upper Sorbian