• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Cross-lingual acoustic modeling in upper sorbian - preliminary study
 
  • Details
  • Full
Options
2021
Conference Paper
Title

Cross-lingual acoustic modeling in upper sorbian - preliminary study

Abstract
In this paper, we present a preliminary study for acoustic modeling in Upper Sorbian, where a model of German was used in cross-lingual transfer learning. At first, we define the grapheme and phoneme inventories and map the target phonemes from the most similar German source equivalents. Phonetically balanced sentences for the recording prompts were selected from a combination of general and domain-specific textual data. The speech corpora with a total duration of around 11 hours was collected in controlled recording sessions involving an equal number of females, males, and children. The baseline acoustic model was employed to force-align the speech corpora given the knowledge-based phoneme mappings. The goodness of the mappings was evaluated by the phoneme confusions in free-phoneme recognition. The new derived data-driven model with a reduced phoneme set was included in the adaptation and evaluation along with the baseline acoustic model. The model adaptation performance was cross-validated with the ""Leave One Group Out"" strategy. We observed major improvements in phoneme error rates after adaptation for the knowledge-based and data-driven phoneme mappings. The study confirmed the feasibility of transfer learning for acoustic model adaptation in the case of Upper Sorbian, at the same time demonstrating practical usability with a small vocabulary speech recognition application (Smart Lamp).
Author(s)
Kraljevski, Ivan  
Fraunhofer-Institut für Keramische Technologien und Systeme IKTS  
Rjelka, Marek  
Fraunhofer-Institut für Keramische Technologien und Systeme IKTS  
Duckhorn, Frank  orcid-logo
Fraunhofer-Institut für Keramische Technologien und Systeme IKTS  
Tschöpe, Constanze  
Fraunhofer-Institut für Keramische Technologien und Systeme IKTS  
Wolff, Matthias
BTU Cottbus-Senftenberg
Mainwork
Elektronische Sprachsignalverarbeitung 2021  
Conference
Konferenz "Elektronische Sprachsignalverarbeitung" (ESSV) 2021  
File(s)
Download (157.24 KB)
Rights
Use according to copyright law
DOI
10.24406/publica-fhg-410375
Language
English
Fraunhofer-Institut für Keramische Technologien und Systeme IKTS  
Keyword(s)
  • acoustic modeling

  • speech recognition

  • Upper Sorbian

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024