• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Potentials for ASR based on multiple acoustic models and model selection using standard speech features
 
  • Details
  • Full
Options
2012
Conference Paper
Title

Potentials for ASR based on multiple acoustic models and model selection using standard speech features

Abstract
Acoustic modelling is a key issue for successful automatic speech recognition (ASR). Common ASR systems are usually adapted to a certain use case by training robust acoustic models on speech data from the domain recorded in conditions typical for the use case. Varying conditions thus need either multi-conditional or multiple acoustic models. We present a multi-model approach coping with various acoustic conditions in this work. For each utterance the best matching set of acoustic models is selected based on acoustic information of the same acoustic features and acoustic models used for ASR. Our initial experiments show, that we achieve results comparable to a manual selection of the acoustic models but that we are still slightly outperformed by multiconditional models with a comparable number of mixtures. We further show, that an ideal selection would indeed improve the results compared to multi-conditional models.
Author(s)
Winkler, Thomas  
Stein, Daniel  
Bardeli, Rolf  
Schneider, Daniel  
Köhler, Joachim  
Mainwork
Sprachkommunikation 2012  
Conference
Fachtagung Sprachkommunikation 2012  
File(s)
Download (140.4 KB)
Rights
Use according to copyright law
DOI
10.24406/publica-fhg-377544
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • speech recognition

  • multi-model approach

  • multiconditional model

  • audio

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024