2012
Conference Paper
Title

Potentials for ASR based on multiple acoustic models and model selection using standard speech features

Abstract
Acoustic modelling is a key issue for successful automatic speech recognition (ASR). ASR systems are usually adapted to a particular use case by training robust acoustic models on in-domain speech data recorded under conditions typical of that use case. Varying conditions therefore require either multi-conditional acoustic models or multiple acoustic models. In this work we present a multi-model approach that copes with varying acoustic conditions. For each utterance, the best-matching set of acoustic models is selected using the same acoustic features and acoustic models that are used for ASR. Our initial experiments show that this selection achieves results comparable to a manual selection of the acoustic models, but that it is still slightly outperformed by multi-conditional models with a comparable number of mixtures. We further show that an ideal selection would indeed improve on the results of multi-conditional models.
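The per-utterance selection described in the abstract can be illustrated with a minimal sketch. The abstract does not state the selection criterion, so the one used here is an assumption: each condition is represented by a single diagonal-covariance Gaussian mixture, and the model set with the highest mean frame log-likelihood for the utterance is chosen. All names (`frame_log_likelihood`, `select_model_set`, `MODEL_SETS`) and all numeric values are hypothetical and are not taken from the paper.

```python
import math
from typing import Dict, List, Sequence, Tuple

# One diagonal-covariance mixture component: (weight, means, variances).
Component = Tuple[float, Sequence[float], Sequence[float]]


def frame_log_likelihood(frame: Sequence[float], gmm: List[Component]) -> float:
    """Log-likelihood of one feature frame under a diagonal-covariance GMM."""
    log_terms = []
    for weight, means, variances in gmm:
        log_p = math.log(weight)
        for x, mu, var in zip(frame, means, variances):
            log_p += -0.5 * (math.log(2.0 * math.pi * var) + (x - mu) ** 2 / var)
        log_terms.append(log_p)
    # Log-sum-exp over the components for numerical stability.
    peak = max(log_terms)
    return peak + math.log(sum(math.exp(t - peak) for t in log_terms))


def select_model_set(
    frames: Sequence[Sequence[float]],
    model_sets: Dict[str, List[Component]],
) -> str:
    """Return the model set with the highest mean frame log-likelihood.

    Assumed criterion for illustration only; not the paper's stated method.
    """
    if not frames:
        raise ValueError("utterance contains no feature frames")
    scores = {
        name: sum(frame_log_likelihood(f, gmm) for f in frames) / len(frames)
        for name, gmm in model_sets.items()
    }
    return max(scores, key=scores.get)


# Toy two-dimensional "features"; real systems would use standard speech
# features with many more dimensions and mixture components.
MODEL_SETS: Dict[str, List[Component]] = {
    "clean": [(1.0, [0.0, 0.0], [1.0, 1.0])],
    "noisy": [
        (0.5, [3.0, 3.0], [1.0, 1.0]),
        (0.5, [4.0, 2.0], [2.0, 2.0]),
    ],
}

utterance = [[2.9, 3.1], [3.5, 2.4]]
chosen = select_model_set(utterance, MODEL_SETS)
```

In this sketch the decoder would then be run with the acoustic models belonging to `chosen`. Averaging over frames keeps scores comparable across utterances of different lengths.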
Author(s)
Winkler, Thomas
Stein, Daniel
Bardeli, Rolf
Schneider, Daniel
Köhler, Joachim
Published in
Sprachkommunikation 2012
Conference
Fachtagung Sprachkommunikation 2012
File(s)
001.pdf (140.4 KB)
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS
Tags
  • speech recognition
  • multi-model approach
  • multiconditional mode...
  • audio
