2019
Conference Paper
Title

Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews

Abstract
In automatic speech recognition, often little training data is available for specific challenging tasks, but training of state-of-the-art automatic speech recognition systems requires large amounts of annotated speech. To address this issue, we propose a two-staged approach to acoustic modeling that combines noise and reverberation data augmentation with transfer learning to robustly address challenges such as difficult acoustic recording conditions, spontaneous speech, and speech of elderly people. We evaluate our approach using the example of German oral history interviews, where a relative average reduction of the word error rate by 19.3% is achieved.
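The reported figure is a relative reduction of the word error rate (WER), not an absolute one. As a minimal sketch of how such a number is typically computed, the following assumes the standard WER definition (word-level edit distance divided by the number of reference words). The function names `word_error_rate` and `relative_reduction`, and all example values, are hypothetical illustrations and are not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # Levenshtein distance over words, keeping only the previous row.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline_wer: float, adapted_wer: float) -> float:
    """Relative WER reduction of an adapted system with respect to a baseline."""
    if baseline_wer <= 0:
        raise ValueError("baseline WER must be positive")
    return (baseline_wer - adapted_wer) / baseline_wer


# Hypothetical values for illustration only (not results from the paper).
example_wer = word_error_rate("a b c d", "a x c")
example_gain = relative_reduction(0.30, 0.24)
print(f"WER: {example_wer:.2f}, relative reduction: {example_gain:.1%}")
```

An average relative reduction over several test sets, as the abstract describes, would presumably apply `relative_reduction` per test set and then average the results. That averaging order is an assumption here, not something the abstract specifies.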
Author(s)
Gref, Michael  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Schmidt, Christoph Andreas  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Behnke, Sven  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Köhler, Joachim  
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Mainwork
IEEE International Conference on Multimedia and Expo, ICME 2019. Proceedings  
Project(s)
KA3 Kölner Zentrum für Analyse und Archivierung audiovisueller Daten
Funder
Bundesministerium für Bildung und Forschung BMBF (Deutschland)  
Conference
International Conference on Multimedia and Expo (ICME) 2019  
Open Access
DOI
10.1109/ICME.2019.00142
Language
English
Institute(s)
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • acoustic signal processing

  • learning (artificial intelligence)

  • speech recognition

  • spontaneous speech

  • german oral history interview

  • acoustic modeling adaption

  • robust speech recognition

  • annotated speech

  • reverberation data augmentation

  • automatic speech recognition system

  • acoustic recording condition

  • elderly people speech

  • word error rate

  • transfer learning

  • training

  • training data

  • history

  • data model

  • reverberation

  • adaptation model

  • domain adaption

  • multi-condition training

  • data augmentation

  • oral history
