2015
Conference Paper
Title

A CHiME-3 challenge system: Long-term acoustic features for noise robust automatic speech recognition

Abstract
The paper describes an automatic speech recognition (ASR) system for the 3rd CHiME challenge, which addresses noisy acoustic scenes in public environments. The proposed system comprises a multi-channel speech enhancement front-end with a microphone channel failure detection method that is based on cross-comparing the modulation spectra of speech to detect erroneous microphone recordings. The main focus of the submission is the investigation of the amplitude modulation filter bank (AMFB) as a method for extracting long-term acoustic cues prior to a Gaussian mixture model (GMM) or deep neural network (DNN) based ASR classifier. AMFB features are shown to outperform the commonly used frame splicing of filter bank features, even on a performance-optimized ASR challenge system. That is, temporal analysis of speech by hand-crafted, auditory-motivated AMFBs is shown to be more robust than a data-driven method that extracts temporal dynamics with a DNN.
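To make the two kinds of temporal context in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes filter bank features stored as a NumPy array of shape (frames, bands) at a 100 Hz frame rate. The function names, the Hann-windowed cosine kernel, and all parameter values are illustrative assumptions; the actual AMFB filter design is specified in the paper itself.

```python
import numpy as np


def splice_frames(feats, context):
    """Frame splicing: stack each frame with `context` neighbours on each side.

    Edges are padded by repeating the first and last frame.
    """
    n_frames = feats.shape[0]
    padded = np.pad(feats, ((context, context), (0, 0)), mode="edge")
    # Slice i holds, for every frame t, the original frame t - context + i.
    return np.concatenate(
        [padded[i:i + n_frames] for i in range(2 * context + 1)], axis=1
    )


def modulation_filter(feats, mod_freq_hz, frame_rate_hz=100.0, length=51):
    """Filter each band's time trajectory with a Hann-windowed cosine.

    Illustrative stand-in for one modulation filter (hypothetical design,
    not the AMFB from the paper). `length` must be odd.
    """
    if length % 2 == 0:
        raise ValueError("length must be odd")
    half = (length - 1) // 2
    n = np.arange(length) - half
    window = np.hanning(length)
    kernel = window * np.cos(2 * np.pi * mod_freq_hz * n / frame_rate_hz)
    if mod_freq_hz > 0:
        # Remove the DC component so constant trajectories map to zero.
        kernel = kernel - window * (kernel.sum() / window.sum())
    kernel = kernel / np.abs(kernel).sum()
    padded = np.pad(feats, ((half, half), (0, 0)), mode="edge")
    # "valid" convolution on the padded signal keeps the number of frames.
    return np.stack(
        [np.convolve(padded[:, b], kernel, mode="valid")
         for b in range(feats.shape[1])],
        axis=1,
    )
```

Splicing hands the classifier a short stack of raw neighbouring frames and leaves it to learn the temporal dynamics, whereas a modulation filter summarizes a longer window (here 51 frames, about half a second under the assumed frame rate) with a fixed, hand-designed kernel.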
Author(s)
Moritz, N.
Gerlach, S.
Adiloglu, K.
Anemüller, J.
Goetze, S.
Kollmeier, Birger  
Mainwork
IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2015. Proceedings  
Conference
Workshop on Automatic Speech Recognition and Understanding (ASRU) 2015  
DOI
10.1109/ASRU.2015.7404832
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  