2011
Conference Paper
Title

Amplitude modulation spectrogram based features for robust speech recognition in noisy and reverberant environments

Abstract
In this contribution we present a feature extraction method that relies on the modulation-spectral analysis of amplitude fluctuations within sub-bands of the acoustic spectrum by means of a short-time Fourier transform (STFT). The experimental results indicate that the optimal temporal extent of the filter for amplitude modulation analysis is around 310 ms. It is also demonstrated that the phase information of the modulation spectrum contains important cues for speech recognition. In this context, the advantage of an odd analysis basis function is considered. The best features presented achieved a total relative improvement of 53.5% for clean-condition training on Aurora-2. Furthermore, it is shown that modulation features are more robust against room reverberation than conventional cepstral and dynamic features, and that they benefit strongly from a high early-to-late energy ratio of the characteristic room impulse response (RIR).
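The general idea of modulation-spectral analysis described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the frame length, hop size, Hann windows, mean removal, test signal, and the function names hann, stft_magnitude and modulation_component are all assumptions chosen for illustration. Only the analysis window of about 310 ms and the notion of even and odd basis functions come from the abstract. The sketch computes sub-band magnitude envelopes with a naive STFT, then projects one envelope segment onto a cosine (even) and a sine (odd) basis function centred on the segment, which yields both magnitude and phase of one modulation component.

```python
import cmath
import math


def hann(length):
    """Symmetric Hann window of the given length."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * n / (length - 1))
            for n in range(length)]


def stft_magnitude(signal, frame_len, hop):
    """Magnitude STFT: one list of sub-band magnitudes per frame."""
    window = hann(frame_len)
    n_bins = frame_len // 2 + 1
    # Naive DFT kernels keep the sketch dependency-free; an FFT would
    # normally be used instead.
    kernels = [[cmath.exp(-2j * math.pi * k * n / frame_len)
                for n in range(frame_len)] for k in range(n_bins)]
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        seg = [signal[start + n] * window[n] for n in range(frame_len)]
        frames.append([abs(sum(s * c for s, c in zip(seg, kernel)))
                       for kernel in kernels])
    return frames


def modulation_component(envelope, frame_rate, mod_freq_hz, window_s=0.31):
    """Project the first window_s seconds of a sub-band envelope onto an
    even (cosine) and an odd (sine) basis function at mod_freq_hz."""
    length = int(round(window_s * frame_rate))
    if len(envelope) < length:
        raise ValueError("envelope is shorter than the analysis window")
    segment = envelope[:length]
    mean = sum(segment) / length  # assumption: remove the DC part first
    window = hann(length)
    even = 0.0
    odd = 0.0
    for n in range(length):
        # Time relative to the window centre, so that the cosine is an
        # even and the sine an odd function within the window.
        t = (n - (length - 1) / 2.0) / frame_rate
        value = (segment[n] - mean) * window[n]
        even += value * math.cos(2.0 * math.pi * mod_freq_hz * t)
        odd += value * math.sin(2.0 * math.pi * mod_freq_hz * t)
    return even, odd


# Hypothetical test signal: 1 kHz carrier, amplitude-modulated at 16 Hz.
fs, fc, fm = 8000, 1000.0, 16.0
signal = [(1.0 + 0.8 * math.cos(2.0 * math.pi * fm * n / fs))
          * math.sin(2.0 * math.pi * fc * n / fs) for n in range(fs)]

frame_len, hop = 64, 32
frames = stft_magnitude(signal, frame_len, hop)
frame_rate = fs / hop                      # envelope sampling rate in Hz
band = int(round(fc * frame_len / fs))     # STFT bin containing the carrier
envelope = [frame[band] for frame in frames]

even, odd = modulation_component(envelope, frame_rate, fm)
magnitude = math.hypot(even, odd)
phase = math.atan2(odd, even)
print("16 Hz modulation: magnitude %.1f, phase %.2f rad" % (magnitude, phase))
```

Keeping the even and odd projections separate, rather than only their combined magnitude, is what preserves the phase information of the modulation spectrum in this sketch.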
Author(s)
Moritz, N.
Anemüller, J.
Kollmeier, Birger  
Mainwork
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2011. Vol. 7
Conference
International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2011  
DOI
10.1109/ICASSP.2011.5947602
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  