Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Amplitude modulation filters as feature sets for robust ASR: Constant absolute or relative bandwidth?

: Moritz, N.; Anemüller, J.; Kollmeier, B.

International Speech Communication Association -ISCA-:
13th Annual Conference of the International Speech Communication Association, INTERSPEECH 2012. Vol.2 : Portland, Oregon, USA, 9 - 13 September 2012
Red Hook, NY: Curran, 2013
ISBN: 978-1-62276-759-5
International Speech Communication Association (Annual Conference INTERSPEECH) <13, 2012, Portland/Or.>
Conference Paper
Fraunhofer IDMT ()

Many research efforts in the field of feature extraction for automatic speech recognition are focused on analyzing slow amplitude fluctuations of speech. In this study the importance of spectral and temporal resolution for the amplitude modulation frequency analysis are investigated in order to provide guidance for the appropriate filter design. Therefore, different wavelet and Fourier transform like filter time scales are examined, i.e. The importance of time and frequency separation is compared. The results demonstrate that analyzing three separate amplitude modulation frequency bands of constant absolute bandwidth that cover the range from about 2 to 16 Hz are sufficient for automatic speech recognition.