Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

A Multichannel Feature Compensation Approach for Robust ASR in Noisy and Reverberant Environments

: Astudillo, R.F.; Braun, S.; Habets, E.

Volltext ()
Präsentation ()

REVERB Challenge Workshop 2014, REverberant Voice Enhancement and Recognition Benchmark. Online resource : 10 May 2014, Florence, Italy
Florence, 2014
8 S.
"REverberant Voice Enhancement and Recognition Benchmark" Challenge Workshop (REVERB) <2014, Florence>
Konferenzbeitrag, Elektronische Publikation
Fraunhofer IIS ()

In this paper we propose a multichannel feature compensation approach for automatic speech recognition in reverberant and noisy environments. The proposed technique propagates the posterior of the clean signal estimated by a multichannel Wiener filter in short-time Fourier transform (STFT) domain
into Mel-frequency cepstrum coefficients (MFCC) domain. The multichannelWiener filter reduces both reverberation and additive noise. Furthermore, we approximate the propagation of the prior distributions of speech and interference through the inverse STFT and the STFT with different time-frequency resolutions. This allows us to derive a multichannel minimum
mean square error MFCC estimator with an STFT resolution that is different from the resolution in the speech enhancement stage. The proposed approach is able to outperform a multichannel short-time spectral amplitude estimation approach on both the clean training and multi-condition training ASR tasks
of the REVERB challenge.