Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

An expectation-maximization algorithm for multimicrophone speech dereverberation and noise reduction with coherence matrix estimation

: Schwartz, O.; Gannot, S.; Habets, E.A.P.


IEEE ACM transactions on audio, speech, and language processing 24 (2016), No.9, pp.1495-1510
ISSN: 2329-9290
ISSN: 2329-9304
Journal Article
Fraunhofer IIS ()

In speech communication systems, the microphone signals are degraded by reverberation and ambient noise. The reverberant speech can be separated into two components, namely, an early speech component that consists of the direct path and some early reflections and a late reverberant component that consists of all late reflections. In this paper, a novel algorithm to simultaneously suppress early reflections, late reverberation, and ambient noise is presented. The expectation-maximization (EM) algorithm is used to estimate the signals and spatial parameters of the early speech component and the late reverberation components. As a result, a spatially filtered version of the early speech component is estimated in the E-step. The power spectral density (PSD) of the anechoic speech, the relative early transfer functions, and the PSD matrix of the late reverberation are estimated in the M-step of the EM algorithm. The algorithm is evaluated using real room impulse response recorded in our acoustic lab with a reverberation time set to 0.36 s and 0.61 s and several signal-to-noise ratio levels. It is shown that significant improvement is obtained and that the proposed algorithm outperforms baseline single-channel and multichannel dereverberation algorithms, as well as a state-of-the-art multichannel dereverberation algorithm.