Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Performance comparison of real-time single-channel speech dereverberation algorithms

: Xiong, F.; Meyer, B.T.; Cauchi, B.; Jukić, A.; Doclo, S.; Goetze, S.


Bellegarda, J. ; IEEE Signal Processing Society:
Hands-Free Speech Communications and Microphone Arrays, HSCMA 2017. Proceedings : March 1-3, 2017, San Francisco, California, U.S.A.
Piscataway, NJ: IEEE, 2017
ISBN: 978-1-5090-5925-6
ISBN: 978-1-5090-5924-9
ISBN: 978-1-5090-5926-3
Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA) <5, 2017, San Francisco/Calif.>
Fraunhofer IDMT ()

This paper investigates four single-channel speech dereverberation algorithms, i.e., two unsupervised approaches based on (i) spectral enhancement and (ii) linear prediction, as well as two supervised approaches relying on machine learning which incorporate deep neural networks to predict either (iii) the magnitude spectrogram or (iv) the ideal ratio mask. The relative merits of the four algorithms in terms of several objective measures, automatic speech recognition performance, robustness against noise, variations between simulated and recorded reverberant speech, computation time and latency are discussed. Experimental results show that all four algorithms are capable of providing benefits in reverberant environments even with moderate background noises. In addition, low complexity and latency indicate their potential for real-time applications.