Performance comparison of real-time single-channel speech dereverberation algorithms

Xiong, F.; Meyer, B.T.; Cauchi, B.; Jukic, A.; Goetze, S.; Doclo, Simon

doi:10.1109/HSCMA.2017.7895575

2017

Conference Paper

Abstract

This paper investigates four single-channel speech dereverberation algorithms, i.e., two unsupervised approaches based on (i) spectral enhancement and (ii) linear prediction, as well as two supervised approaches relying on machine learning which incorporate deep neural networks to predict either (iii) the magnitude spectrogram or (iv) the ideal ratio mask. The relative merits of the four algorithms in terms of several objective measures, automatic speech recognition performance, robustness against noise, variations between simulated and recorded reverberant speech, computation time and latency are discussed. Experimental results show that all four algorithms are capable of providing benefits in reverberant environments even with moderate background noises. In addition, low complexity and latency indicate their potential for real-time applications.

Author(s)

Xiong, F.

Meyer, B.T.

Cauchi, B.

Jukic, A.

Goetze, S.

Doclo, Simon

Mainwork

Hands-Free Speech Communications and Microphone Arrays, HSCMA 2017. Proceedings

Conference

Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA) 2017

Options

Performance comparison of real-time single-channel speech dereverberation algorithms