Predicting the quality of processed speech by combining modulation-based features and model trees

Cauchi, B.; Goetze, S.; Naylor, P.A.; Doclo, Simon

2016

Conference Paper

Abstract

Many signal processing methods have been proposed to improve the quality of speech recorded in the presence of noise and reverberation. The evaluation of these methods either requires the use of perceptual measures, i.e. listening tests, or instrumental measures. Perceptual measures are typically more reliable but are quite costly and timeconsuming. On the other hand, instrumental measures may correlate poorly with the perceived speech quality. In this paper we propose to train an instrumental measure, combining modulation-based features and model trees, on the basis of perceptual scores obtained on a small corpus of speech data that has been processed by a combination of beamforming and spectral postfiltering. For evaluation purposes the resulting measure is then applied to a larger corpus. Results show that the use of model trees to train the predicting function of an instrumental measure increases its correlation with perceptual scores.

Author(s)

Cauchi, B.

Goetze, S.

Naylor, P.A.

Doclo, Simon

Hauptwerk

Speech Communication. 12. ITG-Fachtagung Sprachkommunikation 2016

Konferenz

Fachtagung Sprachkommunikation 2016

Options

Predicting the quality of processed speech by combining modulation-based features and model trees