
Publica
Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten. Blind estimation of reverberation time based on spectro-temporal modulation filtering
| IEEE Signal Processing Society; Institute of Electrical and Electronics Engineers -IEEE-: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013. Proceedings. Vol.1 : Vancouver, British Columbia, Canada, 26 - 31 May 2013 Piscataway, NJ: IEEE, 2013 ISBN: 978-1-4799-0357-3 ISBN: 978-1-4799-0356-6 pp.443-447 |
| International Conference on Acoustics, Speech, and Signal Processing (ICASSP) <38, 2013, Vancouver> |
|
| English |
| Conference Paper |
| Fraunhofer IDMT () |
Abstract
A novelmethod for blind estimation of the reverberation time (RT60) is proposed based on applying spectro-temporal modulation filters to time-frequency representations. 2D-Gabor filters arranged in a filterbank enable an analysis of the properties of temporal, spectral, and spectro-temporal filtering for this task. Features are used as input to a multi-layer perceptron (MLP) classifier combined with a simple decision rule that attributes a specific RT60 to a given utterance and allows to assess the reliability of the approach for different resolutions of RT60 classification. While the filter set including temporal, spectral, and spectro-temporal filters already outperforms an MFCC baseline, the error rates are further reduced when relying on diagonal spectro-temporal filters alone. The average error rate is 1.9% for the best feature set, which corresponds to a relative reduction of 58.3% compared to the MFCC baseline for RT60s in 0.1 s resolution.