Fraunhofer-Gesellschaft

Publica

Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Acoustic event classification using multi-resolution HMM

 
: Baggenstoss, Paul

:

Institute of Electrical and Electronics Engineers -IEEE-; IEEE Signal Processing Society; European Association for Speech, Signal and Image Processing -EURASIP-:
26th European Signal Processing Conference, EUSIPCO 2018 : 3-7 September 2018, Roma, Italy
Piscataway, NJ: IEEE, 2018
ISBN: 978-9-0827-9701-5
ISBN: 978-90-827970-0-8
ISBN: 978-1-5386-3736-4
S.972-976
European Signal Processing Conference (EUSIPCO) <26, 2018, Roma>
Englisch
Konferenzbeitrag
Fraunhofer FKIE ()

Abstract
Real-world acoustic events span a wide range of time and frequency resolutions, from short clicks to longer tonals. This is a challenge for the hidden Markov model (HMM), which uses a fixed segmentation and feature extraction, forcing a compromise between time and frequency resolution. The multiresolution HMM (MR-HMM) is an extension of the HMM that assumes not only an underlying (hidden) random state sequence, but also an underlying random segmentation, with segments spanning a wide range of sizes and processed using a variety of feature extraction methods. It is shown that the MR-HMM alone, as an acoustic event classifier, has performance comparable to state of the art discriminative classifiers on three open data sets. However, as a generative classifier, the MR-HMM models the underlying data generation process and can generate synthetic data, allowing weaknesses of individual class models to be discovered and corrected. To demonstrate this point, the MR-HMM is combined with auxiliary features that capture temporal information, resulting in significantly improved performance.

: http://publica.fraunhofer.de/dokumente/N-524017.html