Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

A robust speech/music discriminator for switched audio coding

: Fuchs, G.


European Association for Signal Processing -EURASIP-; Institute of Electrical and Electronics Engineers -IEEE-:
23rd European Signal Processing Conference, EUSIPCO 2015 : August 31 - September 4, 2015, Nice
Piscataway, NJ: IEEE, 2015
ISBN: 978-0-9928626-3-3
ISBN: 978-0-9928626-4-0
European Signal Processing Conference (EUSIPCO) <23, 2015, Nice>
Fraunhofer IIS ()

Switching between speech coding and generic audio coding schemes was recently proven to be very efficient for coding a large range of audio materials at low bit-rates. However, it strongly relies on a robust classification of the input signal. The aim of the paper is to design a reliable speech and music discriminator (SMD) for such an application. Main attention was laid on getting a good tradeoff between accuracy, reactivity and stability of the decision while keeping the delay and complexity reasonably low. To this end, short-term and long-term features are dissociated before being conveyed to two different classifiers. The two classifier outputs are combined in a final decision using a hysteresis. Objective measures show that a more reliable switching decision is achievable. The SMD was successfully implemented in MPEG Unified Speech and Audio Coding (USAC). It allows the codec to show unprecedented audio quality.