
Advanced time shrinking using a drop classifier based on codec features

Authors: Issing, J.; Färber, N.; German, R.


International Speech Communication Association -ISCA-:
Speech beyond speech: Towards a better understanding of the most important biosignal. Vol. 1: 16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015); Dresden, Germany, 6-10 September 2015
Red Hook, NY: Curran, 2015
ISBN: 978-1-5108-1790-6
International Speech Communication Association (Interspeech Annual Conference) <16, 2015, Dresden>
Conference paper, electronic publication
Fraunhofer IIS

We present an integrated approach to full-band audio time-scale modification for Voice over IP communication. The concept is based on a low-complexity adaptive playout method that uses frame dropping for time shrinking and audio concealment for time stretching. We improve the existing version of this method with a classifier that helps choose which audio frames can be dropped with the least subjective impact on audio quality. To maintain low complexity, we exclusively use audio signal features that are already available in the audio codec. Classifying audio frames improves the audio quality of the existing method, which uses no classification, by 0.5 Mean Opinion Score points while requiring significantly less computational complexity, by a factor of ca. 104.
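
The idea of choosing which frames to drop can be illustrated with a minimal sketch. The abstract names neither the codec features nor the classifier, so everything below is an assumption made for illustration: the fields `energy_db` and `is_transient`, the cost heuristic in `drop_cost`, and the `min_gap` spacing rule are hypothetical and are not the authors' method.

```python
from dataclasses import dataclass


@dataclass
class Frame:
    index: int
    energy_db: float    # hypothetical level estimate taken from the codec
    is_transient: bool  # hypothetical transient flag taken from the codec


def drop_cost(frame: Frame) -> float:
    """Illustrative heuristic: a lower cost means the drop is assumed less audible."""
    cost = frame.energy_db
    if frame.is_transient:
        cost += 100.0  # strongly discourage dropping transient frames
    return cost


def choose_frames_to_drop(frames, n_drop, min_gap=2):
    """Pick up to n_drop frame indices with the lowest cost.

    Chosen frames are kept at least min_gap indices apart so that
    drops are not clustered (an assumed constraint, not from the paper).
    """
    chosen = []
    for frame in sorted(frames, key=drop_cost):
        if len(chosen) == n_drop:
            break
        if all(abs(frame.index - c) >= min_gap for c in chosen):
            chosen.append(frame.index)
    return sorted(chosen)


frames = [
    Frame(0, -60.0, False),
    Frame(1, -58.0, False),
    Frame(2, -20.0, True),
    Frame(3, -55.0, False),
    Frame(4, -10.0, False),
]
to_drop = choose_frames_to_drop(frames, n_drop=2)
```

In this toy input the two quietest frames are adjacent, so the spacing rule skips the second one and selects the next-quietest frame instead. Because the per-frame values are assumed to come from the codec, the selection itself needs only a sort and a few comparisons.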