Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Temporal tile shaping for spectral gap filling in audio transform coding in EVS

: Disch, Sascha; Neukam, Christian; Schmidt, Konstantin


Institute of Electrical and Electronics Engineers -IEEE-; IEEE Signal Processing Society:
IEEE 40th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2015. Proceedings. Vol.8 : 19-24 April 2015, Brisbane, Australia
Piscataway, NJ: IEEE, 2015
ISBN: 978-1-4673-6997-8
ISBN: 978-1-4673-6998-5
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) <40, 2015, Brisbane>
Fraunhofer IIS ()
Standards in der Audiocodierung; Parametrische Audio Synthese; MPEG Unified Speech and Audio Coder; Bandbreitenerweiterung; Audio-Synthese; Audio-Software; Audio Analyse; audio

At low bitrates, next generation audio coders apply waveform preserving transform coding only for the perceptually most relevant parts of the signal. The resulting spectral gaps are filled in the decoder through techniques like Intelligent Gap Filling (IGF). IGF is currently being standardized in MPEG-H 3D-Audio and also in 3GPP Enhanced Voice Service (EVS). In IGF processing, spectral tiles are copied from a spectral source location into a target location and subsequently adapted by parameter steered post-processing to best match relevant properties of the original signal. Important properties include the spectral and temporal envelope. Since IGF operates on Modified Discrete Cosine Transform (MDCT) spectra of rather long time blocks, temporal envelope shaping is not trivial. In this paper, Temporal Tile Shaping (TTS) is presented. TTS is based on linear prediction in the MDCT domain for shaping the temporal structure of the gap filling signal in the target tiles with subblock granularity. A listening test demonstrates the advantage of the proposed method.