• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Efficient MDCT-Based Multi-Channel Coding with Perceptual Whitening and Broadband ILD Compensation
 
  • Details
  • Full
Options
2025
Conference Paper
Title

Efficient MDCT-Based Multi-Channel Coding with Perceptual Whitening and Broadband ILD Compensation

Abstract
Stereo and multi-channel audio coding for mobile communication applications is challenging due to bitrate, latency and computational complexity constraints. To address these constraints, a novel MDCT-based stereo coding scheme is introduced. A key feature of this scheme is the broadband inter-channel level compensation of the spectrally and temporally whitened channels, followed by a robust mechanism for selecting between Mid/Side and Left/Right coding for each frequency sub-band. For lower bitrates, inter-channel time and phase difference compensation methods are additionally utilized. Enhancements comprising stereo coding of spectral and temporal noise shaping parameters and stereo-aware bandwidth extension further increase coding efficiency. The paper outlines the fundamental concepts of this MDCTbased stereo coding incorporated into the newly introduced 3GPP IVAS codec and their extension to multi-channel coding. Finally, the IVAS codec is compared to state-of-the-art counterparts, demonstrating its advantages in communication scenarios across a wide range of content types.
Author(s)
Markovic, Goran  
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Fotopoulou, Eleni
DSP Solutions GmbH & Co. KG
Kiene, Jan Frederik
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Helmrich, Christian  
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Mainwork
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025. Proceedings  
Conference
International Conference on Acoustics, Speech and Signal Processing 2025  
DOI
10.1109/ICASSP49660.2025.10889614
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Keyword(s)
  • Immersive Voice and Audio Services (IVAS)

  • inter-channel level difference (ILD)

  • Modified Discrete Cosine Transform (MDCT)

  • multichannel coding tool (MCT)

  • stereo coding

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024