• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Enhancement of Coded Speech Using a Mask-Based Post-Filter
 
  • Details
  • Full
Options
2020
Conference Paper
Title

Enhancement of Coded Speech Using a Mask-Based Post-Filter

Abstract
The quality of speech codecs deteriorates at low bitrates due to high quantization noise. A post-filter is generally employed to enhance the quality of the coded speech. In this paper, a data-driven post-filter relying on masking in the time-frequency domain is proposed. A fully connected neural network (FCNN), a convolutional encoder-decoder (CED) network and a long short-term memory (LSTM) network are implemeted to estimate a real-valued mask per time-frequency bin. The proposed models were tested on the five lowest operating modes (6.65 kbps-15.85 kbps) of the Adaptive Multi-Rate Wideband codec (AMR-WB). Both objective and subjective evaluations confirm the enhancement of the coded speech and also show the superiority of the mask-based neural network system over a conventional heuristic post-filter used in the standard like ITU-T G.718.
Author(s)
Korse, S.
Gupta, K.
Fuchs, G.
Mainwork
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2020. Proceedings  
Conference
International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020  
Open Access
DOI
10.1109/ICASSP40776.2020.9053283
Additional link
Full text
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024