• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Benchmarking Neural Speech Codec Intelligibility with SITool
 
  • Details
  • Full
Options
2025
Conference Paper
Title

Benchmarking Neural Speech Codec Intelligibility with SITool

Abstract
Speech intelligibility assessment is essential for evaluating neural speech codecs, yet most evaluation efforts focus on overall quality rather than intelligibility. Only a few publicly available tools exist for conducting standardized intelligibility tests, like the Diagnostic Rhyme Test (DRT) and Modified Rhyme Test (MRT). We introduce the Speech Intelligibility Toolkit for Subjective Evaluation (SITool), a Flask-based web application for conducting DRT and MRT in laboratory and crowdsourcing settings. We use SITool to benchmark 13 neural and traditional speech codecs, analyzing phoneme-level degradations and comparing subjective DRT results with objective intelligibility metrics. Our findings show that, while neural speech codecs can outperform traditional ones in subjective intelligibility, only STOI and ESTOI - not WER - significantly correlate with subjective results, although they struggle to capture gender and wordlist-specific variations observed in subjective evaluations.
Author(s)
Leschanowsky, Anna Katharina  orcid-logo
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Kayyar Lakshminarayana, Kishor
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Rajasekhar, Anjana
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Behringer, Lyonel
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Kilinc, Ibrahim
Electrical and Computer Engineering Department
Fuchs, Guillaume  
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Habets, Emanuël Anco Peter
International Audio Laboratories Erlangen
Mainwork
Interspeech 2025  
Conference
International Speech Communication Association (INTERSPEECH Annual Conference) 2025  
DOI
10.21437/Interspeech.2025-984
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Keyword(s)
  • Rhyme Test

  • speech coding

  • speech intelligibility

  • subjective evaluation

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024