• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. A Data-Driven Cognitive Salience Model for Objective Perceptual Audio Quality Assessment
 
  • Details
  • Full
Options
2022
Conference Paper
Title

A Data-Driven Cognitive Salience Model for Objective Perceptual Audio Quality Assessment

Abstract
Objective audio quality measurement systems often use perceptual models to predict the subjective quality scores of processed signals, as reported in listening tests. Most systems map different metrics of perceived degradation into a single quality score predicting subjective quality. This requires a quality mapping stage that is informed by real listening test data using statistical learning (i. e., a data-driven approach) with distortion metrics as input features. However, the amount of reliable training data is limited in practice, and usually not sufficient for a comprehensive training of large learning models. Models of cognitive effects in objective systems can, however, improve the learning model. Specifically, considering the salience of certain distortion types, they provide additional features to the mapping stage that improve the learning process, especially for limited amounts of training data. We propose a novel data-driven salience model that informs the quality mapping stage by explicitly estimating the cognitive/degradation metric interactions using a salience measure. Systems incorporating the novel salience model are shown to outperform equivalent systems that only use statistical learning to combine cognitive and degradation metrics, as well as other well-known measurement systems, for a representative validation dataset.
Author(s)
Delgado, Pablo  
International Audio Laboratories Erlangen
Herre, Jürgen  
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Mainwork
IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022. Proceedings  
Conference
International Conference on Acoustics, Speech, and Signal Processing 2022  
Open Access
DOI
10.1109/ICASSP43922.2022.9747064
Additional full text version
Landing Page
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Keyword(s)
  • Cognitive Modeling

  • Objective Audio Quality Assessment

  • PEAQ

  • Psychoacoustics

  • ViSQOL

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024