• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Human and Machine Performance in Counting Sound Classes in Single-Channel Soundscapes
 
  • Details
  • Full
Options
2023
Journal Article
Title

Human and Machine Performance in Counting Sound Classes in Single-Channel Soundscapes

Abstract
Individual sounds are difficult to detect in complex soundscapes because of a strong overlap. This article explores the task of estimating sound polyphony, which is defined here as the number of audible sound classes. Sound polyphony measures the complexity of a soundscape and can be used to inform sound classification algorithms. First, a listening test is performed to assess the difficulty of the task. The results show that humans are only able to reliably count up to three simultaneous sound sources and that they underestimate the degree of polyphony for more complex soundscapes. Human performance depends mainly on the spectral characteristics of the sounds and, in particular, on the number of overlapping noise-like and transient sounds. In a second step, four deep neural network architectures, including an object detection approach for natural images, are compared to contrast human performance with machine learning-based approaches. The results show that machine listening systems can outperform human listeners for the task at hand. Based on these results, an implicit modeling of the sound polyphony based on the number of previously detected sound classes seems less promising than the explicit modeling strategy.
Author(s)
Abeßer, Jakob  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Ullah, Asad
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Ziegler, Sebastian
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Grollmisch, Sascha  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Journal
Journal of the Audio Engineering Society  
DOI
10.17743/jaes.2022.0106
Additional full text version
Landing Page
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Keyword(s)
  • Environmental Sound Analysis

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024