January 14, 2025
Journal Article
Title

Sound recurrence analysis for acoustic scene classification

Abstract
In everyday life, people experience different soundscapes in which natural sounds, animal noises, and man-made sounds blend together. Although there have been several studies on the importance of recurring sound patterns in music and language, the relevance of this phenomenon in natural soundscapes is still largely unexplored. In this article, we study the repetition patterns of harmonic and transient sound events as potential cues for acoustic scene classification (ASC). In the first part of our study, our aim is to identify acoustic scene classes that exhibit characteristic sound repetition patterns concerning harmonic and transient sounds. We propose three metrics to measure the overall prevalence of sound repetitions as well as their repetition periods and temporal stability. In the second part, we evaluate three strategies to incorporate self-similarity matrices as an additional input feature to a convolutional neural network architecture for ASC. We observe the characteristic repetition of transient sounds in recordings of "park" and "street traffic" as well as harmonic sound repetitions in acoustic scene classes related to public transportation. In the ASC experiments, hybrid network architectures, which combine spectrogram features and features from sound recurrence analysis, show increased accuracy for those classes with prominent sound repetition patterns. Our findings provide additional perspective on the distinctions among acoustic scenes previously primarily ascribed in the literature to their spectral features.
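As a rough illustration of the self-similarity matrices mentioned in the abstract, the following minimal sketch computes a cosine self-similarity matrix over a toy feature sequence and reads off the lag with the strongest repetition. It is not the authors' method and does not reproduce the three metrics proposed in the article: the function names `cosine_ssm` and `dominant_lag`, the toy data, and the choice of cosine similarity are all assumptions made for illustration.

```python
import math

def cosine_ssm(frames):
    """Cosine self-similarity matrix of a list of feature vectors (hypothetical helper)."""
    # Guard against all-zero frames by falling back to a norm of 1.0.
    norms = [math.sqrt(sum(x * x for x in f)) or 1.0 for f in frames]
    unit = [[x / n for x in f] for f, n in zip(frames, norms)]
    return [[sum(a * b for a, b in zip(u, v)) for v in unit] for u in unit]

def dominant_lag(ssm, min_lag=1):
    """Return the lag (in frames) whose off-diagonal has the highest mean similarity.

    This is a simple stand-in for a repetition-period estimate, not a metric
    taken from the article.
    """
    n = len(ssm)
    best_lag, best_score = None, -math.inf
    for lag in range(min_lag, n // 2 + 1):
        diag = [ssm[i][i + lag] for i in range(n - lag)]
        score = sum(diag) / len(diag)
        if score > best_score:  # strict comparison keeps the shortest best lag
            best_lag, best_score = lag, score
    return best_lag, best_score

# Toy "spectrogram": a two-bin frame pattern that repeats every 4 frames.
pattern = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0], [0.0, 1.0]]
frames = pattern * 4  # 16 frames in total

ssm = cosine_ssm(frames)
lag, score = dominant_lag(ssm)
print(f"dominant repetition lag: {lag} frames (mean similarity {score:.2f})")
```

In a real setting the frames would be spectrogram columns of the harmonic or transient signal component after a source separation step, and the lag in frames would be converted to seconds using the hop size.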
Author(s)
Abeßer, Jakob  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Liang, Zhiwei
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Seeber, Bernhard  
Technical University of Munich
Journal
EURASIP Journal on audio, speech, and music processing : EURASIP JASMP  
Open Access
DOI
10.1186/s13636-024-00390-2
Language
English
Keyword(s)
  • Acoustic scene classification
  • Sound recurrence analysis
  • Sound repetition patterns
  • Self-similarity matrix
  • Harmonic-percussive source separation
  • Result fusion
  • Ensemble models
  • Environmental Sound Analysis
  • Media Forensics
