• English
  • Deutsch
  • Log In
    Password Login
    Have you forgotten your password?
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. DOL3: Distilled OpenL3 audio embeddings for lightweight audio classification
 
  • Details
  • Full
Options
October 4, 2024
Conference Paper
Title

DOL3: Distilled OpenL3 audio embeddings for lightweight audio classification

Abstract
Deep audio representations, also known as embeddings, recently became a popular alternative to conventional features like spectrograms for a wide range of audio classification tasks because of their domain-agnostic character and reduced training costs. Still, the usage is often limited to rather computationally intensive system due to the nature of their extraction from large networks. This paper aims to minimize the computational costs of embedding extraction by distilling the knowledge of the OpenL3 audio network to a smaller student network. Results show that the student network maintains comparable performance as the teacher network on various music and ambient noise classification tasks, while reducing the network size by over 90\% and the computational load by five times.
Author(s)
Kehling, Christian
Gourishetti, Saichand  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Mainwork
INTER-NOISE and NOISE-CON Congress and Conference Proceedings 2024  
Conference
International Congress & Exposition on Noise Control Engineering 2024  
DOI
10.3397/IN_2024_3214
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Keyword(s)
  • Analyse Industriegeräusche

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024