• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Bridging the Gap: GANs as a Solution for Data-Scarce Industrial Audio Classification
 
  • Details
  • Full
Options
March 2025
Conference Paper
Title

Bridging the Gap: GANs as a Solution for Data-Scarce Industrial Audio Classification

Abstract
Data scarcity remains challenging when training deep learning models. This is particularly true in audio classification tasks within the Industrial Sound Analysis (ISA) domain, where the collection of training data represents a significant investment. In this work, we investigate the usage of Generative Adversarial Networks (GANs) in generating synthetic data to improve model performance in low-resource domains. An audio classification model is then trained on a combination of this synthetic data with (comparatively few) real-world examples, and evaluated on unseen real data. We view this approach as analogous to data augmentation, where instead of transforming existing data, the GAN generates novel data for training diversification. To explore the method's potential for various use cases, we run experiments on two different ISA datasets, each exhibiting different audio characteristics, imposing various levels of training data scarcity on each. We show that our method leads to a significant increase in classification accuracy for one of the two datasets, and we analyze the factors which make it successful.
Author(s)
Ngamthipwatthana, Pitchapa
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Gourishetti, Saichand  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
McLeod, Andrew  orcid-logo
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Grollmisch, Sascha  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Mainwork
DAS/DAGA 2025, 51st Annual Meeting on Acoustics. Proceedings  
Conference
Annual Meeting on Acoustics 2025  
DOI
10.71568/dasdaga2025.028
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Keyword(s)
  • Analyse Industriegeräusche

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024