• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Audio Augmentations for Semi-Supervised Learning with FixMatch
 
  • Details
  • Full
Options
2022
Conference Paper not in Proceedings
Title

Audio Augmentations for Semi-Supervised Learning with FixMatch

Title Supplement
Paper presented at the Extended Abstracts for the Late-Breaking Demo Session of the 23rd Int. Society for Music Information Retrieval Conf., Bengaluru, India, 2022
Abstract
FixMatch, a semi-supervised learning method proposed for image classification, includes unlabeled data instances into the training procedure by predicting labels for differently augmented versions of the unlabeled data. In our previous work, we adapted FixMatch to audio classification by applying image augmentations to spectral representations of the audio signal. While this approach matched the performance of the supervised baseline with only a fraction of the training data, the performance of audio-specific augmentation techniques, and their effect on the FixMatch approach was not evaluated. In this work, we replace all image-based augmentation techniques with audio-specific ones and keep the feature extraction unchanged. The audio-specific approach improved upon the supervised baseline which confirms the effectiveness of the FixMatch approach for semi-supervised learning even with a completely different set of augmentations. However, the image-based approach outperforms the audio-based approach on the three audio classification tasks evaluated.
Author(s)
Grollmisch, Sascha  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Cano, Estefania
Abeßer, Jakob  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Conference
International Society for Music Information Retrieval (ISMIR Conference) 2022  
Open Access
File(s)
Download (108.37 KB)
Rights
CC BY 4.0: Creative Commons Attribution
DOI
10.24406/publica-4779
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Keyword(s)
  • Automatic Music Analysis

  • Semi-Supervised Learning

  • Audio Classification

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024