• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. No Data Required: Zero-Shot Domain Adaptation for Automatic Music Transcription
 
  • Details
  • Full
Options
April 6, 2025
Conference Paper
Title

No Data Required: Zero-Shot Domain Adaptation for Automatic Music Transcription

Abstract
Automatic music transcription (AMT) takes a music recording and outputs a transcription of the underlying music. Deep learning models trained for AMT rely on large amounts of annotated training data, which are available only for some domains such as Western classical piano music. Using pre-trained models on out-of-domain inputs can lead to significantly lower performance. Fine-tuning or retraining on new target domains is expensive and relies on the presence of labeled data. In this work, we propose a method for taking a pre-trained transcription model and improving its performance on out-of-domain data without the need for any training data, requiring no fine-tuning or retraining of the original model. Our method uses the model to transcribe pitch-shifted versions of an input, aggregating the output across these versions where the original model is unsure. We take a model originally trained for piano transcription and present experiments under two domain shift scenarios: recording condition mismatch (piano with different recording setups) and instrument mismatch (guitar and choral data). We show that our method consistently improves note- and frame-based performance.
Author(s)
McLeod, Andrew  orcid-logo
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Mainwork
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025. Proceedings  
Conference
International Conference on Acoustics, Speech and Signal Processing 2025  
DOI
10.1109/ICASSP49660.2025.10890396
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Keyword(s)
  • Measurement

  • Training

  • Adaptation models

  • Uncertainty

  • Instruments

  • Training data

  • Data models

  • Recording

  • Multiple signal classification

  • Speech processing

  • Automatic music transcription

  • Zero-shot learning

  • Domain adaptation

  • Automatic Music Analysis

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024