• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Artikel
  4. Can Masked Autoencoders Also Listen to Birds?
 
  • Details
  • Full
Options
August 2025
Journal Article
Title

Can Masked Autoencoders Also Listen to Birds?

Abstract
Masked Autoencoders (MAEs) learn rich representations in audio classification throughan efficient self-supervised reconstruction task. Yet, general-purpose models struggle infine-grained audio domains such as bird sound classification, which demands distinguishingsubtle inter-species differences under high intra-species variability. We show that bridgingthis domain gap requires full-pipeline adaptation beyond domain-specific pretraining data.Using BirdSet, a large-scale bioacoustic benchmark, we systematically adapt pretraining,fine-tuning, and frozen feature utilization. Our Bird-MAE sets new state-of-the-art resultson BirdSet’s multi-label classification benchmark. Additionally, we introduce the parameterefficientprototypical probing, which boosts the utility of frozen MAE features by achievingup to 37 mAP points over linear probes and narrowing the gap to fine-tuning in low-resourcesettings. Bird-MAE also exhibits strong few-shot generalization with prototypical probeson our newly established few-shot benchmark on BirdSet, underscoring the importance oftailored self-supervised learning pipelines for fine-grained audio domains.
Author(s)
Rauch, Lukas
Universität Kassel  
Heinrich, René Patrick Gerald
Fraunhofer-Institut für Energiewirtschaft und Energiesystemtechnik IEE  
Moummad, Ilyass
Joly, Alexis
Sick, Bernhard
Scholz, Christoph
Fraunhofer-Institut für Energiewirtschaft und Energiesystemtechnik IEE  
Journal
Transactions on Machine Learning Research  
Project(s)
DeepBirdDetect
Funder
Bundesministerium für Umwelt, Naturschutz, nukleare Sicherheit und Verbraucherschutz -BMUV-
Link
Link
Language
English
Fraunhofer-Institut für Energiewirtschaft und Energiesystemtechnik IEE  
Keyword(s)
  • Self-supervised learning

  • masked autoencoders

  • bird-sound classification

  • bioacoustics

  • prototypical probing

  • few-shot learning

  • multi-label classification

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024