• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Artikel
  4. STAR Drums: A Dataset for Automatic Drum Transcription
 
  • Details
  • Full
Options
July 29, 2025
Journal Article
Title

STAR Drums: A Dataset for Automatic Drum Transcription

Abstract
Current state-of-the-art automatic drum transcription (ADT) algorithms make use of neural networks. To train such models, large amounts of annotated data are needed. We introduce the Separate-Tracks-Annotate-Resynthesize Drums (STAR Drums) dataset, derived from full audio recordings that include mixtures of drum instruments, melodic instruments, and vocals. First, we separate the music recordings into a drum stem and a non-drum stem by applying a music source separation algorithm, then automatically annotate the drum stem with an ADT algorithm. The annotations are used for the re-synthesis of the drum stem using sample-based virtual drum instruments. Finally, we mix the re-synthesized drum stem with the original non-drum stem to obtain the final mix. In summary, STAR Drums includes annotated synthesized drum sounds mixed with real recordings of melodic instruments and vocals, offering several benefits: high temporal accuracy of annotations; training data that include recordings of instruments played by musicians, rather than solely relying on MIDI-rendered audio; a large number of supported drum classes; the possibility to customize the final mix by, for instance, applying additional processing to the drum stem, as both drum and non-drum stems are provided; and suitable licenses of audio files for making the dataset fully available to the research community. We demonstrate that, in the context of ADT, training with STAR Drums achieves superior performance compared to training with datasets solely relying on MIDI-rendered data and that the synthesized nature of the drum stem does not diminish performance.
Author(s)
Weber, Philipp
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Uhle, Christian  
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Müller, Meinard  
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Lang, Matthias  
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Journal
Transactions of the International Society for Music Information Retrieval  
Open Access
File(s)
Download (2.28 MB)
Rights
CC BY 4.0: Creative Commons Attribution
DOI
10.5334/tismir.244
10.24406/publica-6635
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Keyword(s)
  • Automatic drum transcription

  • automatic music transcription

  • dataset

  • audio

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024