• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Cross-Version Singing Voice Detection in Opera Recordings: Challenges for Supervised Learning
 
  • Details
  • Full
Options
2020
Conference Paper
Title

Cross-Version Singing Voice Detection in Opera Recordings: Challenges for Supervised Learning

Abstract
In this paper, we approach the problem of detecting segments of singing voice activity in opera recordings. We consider three state-of-the-art methods for singing voice detection based on supervised deep learning. We train and test these models on a novel dataset comprising three annotated performances (versions) of Richard Wagner's opera ""Die Walküre."" The results of our cross-version experiments indicate that the models do not sufficiently generalize across versions even in the case that another version of the same musical work is available for training. By further analyzing the systems' predictions, we highlight certain correlations between prediction errors and the presence of specific singers, instrument families, and dynamic aspects of the performance. With these findings, our case study provides a first step towards tackling singing voice detection with deep learning in challenging scenarios such as Wagner's operas.
Author(s)
Mimilakis, Stylianos Ioannis  
Weiss, Christof
Arifi-Müller, Vlora  
Abeßer, Jakob  
Müller, Meinard  
Mainwork
Machine Learning and Knowledge Discovery in Databases. Proceedings. Pt.II  
Conference
European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD) 2019  
DOI
10.1007/978-3-030-43887-6_35
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Keyword(s)
  • Automatic Music Analysis

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024