• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Bootstrapping a system for phoneme recognition and keyword spotting in unaccompanied singing
 
  • Details
  • Full
Options
2016
Conference Paper
Title

Bootstrapping a system for phoneme recognition and keyword spotting in unaccompanied singing

Abstract
Speech recognition in singing is still a largely unsolved problem. Acoustic models trained on speech usually produce unsatisfactory results when used for phoneme recognition in singing. On the flipside, there is no phonetically annotated singing data set that could be used to train more accurate acoustic models for this task. In this paper, we attempt to solve this problem using the DAMP data set which contains a large number of recordings of amateur singing in good quality. We first align them to the matching textual lyrics using an acoustic model trained on speech. We then use the resulting phoneme alignment to train new acoustic models using only subsets of the DAMP singing data. These models are then tested for phoneme recognition and, on top of that, keyword spotting. Evaluation is performed for different subsets of DAMP and for an unrelated set of the vocal tracks of commercial pop songs. Results are compared to those obtained with acoustic models trained on the TIMIT speech data set and on a version of TIMIT augmented for singing. Our new approach shows significant improvements over both.
Author(s)
Kruspe, Anna M.
Mainwork
17th International Society for Music Information Retrieval Conference, ISMIR 2016. Proceedings  
Conference
International Society for Music Information Retrieval (ISMIR Conference) <17, 2016, New York/NY)  
Link
Link
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Keyword(s)
  • vocal analysis

  • phoneme recognition

  • keyword spotting

  • lyrics alignment

  • Automatic Music Analysis

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024