• English
  • Deutsch
  • Log In
    Password Login
    or
  • Research Outputs
  • Projects
  • Researchers
  • Institutes
  • Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Keyword spotting in singing with duration-modeled HMMs
 
  • Details
  • Full
Options
2015
Conference Paper
Titel

Keyword spotting in singing with duration-modeled HMMs

Abstract
Keyword spotting in speech is a very well-researched problem, but there are almost no approaches for singing. Most speech-based approaches cannot be applied easily to singing because the phoneme durations in singing vary a lot more than in speech, especially the vowel durations. To represent expected phoneme durations, several duration modeling techniques have been developed over the years in the field of ASR. To the best of our knowledge, these approaches have not been used for keyword spotting yet. In this paper, we present a new approach for keyword spotting in singing. We first extract various features (MFCC, TRAP, PLP, RASTA-PLP) and generate phoneme posteriograms from these features. We then perform keyword spotting on these posteriograms using keyword-filler HMMs and test two different duration modeling techniques on these HMMs: Explicit-duration modeling and Post-processor duration modeling. We evaluate our approach on a small singing data set without accompaniment.
Author(s)
Kruspe, Anna M.
Hauptwerk
23rd European Signal Processing Conference, EUSIPCO 2015
Konferenz
European Signal Processing Conference (EUSIPCO) 2015
Thumbnail Image
DOI
10.1109/EUSIPCO.2015.7362592
Language
English
google-scholar
Fraunhofer-Institut für Digitale Medientechnologie IDMT
Tags
  • keyword spotting

  • vocal analysis

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Send Feedback
© 2022