• English
  • Deutsch
  • Log In
    or
  • Research Outputs
  • Projects
  • Researchers
  • Institutes
  • Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Recognition of phonemes in A-cappella recordings using temporal patterns and mel frequency cepstral coefficients
 
  • Details
  • Full
Options
2012
Conference Paper
Titel

Recognition of phonemes in A-cappella recordings using temporal patterns and mel frequency cepstral coefficients

Abstract
In this paper, a new method for recognizing phonemes in singing is proposed. Recognizing phonemes in singing is a task that has not yet matured to a standardized method, in comparison to regular speech recognition. The standard methods for regular speech recognition have already been evaluated on vocal records, but their performances are lower compared to regular speech. In this paper, two alternative classification methods dealing with this issue are proposed. One uses Mel-Frequency Cepstral Coefficient features, while another uses Temporal Patterns. They are combined to create a new type of classifier which produces a better performance than the two separate classifiers. The classifications are done with US English songs. The preliminary result is a phoneme recall rate of 48.01% in average of all audio frames within a song.
Author(s)
Hansen, Jens Kofod
Hauptwerk
9th Sound and Music Computing Conference, SMC 2012. Proceedings
Konferenz
Sound and Music Computing Conference (SMC) 2012
Thumbnail Image
Language
English
google-scholar
Fraunhofer-Institut für Digitale Medientechnologie IDMT
Tags
  • phoneme detection

  • vocal analysis

  • audio features

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Send Feedback
© 2022