Fraunhofer-Gesellschaft
2016
Master Thesis
Title

Mapping representations of speaker characteristics using deep learning

Abstract
An automatic model-based system is proposed to estimate the corner vowel formant frequencies and the acoustic measure known as the triangular Vowel Space Area (tVSA) directly from unlabeled natural speech. The proposed algorithm estimates the tVSA from the speech signal without phonetic or vowel transcriptions. i-Vector features serve as the speaker characteristic representation, from which the formant frequencies of the speaker's corner vowels are estimated by regression models. Two regression models, Deep Neural Networks (DNN) and Support Vector Regression (SVR), are investigated in this thesis. The best configuration uses SVR, which predicts the formant frequencies of the test speakers with evaluation measures R² up to 0.56719 and rho up to 0.76485.
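For readers unfamiliar with the tVSA, the following minimal sketch shows how a triangular vowel space area could be computed once the corner vowel formants are known. It assumes the common convention that each corner vowel (/a/, /i/, /u/) is a point in the (F1, F2) plane and that the area follows from the shoelace formula. The function name `triangle_vsa` and the formant values are illustrative assumptions, not taken from the thesis.

```python
def triangle_vsa(a, i, u):
    """Area in Hz^2 of the triangle spanned by three (F1, F2) points.

    Each argument is an (F1, F2) pair in Hz for one corner vowel.
    Uses the shoelace formula; the order of the points does not matter.
    """
    (a1, a2), (i1, i2), (u1, u2) = a, i, u
    return 0.5 * abs(a1 * (i2 - u2) + i1 * (u2 - a2) + u1 * (a2 - i2))


# Hypothetical formant values in Hz, chosen only for illustration.
area = triangle_vsa(a=(750, 1300), i=(300, 2300), u=(350, 800))
print(area)  # 312500.0
```

In the system described above, the three (F1, F2) pairs would come from the regression output rather than from hand-labeled vowels.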
Thesis Note
Aachen, TH, Master Thesis, 2016
Author(s)
Tanuadji, Maureen
Publishing Place
Aachen
Project(s)
i-Prognosis  
Funder
European Commission  
File(s)
Download (1.09 MB)
Rights
Use according to copyright law
DOI
10.24406/publica-fhg-281512
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  