April 6, 2025
Conference Paper
Title

The Sound of Language: A Bilingual Analysis of Voice Conversion and Text-to-Speech Synthesis

Abstract
With the rise of audio deepfakes, there is an increasing need for comprehensive studies on their generation methods, especially regarding their quality. Areas such as languages beyond English and Chinese, as well as comparisons between voice conversion (VC) and text-to-speech synthesis (TTS), remain underexplored. In our study, we generated samples in English and German using 10 recent VC and TTS methods, including two publicly accessible online tools. We compared these samples using various evaluation methods to gain insights into their quality across different factors. Our analysis indicates that TTS performs slightly better than VC, with minor differences between English and German data. Interestingly, in VC, the gender of the source speaker has minimal influence on the generated samples. Instead, the cross-gender factor appears to affect VC. For both VC and TTS, the target speaker samples used for generation seem to influence the quality of the generated samples.
Author(s)
Choi, Jeong-Eun  
Fraunhofer-Institut für Sichere Informationstechnologie SIT  
Schäfer, Karla
Fraunhofer-Institut für Sichere Informationstechnologie SIT  
Steinebach, Martin  
Fraunhofer-Institut für Sichere Informationstechnologie SIT  
Mainwork
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2025. Proceedings  
Conference
International Conference on Acoustics, Speech and Signal Processing 2025  
DOI
10.1109/ICASSP49660.2025.10888796
Language
English