• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. An Open Dataset of Synthetic Speech
 
  • Details
  • Full
Options
2024
Conference Paper
Title

An Open Dataset of Synthetic Speech

Abstract
This paper introduces a multilingual, multispeaker dataset composed of synthetic and natural speech, designed to foster research and benchmarking in synthetic speech detection. The dataset encompasses 18,993 audio utterances synthesized from text, alongside with their corresponding natural equiva-lents, representing approximately 17 hours of synthetic audio data. The dataset features synthetic speech generated by 156 voices spanning three languages, namely, English, German, and Spanish, with a balanced gender representation. It targets state-of-the-art synthesis methods, and has been released with a license allowing seamless extension and redistribution by the research community.
Author(s)
Yaroshchuk, Artem
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Papastergiopoulos, Christoforos
Cuccovillo, Luca  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Aichroth, Patrick  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Votis, Konstantinos
Tzovaras, Dimitrios
Mainwork
IEEE Workshop on Information Forensics and Security, WIFS 2023  
Conference
International Workshop on Information Forensics and Security 2023  
Open Access
DOI
10.1109/WIFS58808.2023.10374863
Additional full text version
Landing Page
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Keyword(s)
  • Dataset

  • Audio Deepfakes

  • Audio Forensics

  • Media Forensics

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024