2010
Conference Paper
Title

DiSCo - a German evaluation corpus for challenging problems in the broadcast domain

Abstract
Typical broadcast material contains not only studio-recorded texts read by trained speakers, but also spontaneous and dialect speech, debates with cross-talk, voice-overs, and on-site reports with difficult acoustic environments. Standard approaches to speech and speaker recognition usually deteriorate under such conditions. This paper reports on the design, construction, and experimental analysis of DiSCo, a German corpus for the evaluation of speech and speaker recognition on challenging material from the broadcast domain. One of the key requirements for the design of this corpus was a good coverage of different types of serious programmes beyond clean speech and planned speech broadcast news. Corpus annotation encompasses manual segmentation, an orthographic transcription, and labelling with speech mode, dialect, and noise type. We indicate typical use cases for the corpus by reporting results from ASR, speech search, and speaker recognition on the new corpus, thereby obtaining insights into the difficulty of audio recognition on the various classes.
Author(s)
Baum, D.
Schneider, Daniel  
Bardeli, Rolf  
Schwenninger, Jochen  
Samlowski, B.
Winkler, Thomas  
Köhler, Joachim  
Mainwork
LREC 2010, 7th International Conference on Language Resources and Evaluation. Proceedings  
Conference
International Conference on Language Resources and Evaluation (LREC) 2010  
Rights
Use according to copyright law
DOI
10.24406/publica-fhg-366289
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  