• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. DiSCo - a speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain
 
  • Details
  • Full
Options
2009
Conference Paper
Title

DiSCo - a speaker and speech recognition evaluation corpus for challenging problems in the broadcast domain

Abstract
Systems for speech and speaker recognition already achieve low error rates when applied to high-quality audiovisual broadcast data, such as news shows recorded in a studio environment. Several evaluation corpora exist for this domain in various languages. However, in actual applications for broadcast data analysis, the data requirements are more complex. There are many data types beyond the planned speech of the news anchorperson. For example, interesting live recordings from prominent politicians are often recorded in an environment with challenging acoustic properties. Discussions typically expose highly spontaneous speech, with different speakers talking at the same time. The performance of standard approaches to speech and speaker recognition typically deteriorates under such data characteristics, and dedicated techniques have to be developed to handle these problems. Corresponding evaluation corpora are needed which re?ect the challenging conditions of the actual applications. Currently, no German evaluation corpus is available which covers the required acoustic conditions and diverse language properties. This contribution describes the design of a new speaker and speech recognition evaluation corpus for the broad- cast domain, re?ecting the typical problems encountered in actual applications.
Author(s)
Baum, D.
Samlowski, B.
Winkler, Thomas  
Bardeli, Rolf  
Schneider, Daniel  
Mainwork
GSCL Symposium 'Sprachtechnologie und eHumanities' 2009. Proceedings  
Conference
Symposium 'Sprachtechnologie und eHumanities' 2009  
File(s)
Download (111.49 KB)
Rights
Use according to copyright law
DOI
10.24406/publica-fhg-361861
Language
English
Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS  
Keyword(s)
  • corpus

  • speech recognition

  • broadcast

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024