Adaptation and Training of a Swiss German Speech Recognition System using Data-driven Pronunciation Modelling

Stadtschnitzer, Michael; Schmidt, Christoph Andreas

doi:10.24406/publica-fhg-400968

2018

Conference Paper

Abstract

Automatic speech recognition is a very important technique for numerous applications like automatic subtitling, dialogue systems and information retrieval systems. Given an annotated speech corpus, a phonetic lexicon and a text corpus, the training of speech recognition systems is straight forward. However, sometimes some of these resources are not available, and strategies must be explored to fill this gap. In this work we train a Swiss German speech recognition system. The only resources that are available is a small Swiss German speech corpus, which is annotated with standard German text. Standard German is the desired output of the speech recognition system, since there is no standardized way to write Swiss German. We use a data-driven approach to estimate the Swiss German pronunciatio ns from a standard German speech recognition model, to improve the Swiss German speech recognition system. Evaluations of the Swiss German speech recognition system show promising results.

Author(s)

Stadtschnitzer, Michael

Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Schmidt, Christoph Andreas

Fraunhofer-Institut für Intelligente Analyse- und Informationssysteme IAIS

Mainwork

Fortschritte der Akustik. DAGA 2018

Conference

Deutsche Jahrestagung für Akustik (DAGA) 2018

Options

Adaptation and Training of a Swiss German Speech Recognition System using Data-driven Pronunciation Modelling