Adaptation and Training of a Swiss German Speech Recognition System using Data-driven Pronunciation Modelling
Automatic speech recognition is a very important technique for numerous applications like automatic subtitling, dialogue systems and information retrieval systems. Given an annotated speech corpus, a phonetic lexicon and a text corpus, the training of speech recognition systems is straight forward. However, sometimes some of these resources are not available, and strategies must be explored to fill this gap. In this work we train a Swiss German speech recognition system. The only resources that are available is a small Swiss German speech corpus, which is annotated with standard German text. Standard German is the desired output of the speech recognition system, since there is no standardized way to write Swiss German. We use a data-driven approach to estimate the Swiss German pronunciatio ns from a standard German speech recognition model, to improve the Swiss German speech recognition system. Evaluations of the Swiss German speech recognition system show promising results.