Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Gradient-free decoding parameter optimization on automatic speech recognition

: Nguyen, T.L.; Stein, D.; Stadtschnitzer, M.


Institute of Electrical and Electronics Engineers -IEEE-; IEEE Signal Processing Society:
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014. Vol.4 : Florence, Italy, 4 - 9 May 2014
Piscataway, NJ: IEEE, 2014
ISBN: 978-1-4799-2894-1
ISBN: 978-1-4799-2892-7
DOI: 978-1-4799-2893-4
International Conference on Acoustics, Speech and Signal Processing (ICASSP) <39, 2014, Florence>
Fraunhofer IAIS ()

Finding the optimal decoding parameters in speech recognition is often done manually in a rather tedious manner, although automatic gradient-free optimization techniques have been shown to perform quite well for this task. While there have been recent scientific contributions in this field, no thorough comparison of possible methods, in terms of convergence speed and performance, has been undertaken. In this paper, we conduct a series of experiments with three decoding paradigms and four different optimization techniques found in recent literature, both on unconstrained and time-constrained decoder optimization. We offer our findings on the German Difficult Speech Corpus and on the LinkedTV test sets.