Fraunhofer-Gesellschaft

Publica

Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

A real-time speech command detector for a smart control room

 
: Reich, Daniel; Putze, Felix; Heger, Dominic; Ijsselmuiden, Joris; Stiefelhagen, Rainer; Schultz, Tanja

:
Postprint urn:nbn:de:0011-n-1908732 (57 KByte PDF)
MD5 Fingerprint: f5eebfeb1aed3add0015acbd776fcbe1
Erstellt am: 21.12.2011


International Speech Communication Association -ISCA-:
INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association : Florence, Italy, 28-31 August 2011
Florence, 2011
ISSN: 1990-9772
S.2641-2644
International Speech Communication Association (Annual Conference INTERSPEECH) <12, 2011, Florence>
Englisch
Konferenzbeitrag, Elektronische Publikation
Fraunhofer IOSB ()
smart environment; command detection; trigger- free always-on speech interface; prosody feature; decoding feature

Abstract
In this work we present an online ASR system that is able to discriminate voice commands directed to an operationable screen from irrelevant speech segments. For classification of the sound segments we explored several features that are based on prosody as well as properties generated during the decoding process. For a vocabulary of 259 words and more than 10k possible commands, our realtime Verbal Command Detector managed to detect 88.3% of the commands in our evaluation data while maintaining a low False Positive Rate (FPR) of 1.5%. On an evaluation task using an episode of Star Trek, our system was able to detect 91.2% of all commands with a FPR of 1.8% with only minor adjustments. The system is part of and used in the Smart Control Room at the Fraunhofer IOSB in Karlsruhe [1], an experimental smart environment that uses multiple input modalities for crisis response.

: http://publica.fraunhofer.de/dokumente/N-190873.html