• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Konferenzschrift
  4. Perceptual audio features for unsupervised key-phrase detection
 
  • Details
  • Full
Options
2010
Conference Paper
Title

Perceptual audio features for unsupervised key-phrase detection

Abstract
We propose a new type of audio feature (HFCC-ENS) as well as an unsupervised method for detecting short sequences of spoken words (key-phrases) within long speech recordings. Our technical contributions are threefold: Firstly, we propose to use bandwidth-adapted filterbanks instead of classical MFCC-style filters in the feature extraction step. Secondly, the time resolution of the resulting features is adapted to account for the temporal characteristics of the spoken phrases. Thirdly, the key-phrase detection step is performed by matching sequences of the resulting HFCC-ENS features with features extracted from a target speech recording. We evaluate the proposed method using the German Kiel Corpus and furthermore investigate speech-related properties of the proposed feature.
Author(s)
Zeddelmann, D. von
Kurth, F.
Müller, M.
Mainwork
IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010. Proceedings. Vol.1  
Conference
International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2010  
DOI
10.1109/ICASSP.2010.5495974
Language
English
Fraunhofer-Institut für Kommunikation, Informationsverarbeitung und Ergonomie FKIE  
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024