Fraunhofer-Gesellschaft
2024
Conference Paper
Title

Multi-Sample Dynamic Time Warping for Few-Shot Keyword Spotting

Abstract
In multi-sample keyword spotting, each keyword class is represented by multiple spoken instances, called samples. A naïve approach to detecting keywords in a target sequence is to query all samples of all classes using sub-sequence dynamic time warping. However, the resulting processing time increases linearly with the number of samples belonging to each class. Alternatively, only a single Fréchet mean can be queried for each class, which reduces processing time but usually also degrades detection performance because the variability of the query samples is not captured sufficiently well. In this work, multi-sample dynamic time warping is proposed to compute class-specific cost tensors that include the variability of all query samples. To significantly reduce the computational complexity during inference, these cost tensors are converted to cost matrices before applying dynamic time warping. In experimental evaluations for few-shot keyword spotting, it is shown that this method yields performance very similar to that obtained when using all individual query samples as templates, while its runtime is only slightly longer than that obtained when using Fréchet means.
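The naïve baseline described in the abstract, querying every sample of every class with sub-sequence dynamic time warping, can be illustrated with a minimal sketch. This is not the authors' implementation and does not reproduce the proposed cost-tensor method. The function names, the Euclidean frame cost, the step pattern, and the normalization by query length are all assumptions made for illustration.

```python
import math
from typing import Dict, List, Sequence

Frame = Sequence[float]


def frame_cost(a: Frame, b: Frame) -> float:
    """Euclidean distance between two feature frames (assumed local cost)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def subsequence_dtw_cost(query: Sequence[Frame],
                         target: Sequence[Frame]) -> List[float]:
    """For each target frame j, return the accumulated cost of the best
    alignment of the whole query to a target sub-sequence ending at j."""
    n, m = len(query), len(target)
    # The first query frame may start anywhere in the target at no extra cost.
    prev = [frame_cost(query[0], target[j]) for j in range(m)]
    for i in range(1, n):
        curr = [0.0] * m
        # In the first target column only a vertical step is possible.
        curr[0] = prev[0] + frame_cost(query[i], target[0])
        for j in range(1, m):
            # Assumed step pattern: vertical, diagonal, horizontal.
            best = min(prev[j], prev[j - 1], curr[j - 1])
            curr[j] = best + frame_cost(query[i], target[j])
        prev = curr
    return prev


def naive_multi_sample_scores(
    templates: Dict[str, List[Sequence[Frame]]],
    target: Sequence[Frame],
) -> Dict[str, float]:
    """Query every sample of every class and keep the best score per class.
    One DTW pass is run per sample, so runtime grows linearly with the
    number of samples per class."""
    scores = {}
    for label, samples in templates.items():
        # Dividing by the query length is an assumed normalization.
        per_sample = [min(subsequence_dtw_cost(s, target)) / len(s)
                      for s in samples]
        scores[label] = min(per_sample)
    return scores
```

Because one full alignment is computed per sample, adding samples to a class adds proportionally more work, which is the cost that the abstract says the Fréchet-mean and cost-tensor approaches aim to avoid.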
Author(s)
Wilkinghoff, Kevin  
Fraunhofer-Institut für Kommunikation, Informationsverarbeitung und Ergonomie FKIE  
Cornaggia-Urrigshardt, Alessia  
Fraunhofer-Institut für Kommunikation, Informationsverarbeitung und Ergonomie FKIE  
Mainwork
32nd European Signal Processing Conference, EUSIPCO 2024. Proceedings  
Conference
European Signal Processing Conference 2024  
DOI
10.23919/EUSIPCO63174.2024.10714966
Language
English
Keyword(s)
  • dynamic time warping
  • few-shot learning
  • keyword spotting
  • pattern matching
  • sound event detection
