• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Speaker-Conditioning Single-Channel Target Speaker Extraction using Conformer-based Architectures
 
  • Details
  • Full
Options
2022
Conference Paper
Title

Speaker-Conditioning Single-Channel Target Speaker Extraction using Conformer-based Architectures

Abstract
Target speaker extraction aims at extracting the target speaker from a mixture of multiple speakers exploiting auxiliary information about the target speaker. In this paper, we consider a complete time-domain target speaker extraction system consisting of a speaker embedder network and a speaker separator network which are jointly trained in an end-to-end learning process. We propose two different architectures for the speaker separator network which are based on the convolutional augmented transformer (conformer). The first architecture uses stacks of conformer and external feed-forward blocks (Conformer-FFN), while the second architecture uses stacks of temporal convolutional network (TCN) and conformer blocks (TCN-Conformer). Experimental results for 2-speaker mixtures, 3-speaker mixtures, and noisy mixtures of 2-speakers show that among the proposed separator networks, the TCN-Conformer significantly improves the target speaker extraction performance compared to the Conformer-FFN and a TCN-based baseline system.
Author(s)
Sinha, Ragini  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Tammen, Marvin
Rollwage, Christian  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Doclo, Simon  
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Mainwork
International Workshop on Acoustic Signal Enhancement, IWAENC 2022. Proceedings  
Conference
International Workshop on Acoustic Signal Enhancement 2022  
DOI
10.1109/IWAENC53105.2022.9914691
Language
English
Fraunhofer-Institut für Digitale Medientechnologie IDMT  
Keyword(s)
  • attention

  • conformer

  • multi-task learning

  • target speaker extraction

  • TCN

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024