Glottal Mixture Model (GLOMM) for speaker identification on telephone channels

Baggenstoss, Paul; Wilkinghoff, K.; Kurth, F.

2017

Conference Paper

Abstract

The Glottal Mixture Model (GLOMM) extracts speaker-dependent voice source information from speech data. It has previously been shown to provide speaker identification performance on clean speech comparable to universal background model (UBM), a state of the art method based on MFCC. And, when combined with UBM, the error rate was reduced by a factor of three, showing that the voice source information is largely independent of the information contained in the MFCC, yet holds as much speaker-related information. We now describe how GLOMM can be adapted for telephone quality audio and provide significant error reduction when combined with UBM and I-vector approaches. We demonstrate a factor of two error reduction on the NTIMIT data set with respect to the best published results.

Author(s)

Baggenstoss, Paul

Wilkinghoff, K.

Kurth, F.

Hauptwerk

25th European Signal Processing Conference, EUSIPCO 2017

Konferenz

European Signal Processing Conference (EUSIPCO) 2017

Options

Glottal Mixture Model (GLOMM) for speaker identification on telephone channels