Feature Extraction Using Power-Law Adjusted Linear Prediction with Application to Speaker Recognition under Severe Vocal Effort Mismatch

Saeidi, R.; Alku, P.; Backstrom, T.

2016

Journal Article

Abstract

Linear prediction is one of the most established techniques in signal estimation, and it is widely used in speech signal processing. It has long been understood that the nerve firing rate of the human auditory system can be approximated by a power-law non-linearity, and this has motivated the use of perceptual linear prediction for extracting acoustic features in a variety of speech processing applications. In this paper, we revisit the application of the power-law non-linearity in speech spectrum estimation by compressing or expanding the power spectrum in autocorrelation-based linear prediction. The development of the so-called LP-α is motivated by a desire to obtain spectral features that present less mismatch than conventionally used spectrum estimation methods when speech of normal loudness is compared to speech under vocal effort.
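
The idea of compressing or expanding the power spectrum before autocorrelation-based linear prediction can be illustrated with a minimal sketch. This record contains only the abstract, so the code below is an assumption about the general approach and not the authors' implementation: it assumes the power spectrum of a frame is raised to an exponent alpha, transformed back to an autocorrelation sequence, and passed to the Levinson-Durbin recursion. The function names (`power_spectrum`, `autocorr_from_spectrum`, `levinson_durbin`, `lp_alpha`) and the default FFT length are hypothetical, and windowing and pre-emphasis are omitted.

```python
import cmath
import math


def power_spectrum(frame, n_fft):
    """Periodogram |X[m]|^2 of a zero-padded frame, using a direct O(N^2) DFT."""
    padded = list(frame) + [0.0] * (n_fft - len(frame))
    spectrum = []
    for m in range(n_fft):
        acc = sum(x * cmath.exp(-2j * math.pi * m * n / n_fft)
                  for n, x in enumerate(padded))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def autocorr_from_spectrum(spectrum, alpha, order):
    """Inverse DFT of the power spectrum raised to alpha, for lags 0..order.

    The spectrum of a real frame is symmetric, so a cosine sum suffices.
    Assumption: alpha > 0 (alpha < 1 compresses, alpha > 1 expands).
    """
    n_fft = len(spectrum)
    warped = [p ** alpha for p in spectrum]
    return [sum(w * math.cos(2 * math.pi * m * k / n_fft)
                for m, w in enumerate(warped)) / n_fft
            for k in range(order + 1)]


def levinson_durbin(r, order):
    """Solve the LP normal equations; returns ([1, a1, ..., ap], residual energy)."""
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient for this order
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return a, err


def lp_alpha(frame, order, alpha, n_fft=None):
    """Hypothetical LP with a power-law adjusted spectrum (sketch only)."""
    n_fft = n_fft or 2 * len(frame)  # long enough to avoid circular aliasing
    spectrum = power_spectrum(frame, n_fft)
    r = autocorr_from_spectrum(spectrum, alpha, order)
    return levinson_durbin(r, order)
```

Under these assumptions, alpha = 1 reduces to conventional autocorrelation-based linear prediction, which gives a convenient sanity check against a time-domain autocorrelation of the same frame.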

Journal

IEEE/ACM Transactions on Audio, Speech, and Language Processing
