Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Feature Extraction Using Power-Law Adjusted Linear Prediction with Application to Speaker Recognition under Severe Vocal Effort Mismatch

: Saeidi, R.; Alku, P.; Backstrom, T.


IEEE ACM transactions on audio, speech, and language processing 24 (2016), No.1, pp.42-53
ISSN: 2329-9290
ISSN: 2329-9304
Journal Article
Fraunhofer IIS ()

Linear prediction is one of the most established techniques in signal estimation, and it is widely utilized in speech signal processing. It has been long understood that the nerve firing rate of human auditory system can be approximated by power law non-linearity, and this has been the motivation behind using perceptual linear prediction in extracting acoustic features in a variety of speech processing applications. In this paper, we revisit the application of power law non-linearity in speech spectrum estimation by compressing/expanding power spectrum in autocorrelation-based linear prediction. The development of so-called LP-α is motivated by a desire to obtain spectral features that present less mismatch than conventionally used spectrum estimation methods when speech of normal loudness is compared to speech under vocal effort.