
Comparison of two tonality estimation methods used in a psychoacoustic model

Authors: Chen, H.; Taghipour, A.; Edler, B.


International Conference on Audio, Language and Image Processing, ICALIP 2014 : 7th - 9th July 2014, Shanghai, China
Piscataway, NJ: IEEE, 2014
ISBN: 978-1-4799-3904-6
ISBN: 978-1-4799-3903-9
International Conference on Audio, Language and Image Processing (ICALIP) <4, 2014, Shanghai>
Fraunhofer IIS

Perceptual audio codecs apply psychoacoustic principles, such as the masking effects of the human auditory system, to reduce irrelevancies in the input audio signal. Psychoacoustic studies show that tonal and noise maskers differ in masking strength: narrowband noise masks more strongly than a tone of the same power placed at the center frequency of the noise. This paper discusses two tonality estimation methods implemented in a filter-bank-based psychoacoustic model. The first is the Partial Spectral Flatness Measure (PSFM); the second is the Amplitude Modulation Ratio (AM-R). The psychoacoustic model uses a set of complex band-pass filters. It was designed according to the temporal and spectral resolution of the human auditory system, and it accounts for post-masking as well as the spreading effect of individual local maskers in simultaneous masking. The paper describes the model, the two tonality estimation methods, and their implementation. The estimators are compared with each other in subjective tests, and the results are presented and discussed.
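
The abstract does not define PSFM, so the following is only a minimal sketch of the general idea behind a spectral flatness measure evaluated on part of a spectrum: the ratio of the geometric mean to the arithmetic mean of the power in a band of bins, which is close to 1 for noise-like content and close to 0 for tonal content. It is not the authors' implementation. The function names, the band limits, the frame length, and the `eps` floor are all assumptions chosen for illustration.

```python
import cmath
import math
import random


def power_spectrum(frame):
    """Naive DFT power spectrum for bins 0..N/2 (adequate for short frames)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def spectral_flatness(power, lo, hi, eps=1e-12):
    """Geometric mean divided by arithmetic mean of power[lo:hi].

    eps is an assumed floor that keeps log() finite for empty bins.
    """
    band = [p + eps for p in power[lo:hi]]
    log_mean = sum(math.log(p) for p in band) / len(band)
    return math.exp(log_mean) / (sum(band) / len(band))


# Illustrative signals (hypothetical parameters): a pure tone centered
# on bin 8 of a 64-sample frame, and seeded Gaussian noise.
n = 64
tone = [math.sin(2 * math.pi * 8 * t / n) for t in range(n)]
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(n)]

# Evaluate flatness only on a partial band of bins around the tone.
tone_flatness = spectral_flatness(power_spectrum(tone), 4, 13)
noise_flatness = spectral_flatness(power_spectrum(noise), 4, 13)
print("tone:", tone_flatness, "noise:", noise_flatness)
```

In this sketch the tone yields a flatness far below that of the noise frame, which is the property a tonality estimator of this kind relies on. A psychoacoustic model would typically evaluate such a measure per band of its filter bank rather than on a single fixed range of DFT bins.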