Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

Unifying Local and Global Methods for Harmonic-Percussive Source Separation

: Dittmar, C.; Lopez-Serrano, P.; Müller, M.


Institute of Electrical and Electronics Engineers -IEEE-; IEEE Signal Processing Society:
IEEE International Conference on Acoustics, Speech, and Signal Processing 2018. Proceedings : April 15-20, 2018, Calgary Telus Convention Center, Calgary, Alberty, Canada
Piscataway, NJ: IEEE, 2018
ISBN: 978-1-5386-4658-8
ISBN: 978-1-5386-4657-1
ISBN: 978-1-5386-4659-5
International Conference on Acoustics, Speech, and Signal Processing (ICASSP) <2018, Calgary>
Fraunhofer IIS ()

This paper addresses the separation of drums from music recordings, a task closely related to harmonic-percussive source separation (HPSS). In previous works, two families of algorithms have been prominently applied to this problem. They are based either on local filtering and diffusion schemes, or on global low-rank models. In this paper, we propose to combine the advantages of both paradigms. To this end, we use a local approach based on Kernel Additive Modeling (KAM) to extract an initial guess for the percussive and harmonic parts. Subsequently, we use Non-Negative Matrix Factorization (NMF) with soft activation constraints as a global approach to jointly enhance both estimates. As an additional contribution, we introduce a novel constraint for enhancing percussive activations and a scheme for estimating the percussive weight of NMF components. Throughout the paper, we use a real-world music example to illustrate the ideas behind our proposed method. Finally, we report promising BSS Eval results achieved with the publicly available test corpora ENST-Drums and QUASI, which contain isolated drum and accompaniment tracks.