Knowledge Transfer from Neural Networks for Speech Music Classification

Kehling, Christian; Cano, Estefanía

2021

Conference Paper

Abstract

A frequent problem when dealing with audio classification tasks is the scarcity of suitable training data. This work investigates ways of mitigating this problem by applying transfer learning techniques to neural network architectures for several classification tasks from the field of Music Information Retrieval (MIR). First, three state-of-the-art architectures are trained and evaluated with several datasets for the task of speech/music classification. Second, feature representations or embeddings are extracted from the trained networks to classify new tasks with unseen data. The effect of pre-training with respect to the similarity of the source and target tasks are investigated in the context of transfer learning, as well as different fine-tuning strategies.

Author(s)

Kehling, Christian

Cano, Estefanía

Mainwork

CMMR 2021, 15th International Symposium on Computer Music Multidisciplinary Research. Proceedings

Conference

International Symposium on Computer Music Multidisciplinary Research (CMMR) 2021

Options

Knowledge Transfer from Neural Networks for Speech Music Classification