Fraunhofer-Gesellschaft

Publica

Hier finden Sie wissenschaftliche Publikationen aus den Fraunhofer-Instituten.

A mid-level approach to local tonality analysis: Extracting key signatures from audio

 
: Weiss, C.; Cano, E.; Lukashevich, Hanna

Dittmar, C. ; Audio Engineering Society -AES-:
53rd International Conference on Semantic Audio 2014 : London, United Kingdom 26 – 29 January 2014
Red Hook, NY: Curran, 2014
ISBN: 978-1-63266-284-2
ISBN: 1-63266-284-1
pp.44-52
International Conference on Semantic Audio <53, 2014, London>
English
Conference Paper
Fraunhofer IDMT ()
tonality analysis; key estimation; audio features

Abstract
We propose a new method to automatically determine key signature changes. In automatic music transcription, unconsidered sections in distantly related keys may lead to music scores that are hard to read due to a high number of notated accidentals. The problem of key change is commonly addressed by finding the correct local key out of the 24 major and minor keys. However, to provide the best matching key signature, choosing the right mode (major or minor) is not necessary and thus, we only estimate the local underlying diatonic scale. After extracting chroma features and a beat grid from the audio data, we calculate local probabilities for the different diatonic scales. For this purpose, we present a multiplicative procedure that shows promising results for visualizing complex tonal structures. From the obtained diatonic scale estimates, we identify candidates for key signature changes. By clustering similar segments and applying minimum segment length constraints, we ge t the tonal segmentation. Our method was tested on a dataset containing 30 hand-annotated pop songs. To evaluate our results, we calculate scores based on the number of frames correctly annotated, as well as segment border F-measures and perform a cross-validation study. Our rulebased method yields up to 90 % class accuracy, and up to 70 % F-measure score for segment borders. These results are promising and qualify the approach to be applied for automatic music transcription.

: http://publica.fraunhofer.de/documents/N-301109.html