• English
  • Deutsch
  • Log In
    Password Login
    or
  • Research Outputs
  • Projects
  • Researchers
  • Institutes
  • Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Artikel
  4. A robust fitness measure for capturing repetitions in music recordings with applications to audio thumbnailing
 
  • Details
  • Full
Options
2013
Journal Article
Titel

A robust fitness measure for capturing repetitions in music recordings with applications to audio thumbnailing

Abstract
The automatic extraction of structural information from music recordings constitutes a central research topic. In this paper, we deal with a subproblem of audio structure analysis called audio thumbnailing with the goal to determine the audio segment that best represents a given music recording. Typically, such a segment has many (approximate) repetitions covering large parts of the recording. As the main technical contribution, we introduce a novel fitness measure that assigns a fitness value to each segment that expresses how much and how well the segment "explains" the repetitive structure of the entire recording. The thumbnail is then defined to be the fitness-maximizing segment. To compute the fitness measure, we describe an optimization scheme that jointly performs two error-prone steps, path extraction and grouping, which are usually performed successively. As a result, our approach is even able to cope with strong musical and acoustic variations that may occur within and across related segments. As a further contribution, we introduce the concept of fitness scape plots that reveal global structural properties of an entire recording. Finally, to show the robustness and practicability of our thumbnailing approach, we present various experiments based on different audio collections that comprise popular music, classical music, and folk song field recordings.
Author(s)
Müller, M.
Jiang, N.Z.
Grosche, P.
Zeitschrift
IEEE transactions on audio, speech and language processing
Funder
Deutsche Forschungsgemeinschaft DFG
Thumbnail Image
DOI
10.1109/TASL.2012.2227732
Language
English
google-scholar
Fraunhofer-Institut für Integrierte Schaltungen IIS
  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Send Feedback
© 2022