• English
  • Deutsch
  • Log In
    Password Login
    Research Outputs
    Fundings & Projects
    Researchers
    Institutes
    Statistics
Repository logo
Fraunhofer-Gesellschaft
  1. Home
  2. Fraunhofer-Gesellschaft
  3. Scopus
  4. Theme Transformer: Symbolic Music Generation with Theme-Conditioned Transformer
 
  • Details
  • Full
Options
2023
Journal Article
Title

Theme Transformer: Symbolic Music Generation with Theme-Conditioned Transformer

Abstract
Attention-based Transformer models have been increasingly employed for automatic music generation. To condition the generation process of such a model with a user-specified sequence, a popular approach is to take that conditioning sequence as a priming sequence and ask a Transformer decoder to generate a continuation. However, this prompt-based conditioning cannot guarantee that the conditioning sequence would develop or even simply repeat itself in the generated continuation. In this paper, we propose an alternative conditioning approach, called theme-based conditioning, that explicitly trains the Transformer to treat the conditioning sequence as a thematic material that has to manifest itself multiple times in its generation result. This is achieved with two main technical contributions. First, we propose a deep learning-based approach that uses contrastive representation learning and clustering to automatically retrieve thematic materials from music pieces in the training data. Second, we propose a novel gated parallel attention module to be used in a sequence-to-sequence (seq2seq) encoder/decoder architecture to more effectively account for a given conditioning thematic material in the generation process of the Transformer decoder. We report on objective and subjective evaluations of variants of the proposed Theme Transformer and the conventional prompt-based baseline, showing that our best model can generate, to some extent, polyphonic pop piano music with repetition and plausible variations of a given condition.
Author(s)
Shih, Yi-Jen
National Taiwan University
Wu, Shih-Lun
National Taiwan University
Zalkow, Frank  orcid-logo
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Müller, Meinard  
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Yang, Yi-Hsuan
Academia Sinica, Research Center for Information Technology Innovation
Journal
IEEE transactions on multimedia  
DOI
10.1109/TMM.2022.3161851
Language
English
Fraunhofer-Institut für Integrierte Schaltungen IIS  
Keyword(s)
  • Transformers

  • Music

  • Decoding

  • Bars

  • Training

  • Testing

  • Deep learning

  • Cookie settings
  • Imprint
  • Privacy policy
  • Api
  • Contact
© 2024