Skip to main navigation Skip to search Skip to main content

Pitch Class and Octave-Based Pitch Embedding Training Strategies for Symbolic Music Generation

  • Yuqiang Li*
  • , Shengchen Li
  • , György Fazekas
  • *Corresponding author for this work
  • Queen Mary University of London

Research output: Chapter in Book or Report/Conference proceedingConference Proceedingpeer-review

Abstract

This paper presents two strategies to prevent the learned pitch representation from worsening so as to improve pitch and pitch class distributions in symbolic music generation. The first strategy is to switch the input pitch representation from the flat MIDI number representation to a hierarchical representation consisting of pitch class (chroma) and octave, which forces musically similar pitches to share part of the embedding vectors. The second strategy freezes the pitch embeddings during training according to the proposed evaluation metric of the pitch embedding space, maintaining the robustness of the embedding obtained in the first strategy. The experiments show that, when both strategies were applied to training an auto-regressive neural network for melody generation, the generated samples exhibited significant improvement in pitch class entropy (from 19% to 34% overlapping with the test dataset), and a modest but still significant improvement on pitch entropy (from 24% to 28%).

Original languageEnglish
Title of host publicationMusic and Sound Generation in the AI Era - 16th International Symposium, CMMR 2023, Revised Selected Papers
EditorsSølvi Ystad, Richard Kronland-Martinet, Mitsuko Aramaki, Tetsuro Kitahara, Keiji Hirata
PublisherSpringer Science and Business Media Deutschland GmbH
Pages149-167
Number of pages19
ISBN (Print)9783032020413
DOIs
Publication statusPublished - 2026
Event16th International Symposium on Computer Music Multidisciplinary Research, CMMR 2023 - Tokyo, Japan
Duration: 13 Nov 202317 Nov 2023

Publication series

NameLecture Notes in Computer Science
Volume15236 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th International Symposium on Computer Music Multidisciplinary Research, CMMR 2023
Country/TerritoryJapan
CityTokyo
Period13/11/2317/11/23

Keywords

  • music feature representation
  • pitch representation
  • symbolic music generation

Cite this