An Interpretable, Flexible, and Interactive Probabilistic Framework for Melody Generation

Authors: Yue Jiang

KDD '23: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Pages 4089 - 4099

https://doi.org/10.1145/3580305.3599772

Published: 04 August 2023


Abstract

Demand for algorithmic music generation is growing quickly across entertainment, art, and education. Unfortunately, most recent models are practically impossible to interpret or musically fine-tune, as they use deep neural networks with thousands of parameters. We introduce SchenkComposer, an interpretable, flexible, and interactive model for melody generation that empowers users to be creative in all aspects of the music generation pipeline and allows them to learn from the process. We divide the task of melody generation into steps modeled on the process that a human composer with music-theoretical domain knowledge might follow. First, the model determines phrase structure based on form analysis and identifies an appropriate number of measures. Then, using concepts from Schenkerian analysis, it finds a fitting harmonic rhythm, middleground harmonic progression, foreground rhythm, and melody in a hierarchical, scaffolded approach, using a probabilistic context-free grammar based on musical contours. By incorporating theories of musical form and harmonic structure, our model produces music with long-term structural coherence. In extensive human experiments, music generated with our approach passes a Turing test while current state-of-the-art approaches fail, and listeners rate and prefer our melodies over those of existing melody generation methods. Additionally, we developed and deployed a public website for SchenkComposer and conducted preliminary user surveys. Our analysis of these surveys shows the strong viability and enjoyability of SchenkComposer.
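To make the idea of sampling from a probabilistic context-free grammar over musical contours concrete, here is a minimal sketch in Python. It is not SchenkComposer's grammar: the nonterminals (`Phrase`, `Rise`, `Fall`), the terminal contour symbols (`up`, `down`, `hold`), and all rule probabilities are hypothetical values chosen only for illustration.

```python
import random

# Hypothetical toy grammar (NOT taken from the paper): each nonterminal maps
# to a list of (expansion, probability) pairs. Symbols that do not appear as
# keys ("up", "down", "hold") are terminal contour steps.
GRAMMAR = {
    "Phrase": [(("Rise", "Fall"), 0.6), (("Fall", "Rise"), 0.4)],
    "Rise": [(("up",), 0.5), (("up", "Rise"), 0.3), (("hold", "up"), 0.2)],
    "Fall": [(("down",), 0.5), (("down", "Fall"), 0.3), (("hold", "down"), 0.2)],
}


def expand(symbol, rng):
    """Recursively expand a symbol into a list of terminal contour steps."""
    if symbol not in GRAMMAR:
        return [symbol]  # terminal: emit as-is
    expansions, weights = zip(*GRAMMAR[symbol])
    # Pick one production according to its probability.
    chosen = rng.choices(expansions, weights=weights, k=1)[0]
    steps = []
    for child in chosen:
        steps.extend(expand(child, rng))
    return steps


def sample_contour(seed=None):
    """Sample one contour for a phrase; a fixed seed makes it reproducible."""
    rng = random.Random(seed)
    return expand("Phrase", rng)


if __name__ == "__main__":
    print(sample_contour(seed=0))
```

Because every choice is an explicit rule with an explicit probability, a user can read the grammar, see why a contour was produced, and adjust a weight to change the output, which is the kind of interpretability the abstract contrasts with deep neural networks.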

Supplementary Material

MP4 File (adfp490-2min-promo.mp4)

Short promotional video for SchenkComposer: An Interpretable, Flexible, and Interactive Probabilistic Framework for Melody Generation.


