DOI: 10.1145/3511047.3536412
Extended Abstract

ReStyle-MusicVAE: Enhancing User Control of Deep Generative Music Models with Expert Labeled Anchors

Published: 04 July 2022

Abstract

Deep generative models have emerged as one of the most actively researched topics in artificial intelligence. An area drawing increasing attention is the automatic generation of music, with applications including systems that support and inspire the process of music composition. For these assistive systems to be successful and accepted, it is imperative to give users agency and let them express their personal style in the process of composition.
In this paper, we demonstrate ReStyle-MusicVAE, a system for human-AI co-creation in music composition. More specifically, ReStyle-MusicVAE combines the automatic melody generation and variation approach of MusicVAE and adds semantic control dimensions to further steer the process. To this end, expert-annotated melody lines created for music production are used to define stylistic anchors, which serve as semantic references for interpolation. We present an easy-to-use web app built on top of the Magenta.js JavaScript library and pre-trained MusicVAE checkpoints.
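The core idea above is steering MusicVAE's latent-space interpolation toward expert-labeled stylistic anchors. As an illustrative sketch only (not the authors' implementation; the latent dimensionality, style labels, and function names here are hypothetical), an anchor can be formed by averaging the latent codes of melodies sharing an expert label, and a user melody's code can then be blended toward that anchor:

```python
import numpy as np

def style_anchor(latent_codes):
    """Average the latent codes of expert-labeled melodies into one anchor vector."""
    return np.mean(latent_codes, axis=0)

def steer_toward_anchor(z_user, z_anchor, alpha):
    """Linear interpolation in latent space: alpha=0 keeps the user's melody,
    alpha=1 moves fully to the stylistic anchor."""
    return (1.0 - alpha) * z_user + alpha * z_anchor

rng = np.random.default_rng(0)
latent_dim = 256  # hypothetical latent size; actual size depends on the checkpoint

# Hypothetical: latent codes of ten expert-labeled melodies for one style
style_codes = rng.normal(size=(10, latent_dim))
z_style = style_anchor(style_codes)

z_user = rng.normal(size=latent_dim)       # encoding of the user's melody
z_mix = steer_toward_anchor(z_user, z_style, alpha=0.5)
# z_mix would then be decoded back into a melody by the generative model
```

In the actual system, encoding and decoding are handled by pre-trained MusicVAE checkpoints via Magenta.js; the sketch only shows the anchor-averaging and interpolation arithmetic in the latent space.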

Supplementary Material

MP4 File (UMAP22_ReStyle-MusicVAE_EnhancingUserControlOfDeepGenerativeMusicModelsWithExpertLabeledAnchors.mp4)
Video Presentation


Cited By

  • (2024) And Justice for Art(ists): Metaphorical Design as a Method for Creating Culturally Diverse Human-AI Music Composition Experiences. 2024 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA), 1–4. https://doi.org/10.1109/HORA61326.2024.10550680. Online publication date: 23 May 2024.


Published In

UMAP '22 Adjunct: Adjunct Proceedings of the 30th ACM Conference on User Modeling, Adaptation and Personalization
July 2022, 409 pages
ISBN: 9781450392327
DOI: 10.1145/3511047
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.


Publisher

Association for Computing Machinery

New York, NY, United States


Author Tags

  1. music generation
  2. user control
  3. variational autoencoder

Qualifiers

  • Extended-abstract
  • Research
  • Refereed limited

Conference

UMAP '22

Acceptance Rates

Overall Acceptance Rate 162 of 633 submissions, 26%


