VampNet: Music Generation via Masked Acoustic Token Modeling

doi:10.5281/zenodo.10265299

Published November 4, 2023 | Version v1

Conference paper | Open

We introduce VampNet, a masked acoustic token modeling approach to music synthesis, compression, inpainting, and variation. We use a variable masking schedule during training which allows us to sample coherent music from the model by applying a variety of masking approaches (called prompts) during inference. VampNet is non-autoregressive, leveraging a bidirectional transformer architecture that attends to all tokens in a forward pass. With just 36 sampling passes, VampNet can generate coherent high-fidelity musical waveforms. We show that by prompting VampNet in various ways, we can apply it to tasks like music compression, inpainting, outpainting, continuation, and looping with variation (vamping). Appropriately prompted, VampNet is capable of maintaining style, genre, instrumentation, and other high-level aspects of the music. This flexible prompting capability makes VampNet a powerful music co-creation tool. Code and audio samples are available online.
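The abstract describes prompts as masking patterns applied at inference time: some tokens are kept as context and the rest are left for the model to generate. The sketch below illustrates that idea as boolean masks over a token sequence. It is not the authors' code. The function names, the convention that `True` means "generate this token", and the exact layout of each mask (in particular the periodic mask used here to suggest compression) are assumptions made for illustration; the abstract does not specify them.

```python
from typing import List

# Convention (an assumption for this sketch): True = token is masked and must
# be generated by the model; False = token is kept as conditioning context.


def continuation_mask(n_tokens: int, n_keep: int) -> List[bool]:
    """Keep the first n_keep tokens and generate everything after them."""
    return [i >= n_keep for i in range(n_tokens)]


def inpainting_mask(n_tokens: int, start: int, end: int) -> List[bool]:
    """Generate the gap [start, end) and keep the context on both sides."""
    return [start <= i < end for i in range(n_tokens)]


def outpainting_mask(n_tokens: int, start: int, end: int) -> List[bool]:
    """Keep only [start, end) and generate the material before and after it."""
    return [not (start <= i < end) for i in range(n_tokens)]


def periodic_mask(n_tokens: int, period: int) -> List[bool]:
    """Keep every period-th token and generate the rest.

    Hypothetical stand-in for a compression-style prompt: only a sparse
    subset of tokens is retained and the model fills in the remainder.
    """
    if period < 1:
        raise ValueError("period must be >= 1")
    return [i % period != 0 for i in range(n_tokens)]


if __name__ == "__main__":
    for name, mask in [
        ("continuation", continuation_mask(8, 3)),
        ("inpainting", inpainting_mask(8, 3, 6)),
        ("outpainting", outpainting_mask(8, 3, 6)),
        ("periodic", periodic_mask(8, 4)),
    ]:
        # Print 'M' for masked (to generate) and '.' for kept tokens.
        print(f"{name:>12}: " + "".join("M" if m else "." for m in mask))
```

In a real system these masks would select which acoustic tokens are replaced by a mask token before the sampling passes; how the model then fills them in is outside the scope of this sketch.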

Files

Name	Size	Checksum
000042.pdf	847.0 kB	md5:5a7d848f28c022712c642483260259ee

	All versions	This version
Views	94	94
Downloads	94	94
Data volume	94.0 MB	94.0 MB

Resource type

Conference paper

Publisher

ISMIR

Imprint

Proceedings of the 24th International Society for Music Information Retrieval Conference, 359-366. Milan, Italy.

Conference

International Society for Music Information Retrieval Conference (ISMIR 2023), Milan, Italy, November 5-9, 2023

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: December 5, 2023
Modified: December 10, 2023
