High-Resolution Violin Transcription Using Weak Labels

doi:10.5281/zenodo.10265263

Published November 4, 2023 | Version v1

Conference paper Open

High-Resolution Violin Transcription Using Weak Labels

A descriptive transcription of a violin performance requires detecting not only the notes but also the fine-grained pitch variations, such as vibrato. Most existing deep learning methods for music transcription do not capture these variations and often need frame-level annotations, which are scarce for the violin. In this paper, we propose a novel method for high-resolution violin transcription that can leverage piece-level weak labels for training. Our conformer-based model works on the raw audio waveform and transcribes violin notes and their corresponding pitch deviations with 5.8 ms frame resolution and 10-cent frequency resolution. We demonstrate that our method (1) outperforms generic systems in the proxy tasks of violin transcription and pitch estimation, and (2) can automatically generate new training labels by aligning its feature representations with unseen scores. We share our model along with 34 hours of score-aligned solo violin performance dataset, notably including the 24 Paganini Caprices.

Files

000025.pdf

Files (482.4 kB)

Name	Size	Download all
000025.pdf md5:605ff4514b671583d7983aabfaa59e8c	482.4 kB	Preview Download

Views

Downloads

Show more details

	All versions	This version
Views	74	74
Downloads	64	64
Data volume	34.2 MB	34.2 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

ISMIR

Imprint

Proceedings of the 24th International Society for Music Information Retrieval Conference, 223-230. Milan, Italy.

Conference

International Society for Music Information Retrieval Conference (ISMIR 2023) , Milan, Italy, November 5-9, 2023

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: December 5, 2023
Modified: December 5, 2023

High-Resolution Violin Transcription Using Weak Labels

Creators

Description

Files

000025.pdf

Files (482.4 kB)