short-paper

Music-Graph2Vec: An Efficient Method for Embedding Pitch Segment

Authors:

Yuanzhe CaiAuthors Info & Claims

MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in Asia

Article No.: 97, Pages 1 - 5

https://doi.org/10.1145/3595916.3626740

Published: 01 January 2024 Publication History

Abstract

Learning low-dimensional continuous vector representation for short pitch segment extracted from songs is has been confirmed to contain tonal features of music, which is key to melody modeling that can be utilized in many music investigations, such as genre classification, emotion classification, and music retrieval, and so on. The skip-gram version of Word2Vec is ubiquitous, and widely used approach for music pitch segment embedding, but it poorly scales to large data sets due to its extremely long training time. In this paper, we propose a novel efficient graph-based embedding method, named Music-Graph2Vec, to tackle this concern. This approach converts music files into graphs, extracts the rhythmic sequence through random walking, and trains the rhythmic embedding model using skip-gram. Experimental results demonstrate that Music-Graph2Vec outperforms Word2Vec in training rhythmic embedding, with the advantage of being 55 times faster on the top-MAGD dataset (2,134.7s for Word2Vec and 38.9s for Music-Graph2Vec), with the same accuracy for Word2Vec in terms of music genre classification.

References

[1]

Jean-Pierre Briot, Gaëtan Hadjeres, and François-David Pachet. 2017. Deep learning techniques for music generation–a survey. arXiv preprint arXiv:1709.01620 (2017).

[2]

Gino Brunner, Yuyi Wang, Roger Wattenhofer, and Sumu Zhao. 2018. Symbolic music genre transfer with cyclegan. In 2018 ieee 30th international conference on tools with artificial intelligence (ictai). IEEE, 786–793.

[3]

Edward M Burns. 1999. Intervals, scales, and tuning. In The psychology of music. Elsevier, 215–264.

[4]

Ching-Hua Chuan, Kat Agres, and Dorien Herremans. 2020. From context to concept: exploring semantic relationships in music with word2vec. Neural Computing and Applications 32 (2020).

[5]

Padraig Cunningham and Sarah Jane Delany. 2021. k-Nearest neighbour classifiers-A Tutorial. ACM computing surveys (CSUR) 54, 6 (2021), 1–25.

[6]

James J Deng, Clement HC Leung, Alfredo Milani, and Li Chen. 2015. Emotional states associated with music: Classification, prediction of changes, and consideration in recommendation. ACM Transactions on Interactive Intelligent Systems (TiiS) 5, 1 (2015), 1–36.

Digital Library

[7]

Michael Good 2001. MusicXML: An internet-friendly format for sheet music. In Xml conference and expo. Citeseer, 03–04.

[8]

Andrew Hankinson, Perry Roland, and Ichiro Fujinaga. 2011. The Music Encoding Initiative as a Document-Encoding Framework. In ISMIR. 293–298.

[9]

Dorien Herremans and Ching-Hua Chuan. 2017. Modeling musical context with word2vec. arXiv preprint arXiv:1706.09088 (2017).

[10]

Tatsunori Hirai and Shun Sawada. 2019. Melody2vec: Distributed representations of melodic phrases based on melody segmentation. Journal of Information Processing 27 (2019), 278–286.

[11]

Cheng-Zhi Anna Huang, David Duvenaud, and Krzysztof Z Gajos. 2016. Chordripple: Recommending chords to help novice composers go beyond the ordinary. In Proceedings of the 21st international conference on intelligent user interfaces. 241–250.

Digital Library

[12]

Cheng-Zhi Anna Huang, Ashish Vaswani, Jakob Uszkoreit, Noam Shazeer, Ian Simon, Curtis Hawthorne, Andrew M Dai, Matthew D Hoffman, Monica Dinculescu, and Douglas Eck. 2018. Music transformer. arXiv preprint arXiv:1809.04281 (2018).

[13]

Shulei Ji, Jing Luo, and Xinyu Yang. 2020. A comprehensive survey on deep music generation: Multi-level representations, algorithms, evaluations, and future directions. arXiv preprint arXiv:2011.06801 (2020).

[14]

Sephora Madjiheurem, Lizhen Qu, and Christian Walder. 2016. Chord2vec: Learning musical chord embeddings. In Proceedings of the constructive machine learning workshop at 30th conference on neural information processing systems (NIPS2016), Barcelona, Spain.

[15]

The Midi Man. 2015. The Largest MIDI Collection on the Internet, collected and sorted diligently by yours truly.Web. https://www.reddit.com/r/datasets/comments/3akhxy/the_largest_midi_collection_on_the_internet/

[16]

MIDIWORLD. 2022. midiworld. Web. https://midiworld.com

[17]

Tomas Mikolov, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781 (2013).

[18]

Sageev Oore, Ian Simon, Sander Dieleman, Douglas Eck, and Karen Simonyan. 2020. This time with feeling: Learning expressive musical performance. Neural Computing and Applications 32 (2020), 955–967.

Digital Library

[19]

Nikki Pelchat and Craig M Gelowitz. 2020. Neural network music genre classification. Canadian Journal of Electrical and Computer Engineering 43, 3 (2020), 170–173.

[20]

Markus Schedl. 2016. The lfm-1b dataset for music retrieval and recommendation. In Proceedings of the 2016 ACM on international conference on multimedia retrieval. 103–110.

Digital Library

[21]

Markus Schedl. 2019. Deep learning in music recommendation systems. Frontiers in Applied Mathematics and Statistics (2019), 44.

[22]

Markus Schedl, Hamed Zamani, Ching-Wei Chen, Yashar Deldjoo, and Mehdi Elahi. 2018. Current challenges and visions in music recommender systems research. International Journal of Multimedia Information Retrieval 7 (2018), 95–116.

[23]

Hendrik Schreiber. 2015. Improving Genre Annotations for the Million Song Dataset. In ISMIR. Málaga.

[24]

Eerola T Toiviainen P. 2016. MIDI toolbox 1.1. Web. https://github.com/miditoolbox/

Cited By

Index Terms

Music-Graph2Vec: An Efficient Method for Embedding Pitch Segment
1. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Multimedia and multimodal retrieval
        Music retrieval

Recommendations

Music Genre Classification and Feature Comparison using ML
ICMLT '22: Proceedings of the 2022 7th International Conference on Machine Learning Technologies

An essential feature of the music is the genre, which can be considered a high-level description of an individual piece of music. In this sense, genre as a music feature is similar to typical descriptive features from the ML perspective. Although a ...
Aggregate features and ADABOOST for music classification

We present an algorithm that predicts musical genre and artist from an audio waveform. Our method uses the ensemble learner A DA B OOST to select from a set of audio features that have been extracted from segmented audio and then aggregated. Our ...
Genre classification of symbolic music with SMBGT
PETRA '13: Proceedings of the 6th International Conference on PErvasive Technologies Related to Assistive Environments

Automatic music genre classification is a task that has attracted the interest of the music community for more than two decades. Music can be of high importance within the area of assistive technologies as it can be seen as an assistive technology with ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in Asia

December 2023

745 pages

ISBN:9798400702051

DOI:10.1145/3595916

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 January 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper
Research
Refereed limited

Conference

MMAsia '23

Sponsor:

SIGMM

MMAsia '23: ACM Multimedia Asia

December 6 - 8, 2023

Tainan, Taiwan

Acceptance Rates

Overall Acceptance Rate 59 of 204 submissions, 29%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
86
Total Downloads

Downloads (Last 12 months)52
Downloads (Last 6 weeks)6

Reflects downloads up to 28 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten