Abstract
While there are perhaps millions of MIDI files available over the Internet, it is difficult to find performances of a particular piece because well labeled metadata and indexes are unavailable. We address the particular problem of finding performances of compositions for piano, which is different from often-studied problems of Query-by-Humming and Music Fingerprinting. Our MidiFind system is designed to search a million MIDI files with high precision and recall. By using a hybrid search strategy, it runs more than 1000 times faster than naive competitors, and by using a combination of bag-of-words and enhanced Levenshtein distance methods for similarity, our system achieves a precision of 99.5 % and recall of 89.8 %.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Crestmuse. http://www.crestmuse.jp/pedb/
Levenshtein distance. http://en.wikipedia.org/wiki/Levenshtein_distance
Bastian, M., Heymann, S., Jacomy, M.: Gephi: an open source software for exploring and manipulating networks. In: International AAAI Conference on Weblogs and Social Media (2009)
Ciaccia, P., Patella, M., Zezula, P.: M-tree: an effcient access method for similarity search in metric spaces. In: Proceedings of the 23rd Athens International Conference on VLDB, pp. 426–425 (1997)
Dannenberg, R.B., Birmingham, W.P., Pardo, B., Hu, N., Meek, C., Tzanetakis, G.: A comparative evaluation of search techniques for query-by-humming using the musart testbed. J. Am. Soc. Inf. Sci. Technol. 58(5), 687–701 (2007)
Ding, H., Trajcevski, G., Scheuermann, P., Wang, X., Keogh, E.: Querying and mining of time series data: experimental comparison of representations and distance measures. Proc. VLDB Endow. 1(2), 1542–1552 (2008)
Ellis, D., Whitman, B., Jehan, T., Lamere, P.: The echo nest musical fingerprint (ENMFP). In: International Symposium on Music Information Retrieval (2010)
Ghias, A., Logan, J., Chamberlin, D., Smith, B.C.: Query by humming: musical information retrieval in an audio database. In: ACM Multimedia, pp. 231–236 (1995)
Gionis, A., Indyk, P., Motwani, R.: Similarity search in high dimensions via hashing. In: Proceedings of the 25th International Conference on Very Large Data Bases, VLDB ’99, pp. 518–529. Morgan Kaufmann Publishers Inc., San Francisco (1999)
Haitsma, J., Kalker, T.: A highly robust audio fingerprinting system. In: International Symposium on Music Information Retrieval (2002)
Jang, R., Lee, H.: Hierarchical filtering method for content-based music retrieval via acoustic input. In: Proceedings of the ACM Multimedia, pp. 401–410. ACM Press (2001)
Krumhansl, C.L.: Cognitive Foundations of Musical Pitch. Oxford University Press, New York (2001)
Lodhi, H., Saunders, C., Shawe-Taylor, J., Cristianini, N., Watkins, C.: Text classification using string kernels. J. Mach. Learn. Res. 2, 419–444 (2002)
Martin, S., Brown, W.M., Klavans, R., Boyack, K.W.: Openord: an open-source toolbox for large graph layout. In: IS&T/SPIE Electronic Imaging, pp. 786806–786806. International Society for Optics and Photonics (2011)
Mazzoni, D., Dannenberg, R.B.: Melody matching directly from audio. In: International Symposium on Music Information Retrieval, pp. 17–18 (2001)
Mcnab, R.J., Smith, L.A., Bainbridge, D., Witten, I.H.: The new zealand digital library MELody inDEX. D-Lib Mag. 3(5), 4–15 (1997)
Paass, G., Leopold, E., Larson, M., Kindermann, J., Eickeler, S.: SVM classification using sequences of phonemes and syllables. In: Elomaa, T., Mannila, H., Toivonen, H. (eds.) PKDD 2002. LNCS (LNAI), vol. 2431, p. 373. Springer, Heidelberg (2002)
Papapetrou, P., Athitsos, V., Potamias, M., Kollios, G., Gunopulos, D.: Embedding-based subsequence matching in time-series databases. ACM Trans. Database Syst. (TODS) 36(3), 17 (2011)
Scholkopf, B., Smola, A., Müller, K.R.: Kernel principal component analysis. In: Schölkopf, B., Burges, C.J.C., Smola, A.J. (eds.) Advances in Kernel Methods - Support Vector Learning, pp. 327–352. MIT Press, Cambridge (1999)
Shlens, J.: A tutorial on principal component analysis. In: Systems Neurobiology Laboratory, Salk Institute for Biological Studies (2005)
Uitdenbogerd, A., Zobel, J.: Manipulation of music for melody matching. In: ACM Multimedia, pp. 235–240 (1998)
Uitdenbogerd, A., Zobel, J.: Melodic matching techniques for large music databases. In: Proceedings of the Seventh ACM International Conference on Multimedia (Part 1), Multimedia ’99, pp. 57–66. ACM, New York (1999)
Widmer, G., Flossmann, S., Grachten, M.: YQX plays Chopin. AI Mag. 30(3), 35–48 (2009)
Widmer, G., Tobudic, A.: Playing Mozart by analogy: learning multi-level timing and dynamics strategies. J. New Music Res. 32(3), 259–268 (2003)
Zhu, Y., Shasha, D.: Warping indexes with envelope transforms for query by humming. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data, SIGMOD ’03, pp. 181–192. ACM, New York (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Xia, G., Huang, T., Ma, Y., Dannenberg, R., Faloutsos, C. (2014). MidiFind: Similarity Search and Popularity Mining in Large MIDI Databases. In: Aramaki, M., Derrien, O., Kronland-Martinet, R., Ystad, S. (eds) Sound, Music, and Motion. CMMR 2013. Lecture Notes in Computer Science(), vol 8905. Springer, Cham. https://doi.org/10.1007/978-3-319-12976-1_17
Download citation
DOI: https://doi.org/10.1007/978-3-319-12976-1_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12975-4
Online ISBN: 978-3-319-12976-1
eBook Packages: Computer ScienceComputer Science (R0)