Abstract
So far, few cover song identification systems aim at practical application. On one hand, existing sequence alignment methods achieve a high precision at the expense of high time cost. On the other hand, for large-scale identification, researchers attempt to exploit fixed low-dimensional features to reduce time cost. However, such highly compressed representations often result in a worse accuracy. In this paper, we propose an efficient two-layer system which takes advantage of the two kinds of methods. The proposed approach outperforms existing approaches and achieves high precision with relatively small time complexity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bertin-Mahieux, T., Ellis, D.P.W.: Large-scale cover song recognition using hashed chroma landmarks. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 117–120 (2011)
Bertin-Mahieux, T., Ellis, D.P.: Large-scale cover song recognition using the 2D Fourier transform magnitude. In: International Society for Music Information Retrieval Conference (2012)
Bertin-Mahieux, T., Ellis, D.P., Whitman, B., Lamere, P.: The million song dataset. In: International Society for Music Information Retrieval Conference (2011)
Chen, N., Li, W., Xiao, H.: Fusing similarity functions for cover song identification. Multimed. Tools Appl., 1–24 (2017)
Ellis, D.P., Poliner, G.E.: Identifying cover songs with chroma features and dynamic programming beat tracking. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2007)
Foster, P., Dixon, S., Klapuri, A.: Identifying cover songs using information-theoretic measures of similarity. IEEE/ACM Trans. Audio Speech Lang. Process. 23(6), 993–1005 (2015)
Fujishima, T.: Realtime chord recognition of musical sound: a system using common Lisp music. In: ICMC, pp. 464–467 (1999)
Gómez, E.: Tonal description of polyphonic audio for music content processing. INFORMS J. Comput. 18, 294–304 (2006)
Humphrey, E.J., Nieto, O., Bello, J.P.: Data driven and discriminative projections for large-scale cover song identification. In: International Society for Music Information Retrieval Conference, pp. 149–154 (2013)
Julia, J.S.: Music similarity based on sequences of descriptors: tonal features applied to audio cover song identification. Master’s thesis (2007)
Khadkevich, M., Omologo, M.: Large-scale cover song identification using chord profiles. In: International Society for Music Information Retrieval Conference, pp. 233–238 (2013)
Martin, B., Brown, D.G., Hanna, P., Ferraro, P.: Blast for audio sequences alignment: a fast scalable cover identification. In: International Society for Music Information Retrieval Conference (2012)
Müller, M.: Information Retrieval for Music and Motion. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-74048-3
Müller, M., Ewert, S.: Chroma Toolbox: MATLAB implementations for extracting variants of chroma-based audio features. In: International Society for Music Information Retrieval Conference, Miami, USA (2011)
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
Ravuri, S., Ellis, D.P.: Cover song detection: from high scores to general classification. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 65–68 (2010)
Seetharaman, P., Rafii, Z.: Cover song identification with 2D Fourier transform sequences. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2017)
Serrà, J., Kantz, H., Serra, X.: Predictability of music descriptor time series and its application to cover song detection. IEEE Trans. Audio Speech Lang. Process. 20(2), 514–525 (2012)
Serrà, J., Serra, X., Andrzejak, R.G.: Cross recurrence quantification for cover song identification. New J. Phys. 11(9), 093017 (2009)
Serrà, J.: Identification of versions of the same musical composition by processing audio descriptions. Ph.D. thesis (2011)
Serrà, J., Gómez, E., Herrera, P.: Audio cover song identification and similarity: background, approaches, evaluation, and beyond. In: Raś, Z.W., Wieczorkowska, A.A. (eds.) Advances in Music Information Retrieval. SCI. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-11674-2_14
Serrà, J., Gómez, E., Herrera, P., Serra, X.: Chroma binary similarity and local alignment applied to cover song identification. IEEE Trans. Audio Speech Lang. Process. 16(6), 1138–1151 (2008)
Silva, D.F., Yeh, C.C.M., Batista, G.E.A.P.A., Keogh, E., et al.: SiMPle: assessing music similarity using subsequences joins. In: International Society for Music Information Retrieval Conference (2016)
Silva, D.F., Souza, V.M.A.D., Batista, G.E.A.P.A., et al.: Music shapelets for fast cover song regognition. In: International Society for Music Information Retrieval Conference (2015)
Tralie, C., Paul, B.: Cover song identification with timbral shape sequences. In: 16th International Society for Music Information Retrieval Conference (2015)
Acknowledgments
This work was supported by the Natural Science Foundation of China (No. 61370116).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Xu, X., Cheng, Y., Chen, X., Yang, D. (2018). Efficient Two-Layer Model Towards Cover Song Identification. In: Schoeffmann, K., et al. MultiMedia Modeling. MMM 2018. Lecture Notes in Computer Science(), vol 10705. Springer, Cham. https://doi.org/10.1007/978-3-319-73600-6_11
Download citation
DOI: https://doi.org/10.1007/978-3-319-73600-6_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73599-3
Online ISBN: 978-3-319-73600-6
eBook Packages: Computer ScienceComputer Science (R0)