Efficient Two-Layer Model Towards Cover Song Identification

Xu, Xiaoshuo; Cheng, Yao; Chen, Xiaoou; Yang, Deshun

doi:10.1007/978-3-319-73600-6_11

Xiaoshuo Xu²¹,
Yao Cheng²¹,
Xiaoou Chen²¹ &
…
Deshun Yang²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10705))

Included in the following conference series:

International Conference on Multimedia Modeling

2767 Accesses
1 Citations

Abstract

So far, few cover song identification systems aim at practical application. On one hand, existing sequence alignment methods achieve a high precision at the expense of high time cost. On the other hand, for large-scale identification, researchers attempt to exploit fixed low-dimensional features to reduce time cost. However, such highly compressed representations often result in a worse accuracy. In this paper, we propose an efficient two-layer system which takes advantage of the two kinds of methods. The proposed approach outperforms existing approaches and achieves high precision with relatively small time complexity.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Bertin-Mahieux, T., Ellis, D.P.W.: Large-scale cover song recognition using hashed chroma landmarks. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 117–120 (2011)
Google Scholar
Bertin-Mahieux, T., Ellis, D.P.: Large-scale cover song recognition using the 2D Fourier transform magnitude. In: International Society for Music Information Retrieval Conference (2012)
Google Scholar
Bertin-Mahieux, T., Ellis, D.P., Whitman, B., Lamere, P.: The million song dataset. In: International Society for Music Information Retrieval Conference (2011)
Google Scholar
Chen, N., Li, W., Xiao, H.: Fusing similarity functions for cover song identification. Multimed. Tools Appl., 1–24 (2017)
Google Scholar
Ellis, D.P., Poliner, G.E.: Identifying cover songs with chroma features and dynamic programming beat tracking. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2007)
Google Scholar
Foster, P., Dixon, S., Klapuri, A.: Identifying cover songs using information-theoretic measures of similarity. IEEE/ACM Trans. Audio Speech Lang. Process. 23(6), 993–1005 (2015)
Article Google Scholar
Fujishima, T.: Realtime chord recognition of musical sound: a system using common Lisp music. In: ICMC, pp. 464–467 (1999)
Google Scholar
Gómez, E.: Tonal description of polyphonic audio for music content processing. INFORMS J. Comput. 18, 294–304 (2006)
Article Google Scholar
Humphrey, E.J., Nieto, O., Bello, J.P.: Data driven and discriminative projections for large-scale cover song identification. In: International Society for Music Information Retrieval Conference, pp. 149–154 (2013)
Google Scholar
Julia, J.S.: Music similarity based on sequences of descriptors: tonal features applied to audio cover song identification. Master’s thesis (2007)
Google Scholar
Khadkevich, M., Omologo, M.: Large-scale cover song identification using chord profiles. In: International Society for Music Information Retrieval Conference, pp. 233–238 (2013)
Google Scholar
Martin, B., Brown, D.G., Hanna, P., Ferraro, P.: Blast for audio sequences alignment: a fast scalable cover identification. In: International Society for Music Information Retrieval Conference (2012)
Google Scholar
Müller, M.: Information Retrieval for Music and Motion. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-74048-3
Book Google Scholar
Müller, M., Ewert, S.: Chroma Toolbox: MATLAB implementations for extracting variants of chroma-based audio features. In: International Society for Music Information Retrieval Conference, Miami, USA (2011)
Google Scholar
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)
MathSciNet MATH Google Scholar
Ravuri, S., Ellis, D.P.: Cover song detection: from high scores to general classification. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 65–68 (2010)
Google Scholar
Seetharaman, P., Rafii, Z.: Cover song identification with 2D Fourier transform sequences. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2017)
Google Scholar
Serrà, J., Kantz, H., Serra, X.: Predictability of music descriptor time series and its application to cover song detection. IEEE Trans. Audio Speech Lang. Process. 20(2), 514–525 (2012)
Google Scholar
Serrà, J., Serra, X., Andrzejak, R.G.: Cross recurrence quantification for cover song identification. New J. Phys. 11(9), 093017 (2009)
Article Google Scholar
Serrà, J.: Identification of versions of the same musical composition by processing audio descriptions. Ph.D. thesis (2011)
Google Scholar
Serrà, J., Gómez, E., Herrera, P.: Audio cover song identification and similarity: background, approaches, evaluation, and beyond. In: Raś, Z.W., Wieczorkowska, A.A. (eds.) Advances in Music Information Retrieval. SCI. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-11674-2_14
Google Scholar
Serrà, J., Gómez, E., Herrera, P., Serra, X.: Chroma binary similarity and local alignment applied to cover song identification. IEEE Trans. Audio Speech Lang. Process. 16(6), 1138–1151 (2008)
Article Google Scholar
Silva, D.F., Yeh, C.C.M., Batista, G.E.A.P.A., Keogh, E., et al.: SiMPle: assessing music similarity using subsequences joins. In: International Society for Music Information Retrieval Conference (2016)
Google Scholar
Silva, D.F., Souza, V.M.A.D., Batista, G.E.A.P.A., et al.: Music shapelets for fast cover song regognition. In: International Society for Music Information Retrieval Conference (2015)
Google Scholar
Tralie, C., Paul, B.: Cover song identification with timbral shape sequences. In: 16th International Society for Music Information Retrieval Conference (2015)
Google Scholar

Download references

Acknowledgments

This work was supported by the Natural Science Foundation of China (No. 61370116).

Author information

Authors and Affiliations

Institute of Computer Science and Technology, Peking University, 128 Zhongguancun North Street, Haidian District, Beijing, 100871, People’s Republic of China
Xiaoshuo Xu, Yao Cheng, Xiaoou Chen & Deshun Yang

Authors

Xiaoshuo Xu
View author publications
You can also search for this author in PubMed Google Scholar
Yao Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoou Chen
View author publications
You can also search for this author in PubMed Google Scholar
Deshun Yang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaoshuo Xu .

Editor information

Editors and Affiliations

Alpen-Adria-Universität Klagenfurt, Klagenfurt, Austria
Klaus Schoeffmann
Chulalongkorn University, Bangkok, Thailand
Thanarat H. Chalidabhongse
City University of Hong Kong, Hong Kong, China
Chong Wah Ngo
Chulalongkorn University, Bangkok, Thailand
Supavadee Aramvith
Dublin City University, Dublin, Ireland
Noel E. O’Connor
Gwangju Institute of Science and Technology, Gwangju, Korea (Republic of)
Yo-Sung Ho
Tampere University of Technology, Tampere, Finland
Moncef Gabbouj
Rutgers University, Piscataway, New Jersey, USA
Ahmed Elgammal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xu, X., Cheng, Y., Chen, X., Yang, D. (2018). Efficient Two-Layer Model Towards Cover Song Identification. In: Schoeffmann, K., et al. MultiMedia Modeling. MMM 2018. Lecture Notes in Computer Science(), vol 10705. Springer, Cham. https://doi.org/10.1007/978-3-319-73600-6_11

Download citation

DOI: https://doi.org/10.1007/978-3-319-73600-6_11
Published: 13 January 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-73599-3
Online ISBN: 978-3-319-73600-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics