Skip to main content

Efficient Two-Layer Model Towards Cover Song Identification

  • Conference paper
  • First Online:
MultiMedia Modeling (MMM 2018)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10705))

Included in the following conference series:

Abstract

So far, few cover song identification systems aim at practical application. On one hand, existing sequence alignment methods achieve a high precision at the expense of high time cost. On the other hand, for large-scale identification, researchers attempt to exploit fixed low-dimensional features to reduce time cost. However, such highly compressed representations often result in a worse accuracy. In this paper, we propose an efficient two-layer system which takes advantage of the two kinds of methods. The proposed approach outperforms existing approaches and achieves high precision with relatively small time complexity.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://github.com/MTG/essentia/.

  2. 2.

    https://labrosa.ee.columbia.edu/projects/coversongs/covers80/.

  3. 3.

    https://labrosa.ee.columbia.edu/millionsong/secondhand.

References

  1. Bertin-Mahieux, T., Ellis, D.P.W.: Large-scale cover song recognition using hashed chroma landmarks. In: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 117–120 (2011)

    Google Scholar 

  2. Bertin-Mahieux, T., Ellis, D.P.: Large-scale cover song recognition using the 2D Fourier transform magnitude. In: International Society for Music Information Retrieval Conference (2012)

    Google Scholar 

  3. Bertin-Mahieux, T., Ellis, D.P., Whitman, B., Lamere, P.: The million song dataset. In: International Society for Music Information Retrieval Conference (2011)

    Google Scholar 

  4. Chen, N., Li, W., Xiao, H.: Fusing similarity functions for cover song identification. Multimed. Tools Appl., 1–24 (2017)

    Google Scholar 

  5. Ellis, D.P., Poliner, G.E.: Identifying cover songs with chroma features and dynamic programming beat tracking. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2007)

    Google Scholar 

  6. Foster, P., Dixon, S., Klapuri, A.: Identifying cover songs using information-theoretic measures of similarity. IEEE/ACM Trans. Audio Speech Lang. Process. 23(6), 993–1005 (2015)

    Article  Google Scholar 

  7. Fujishima, T.: Realtime chord recognition of musical sound: a system using common Lisp music. In: ICMC, pp. 464–467 (1999)

    Google Scholar 

  8. Gómez, E.: Tonal description of polyphonic audio for music content processing. INFORMS J. Comput. 18, 294–304 (2006)

    Article  Google Scholar 

  9. Humphrey, E.J., Nieto, O., Bello, J.P.: Data driven and discriminative projections for large-scale cover song identification. In: International Society for Music Information Retrieval Conference, pp. 149–154 (2013)

    Google Scholar 

  10. Julia, J.S.: Music similarity based on sequences of descriptors: tonal features applied to audio cover song identification. Master’s thesis (2007)

    Google Scholar 

  11. Khadkevich, M., Omologo, M.: Large-scale cover song identification using chord profiles. In: International Society for Music Information Retrieval Conference, pp. 233–238 (2013)

    Google Scholar 

  12. Martin, B., Brown, D.G., Hanna, P., Ferraro, P.: Blast for audio sequences alignment: a fast scalable cover identification. In: International Society for Music Information Retrieval Conference (2012)

    Google Scholar 

  13. Müller, M.: Information Retrieval for Music and Motion. Springer, Heidelberg (2007). https://doi.org/10.1007/978-3-540-74048-3

    Book  Google Scholar 

  14. Müller, M., Ewert, S.: Chroma Toolbox: MATLAB implementations for extracting variants of chroma-based audio features. In: International Society for Music Information Retrieval Conference, Miami, USA (2011)

    Google Scholar 

  15. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E.: Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011)

    MathSciNet  MATH  Google Scholar 

  16. Ravuri, S., Ellis, D.P.: Cover song detection: from high scores to general classification. In: IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 65–68 (2010)

    Google Scholar 

  17. Seetharaman, P., Rafii, Z.: Cover song identification with 2D Fourier transform sequences. In: IEEE International Conference on Acoustics, Speech and Signal Processing (2017)

    Google Scholar 

  18. Serrà, J., Kantz, H., Serra, X.: Predictability of music descriptor time series and its application to cover song detection. IEEE Trans. Audio Speech Lang. Process. 20(2), 514–525 (2012)

    Google Scholar 

  19. Serrà, J., Serra, X., Andrzejak, R.G.: Cross recurrence quantification for cover song identification. New J. Phys. 11(9), 093017 (2009)

    Article  Google Scholar 

  20. Serrà, J.: Identification of versions of the same musical composition by processing audio descriptions. Ph.D. thesis (2011)

    Google Scholar 

  21. Serrà, J., Gómez, E., Herrera, P.: Audio cover song identification and similarity: background, approaches, evaluation, and beyond. In: Raś, Z.W., Wieczorkowska, A.A. (eds.) Advances in Music Information Retrieval. SCI. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-11674-2_14

    Google Scholar 

  22. Serrà, J., Gómez, E., Herrera, P., Serra, X.: Chroma binary similarity and local alignment applied to cover song identification. IEEE Trans. Audio Speech Lang. Process. 16(6), 1138–1151 (2008)

    Article  Google Scholar 

  23. Silva, D.F., Yeh, C.C.M., Batista, G.E.A.P.A., Keogh, E., et al.: SiMPle: assessing music similarity using subsequences joins. In: International Society for Music Information Retrieval Conference (2016)

    Google Scholar 

  24. Silva, D.F., Souza, V.M.A.D., Batista, G.E.A.P.A., et al.: Music shapelets for fast cover song regognition. In: International Society for Music Information Retrieval Conference (2015)

    Google Scholar 

  25. Tralie, C., Paul, B.: Cover song identification with timbral shape sequences. In: 16th International Society for Music Information Retrieval Conference (2015)

    Google Scholar 

Download references

Acknowledgments

This work was supported by the Natural Science Foundation of China (No. 61370116).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiaoshuo Xu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xu, X., Cheng, Y., Chen, X., Yang, D. (2018). Efficient Two-Layer Model Towards Cover Song Identification. In: Schoeffmann, K., et al. MultiMedia Modeling. MMM 2018. Lecture Notes in Computer Science(), vol 10705. Springer, Cham. https://doi.org/10.1007/978-3-319-73600-6_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-73600-6_11

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-73599-3

  • Online ISBN: 978-3-319-73600-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics