Skip to main content
Log in

Alleviating the cold-start playlist continuation in music recommendation using latent semantic indexing

  • Regular Paper
  • Published:
International Journal of Multimedia Information Retrieval Aims and scope Submit manuscript

Abstract

The cold-start problem is a grand challenge in music recommender systems aiming to provide users with a better and continuous music listening experience. When a new user creates a playlist, the recommender system remains in a cold-start state until enough information is collected to identify the user’s musical taste. In such cases, playlist metadata, such as title or description, have been successfully employed to create intent recommendation models. In this paper, we propose a multi-stage retrieval system utilizing user-generated titles to alleviate the cold-start problem in automatic playlist continuation. Initially, playlists are clustered to form a music documents collection. Then, the system applies latent semantic indexing to the collection to discover hidden patterns between tracks and playlist titles. For similarity calculation, singular value decomposition is performed on a track-cluster matrix. When the system is given a new playlist as a cold-start instance, it first retrieves neighboring clusters and then produces a ranked list of recommendations by weighting candidate tracks in these clusters. We scrutinize the performance of the proposed system on a large, real-world music playlists dataset supplied by the Spotify platform. Our empirical results show that the proposed system outperforms the state-of-the-art approaches and improves recommendation accuracy significantly in three primary evaluation metrics.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Availability of data and material

The Million Playlist Dataset is publicly available at the time of this writing and can be accessed by registering on AIcrowd.

Notes

  1. https://www.spotify.com

  2. https://radimrehurek.com/gensim/

  3. https://en.wikipedia.org/wiki/Alan_Jackson

  4. https://en.wikipedia.org/wiki/George_Strait

  5. https://www.aicrowd.com

  6. https://github.com/aliyurekli/lsi-cold-start-apc

References

  1. Aizawa A (2003) An information-theoretic perspective of tf-idf measures. Inf Process Manage 39(1):45–65. https://doi.org/10.1016/S0306-4573(02)00021-3

    Article  MATH  Google Scholar 

  2. Antenucci S, Boglio S, Chioso E, Dervishaj E, Kang S, Scarlatti T, Dacrema MF (2018) Artist-driven layering and user’s behaviour impact on recommendations in a playlist continuation scenario. In: Proceedings of the ACM Recommender Systems Challenge 2018, Vancouver, BC, Canada, pp 4:1–4:6

  3. Berry MW, Dumais ST, O’Brien GW (1995) Using linear algebra for intelligent information retrieval. SIAM Rev 37(4):573–595

    Article  MathSciNet  Google Scholar 

  4. Bollen D, Knijnenburg BP, Willemsen MC, Graus M (2010) Understanding choice overload in recommender systems. In: Proceedings of the Fourth ACM Conference on Recommender Systems, Barcelona, Spain, p 63-70

  5. Bonnin G, Jannach D (2014) Automated generation of music playlists: Survey and experiments. ACM Comput Surv 47(2):1–35. https://doi.org/10.1145/2652481

    Article  Google Scholar 

  6. Chen CW, Lamere P, Schedl M, Zamani H (2018) Recsys challenge 2018: Automatic music playlist continuation. In: Proceedings of the 12th ACM Conference on Recommender Systems, Vancouver, BC, Canada, p 527-528

  7. Cremonesi P, Turrin R, Airoldi F (2011) Hybrid algorithms for recommending new items. In: Proceedings of the 2nd International Workshop on Information Heterogeneity and Fusion in Recommender Systems, Chicago, Illinois, p 33-40

  8. De Gemmis M, Lops P, Musto C, Narducci F, Semeraro G (2015) Semantics-aware content-based recommender systems. In: Ricci F, Rokach L, Shapira B (eds) Recommender systems handbook. Springer, US, Boston, MA, pp 119–159

    Chapter  Google Scholar 

  9. Faggioli G, Polato M, Aiolli F (2018) Efficient similarity based methods for the playlist continuation task. In: Proceedings of the ACM Recommender Systems Challenge 2018, Vancouver, BC, Canada, pp 15:1–15:6

  10. Ferraro A (2019) Music cold-start and long-tail recommendation: Bias in deep representations. In: Proceedings of the 13th ACM Conference on Recommender Systems, Copenhagen, Denmark, p 586-590

  11. Ferraro A, Bogdanov D, Yoon J, Kim K, Serra X (2018) Automatic playlist continuation using a hybrid recommender system combining features from text and audio. In: Proceedings of the ACM Recommender Systems Challenge 2018, Vancouver, BC, Canada, pp 1–5

  12. Isinkaye F, Folajimi Y, Ojokoh B (2015) Recommendation systems: Principles, methods and evaluation. Egypt Informatics J 16(3):261–273. https://doi.org/10.1016/j.eij.2015.06.005

    Article  Google Scholar 

  13. Järvelin K, Kekäläinen J (2002) Cumulated gain-based evaluation of IR techniques. ACM Trans Inf Syst 20(4):422–446. https://doi.org/10.1145/582415.582418

    Article  Google Scholar 

  14. Kallumadi S, Mitra B, Iofciu T (2018) A line in the sand: Recommendation or ad-hoc retrieval? In: Proceedings of the ACM Recommender Systems Challenge 2018, Vancouver, Canada, pp 7:1–7:6

  15. Kamehkhosh I, Bonnin G, Jannach D (2020) Effects of recommendations on the playlist creation behavior of users. User Model User-Adapt Interaction 30(2):285–322. https://doi.org/10.1007/s11257-019-09237-4

    Article  Google Scholar 

  16. Kaminskas M, Bridge D (2016) Diversity, serendipity, novelty, and coverage: A survey and empirical analysis of beyond-accuracy objectives in recommender systems. ACM Trans Interact Intell Syst 7(1):1–42. https://doi.org/10.1145/2926720

    Article  Google Scholar 

  17. Kaya M, Bridge D (2018) Automatic playlist continuation using subprofile-aware diversification. In: Proceedings of the ACM Recommender Systems Challenge 2018, Vancouver, BC, Canada, pp 1:1–1:6

  18. Kelen DM, Berecz D, Béres F, Benczúr AA (2018) Efficient k-nn for playlist continuation. In: Proceedings of the ACM Recommender Systems Challenge (2018) Vancouver. BC, Canada

  19. Kim J, Won M, Liem CCS, Hanjalic A (2018) Towards seed-free music playlist generation: Enhancing collaborative filtering with playlist title information. In: Proceedings of the ACM Recommender Systems Challenge (2018) Vancouver. BC, Canada

  20. Kuhn A, Ducasse S, Girba T (2005) Enriching reverse engineering with semantic clustering. In: Proceedings of the 12th Working Conference on Reverse Engineering, p 133-142

  21. Lam XN, Vu T, Le TD, Duong AD (2008) Addressing cold-start problem in recommendation systems. In: Proceedings of the 2nd International Conference on Ubiquitous Information Management and Communication, Suwon, Korea, p 208-211

  22. Landauer TK, McNamara DS, Dennis S, Kintsch W (2013) Handbook of latent semantic analysis. Psychology Press, London

    Google Scholar 

  23. Li J, Jing M, Lu K, Zhu L, Yang Y, Huang Z (2019) From zero-shot learning to cold-start recommendation. Proceed AAAI Conf Artif Intell 33(01):4189–4196. https://doi.org/10.1609/aaai.v33i01.33014189

    Article  Google Scholar 

  24. Lika B, Kolomvatsos K, Hadjiefthymiades S (2014) Facing the cold start problem in recommender systems. Expert Systems with Applications 41(4):2065–2073, https://doi.org/10.1016/j.eswa.2013.09.005

  25. Liu NN, Meng X, Liu C, Yang Q (2011) Wisdom of the better few: Cold start recommendation via representative based rating elicitation. In: Proceedings of the Fifth ACM Conference on Recommender Systems, Chicago, Illinois, USA, p 37-44

  26. Logan B (2002) Content-based playlist generation: Exploratory experiments. ISMIR. France, Paris, pp 295–296

    Google Scholar 

  27. Ludewig M, Kamehkhosh I, Landia N, Jannach D (2018) Effective nearest-neighbor music recommendations. In: Proceedings of the ACM Recommender Systems Challenge (2018) Vancouver. BC, Canada

  28. Mimno D, Wallach HM, Talley E, Leenders M, McCallum A (2011) Optimizing semantic coherence in topic models. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, Edinburgh, United Kingdom, p 262-272

  29. Musto C, Semeraro G, de Gemmis M, Lops P (2016) Learning word embeddings from wikipedia for content-based recommender systems. In: European Conference on Information Retrieval, Springer, pp 729–734

  30. van Niedek T, de Vries AP (2018) Random walk with restart for automatic playlist continuation and query-specific adaptations. In: Proceedings of the ACM Recommender Systems Challenge (2018) Vancouver. BC, Canada

  31. Papadimitriou CH, Raghavan P, Tamaki H, Vempala S (2000) Latent semantic indexing: A probabilistic analysis. J Comput Syst Sci 61(2):217–235. https://doi.org/10.1006/jcss.2000.1711

    Article  MathSciNet  MATH  Google Scholar 

  32. Park YJ, Tuzhilin A (2008) The long tail of recommender systems and how to leverage it. In: Proceedings of the 2008 ACM Conference on Recommender Systems, Lausanne, Switzerland, p 11-18

  33. Pichl M, Zangerle E, Specht G (2015) Towards a context-aware music recommendation approach: What is hidden in the playlist name? 2015 IEEE International Conference on Data Mining Workshop. Atlantic City, USA, pp 1360–1365

    Chapter  Google Scholar 

  34. Polignano M, Basile P, de Gemmis M, Semeraro G (2019) Social tags and emotions as main features for the next song to play in automatic playlist continuation. Adjunct Publication of the 27th Conference on User Modeling. Adaptation and Personalization, Larnaca, Cyprus, pp 235–239

    Chapter  Google Scholar 

  35. Rashid AM, Karypis G, Riedl J (2008) Learning preferences of new users in recommender systems: An information theoretic approach. SIGKDD Explor Newsl 10(2):90–100. https://doi.org/10.1145/1540276.1540302

    Article  Google Scholar 

  36. Robertson S (2004) Understanding inverse document frequency: on theoretical arguments for idf. J Document 60(5):503–520. https://doi.org/10.1108/00220410410560582

    Article  Google Scholar 

  37. Rubtsov V, Kamenshchikov M, Valyaev I, Leksin V, Ignatov DI (2018) A hybrid two-stage recommender system for automatic playlist continuation. In: Proceedings of the ACM Recommender Systems Challenge 2018, Vancouver, Canada, pp 16:1–16:4

  38. Saveski M, Mantrach A (2014) Item cold-start recommendations: Learning local collective embeddings. In: Proceedings of the 8th ACM Conference on Recommender Systems, Foster City, Silicon Valley, California, USA, p 89-96

  39. Schedl M, Zamani H, Chen CW, Deldjoo Y, Elahi M (2018) Current challenges and visions in music recommender systems research. Int J Multimedia Inf Retr 7:95–116. https://doi.org/10.1007/s13735-018-0154-2

    Article  Google Scholar 

  40. Stevens K, Kegelmeyer P, Andrzejewski D, Buttler D (2012) Exploring topic coherence over many models and many topics. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Jeju Island, Korea, p 952-961

  41. Symeonidis P (2008) Content-based dimensionality reduction for recommender systems. In: Preisach C, Burkhardt H, Schmidt-Thieme L, Decker R (eds) Data Analysis, Machine Learning and Applications, Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg, pp 619–626

    Google Scholar 

  42. Tran T, Sweeney R, Lee K (2019) Adversarial mahalanobis distance-based attentive song recommender for automatic playlist continuation. In: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Paris, France, p 245-254

  43. Vall A, Dorfer M, Eghbal-Zadeh H, Schedl M, Burjorjee K, Widmer G (2019) Feature-combination hybrid recommender systems for automated music playlist continuation. User Modeling and User-Adapted Interaction 29(2):527–572. https://doi.org/10.1007/s11257-018-9215-8

    Article  Google Scholar 

  44. Volkovs M, Rai H, Cheng Z, Wu G, Lu Y, Sanner S (2018) Two-stage model for automatic playlist continuation at scale. In: Proceedings of the ACM Recommender Systems Challenge 2018, Vancouver, Canada, pp 9:1–9:6

  45. Voorhees EM, Harman DK et al (2005) TREC: Experiment and evaluation in information retrieval, vol 63. MIT press Cambridge, MA

    Google Scholar 

  46. Weston J, Bengio S, Usunier N (2011) Wsabie: Scaling up to large vocabulary image annotation. In: Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence - Volume Volume Three, Barcelona, Catalonia, Spain, p 2764-2770

  47. Yang H, Jeong Y, Choi M, Lee J (2018) Mmcf: Multimodal collaborative filtering for automatic playlist continuation. In: Proceedings of the ACM Recommender Systems Challenge 2018, Vancouver, Canada, pp 11:1–11:6

  48. Yürekli A, Bilge A, Kaleli C (2020) Exploring playlist titles for cold-start music recommendation: an effectiveness analysis. Journal of Ambient Intelligence and Humanized Computing pp 1–20

  49. Zamani H, Schedl M, Lamere P, Chen CW (2019) An analysis of approaches taken in the acm recsys challenge 2018 for automatic music playlist continuation. ACM Trans Intell Syst Technol 10:57:1–57:21, https://doi.org/10.1145/3344257

  50. Zhao X, Song Q, Caverlee J, Hu X (2018) Trailmix: An ensemble recommender system for playlist curation and continuation. In: Proceedings of the ACM Recommender Systems Challenge 2018, Vancouver, BC, Canada, pp 8:1–8:6

  51. Zhu L, He B, Ji M, Ju C, Chen Y (2018) Automatic music playlist continuation via neighbor-based collaborative filtering and discriminative reweighting/reranking. In: Proceedings of the ACM Recommender Systems Challenge 2018, Vancouver, Canada, pp 10:1–10:6

Download references

Acknowledgements

This work was supported by the Grant 20ADP172 from Eskişehir Technical University.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alper Bilge.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Code availability

The source code required to reproduce the experimental results is publicly available on a GitHub repository.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yürekli, A., Kaleli, C. & Bilge, A. Alleviating the cold-start playlist continuation in music recommendation using latent semantic indexing. Int J Multimed Info Retr 10, 185–198 (2021). https://doi.org/10.1007/s13735-021-00214-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s13735-021-00214-5

Keywords

Navigation