skip to main content
10.1145/3095713.3095722acmotherconferencesArticle/Chapter ViewAbstractPublication PagescbmiConference Proceedingsconference-collections
research-article

Prediction of User Demographics from Music Listening Habits

Authors Info & Claims
Published:19 June 2017Publication History

ABSTRACT

Online activities such as social networking, shopping, and consuming multi-media create digital traces often used to improve user experience and increase revenue, e.g., through better-fitting recommendations and targeted marketing. We investigate to which extent the music listening habits of users of the social music platform Last.fm can be used to predict their age, gender, and nationality. We propose a TF-IDF-like feature modeling approach for artist listening information and artist tags combined with additionally extracted features. We show that we can substantially outperform a baseline majority voting approach and can compete with existing approaches. Further, regarding prediction accuracy vs. available listening data we show that even one single listening event per user is enough to outperform the baseline in all prediction tasks. We conclude that personal information can be derived from music listening information, which indeed can help better tailoring recommendations.

References

  1. Z. Cheng, J. Caverlee, and K. Lee. 2010. You Are Where You Tweet: A Content-based Approach to Geo-locating Twitter Users. In Proceedings of the 19th ACM International Conference on Information and Knowledge Management. ACM, 759--768. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. J. Golbeck, C. Robles, and K. Turner. 2011. Predicting Personality with Social Media. In Proceedings of the 2011 Annual Conference on Human Factors in Computing Systems. ACM, 253--262. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. Witten. 2009. The WEKA Data Mining Software: An Update. SIGKDD Explorations 11,1 (2009), 10--18. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. T. Hastie and R. Tibshirani. 1998. Classification by Pairwise Coupling. In Proceedings of the 1997 Conference on Advances in Neural Information Processing Systems 10. MIT Press. Google ScholarGoogle ScholarCross RefCross Ref
  5. G. Holmes, M. Hall, and E. Frank. 1999. Generating Rule Sets from Model Trees. In Proceedings of the 12th Australian Joint Conference on Artificial Intelligence. Springer, 1--12. Google ScholarGoogle ScholarCross RefCross Ref
  6. H. Hotelling. 1933. Analysis of a Complex of Statistical Variables Into Principal Components. Journal of Educational Psychology 24, 6 (1933), 417--441 and 498--520.Google ScholarGoogle ScholarCross RefCross Ref
  7. Y. Hu, Y. Koren, and C. Volinsky. 2008. Collaborative filtering for implicit feedback datasets. In Proceedings of the 2008 8th IEEE International Conference on Data Mining. IEEE, 263--272. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. G. John and P. Langley. 1995. Estimating Continuous Distributions in Bayesian Classifiers. In Proceedings of the 11th Conference on Uncertainty in Artificial Intelligence. Morgan Kaufmann, 338--345.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S.S. Keerthi, S.K. Shevade, C. Bhattacharyya, and K.R.K. Murthy. 2001. Improvements to Platt's SMO Algorithm for SVM Classifier Design. Neural Computation 13, 3 (2001), 637--649. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. M. Kosinski, D. Stillwell, and T. Graepel. 2013. Private traits and attributes are predictable from digital records of human behavior. Proceedings of the National Academy of Sciences 110, 15 (2013), 5802--5805. Google ScholarGoogle ScholarCross RefCross Ref
  11. S. le Cessie and J.C. van Houwelingen. 1992. Ridge Estimators in Logistic Regression. Applied Statistics 41, 1 (1992), 191--201. Google ScholarGoogle ScholarCross RefCross Ref
  12. J. Liu and Y. Yang. 2012. Inferring Personal Traits from Music Listening History. In Proceedings of the 2nd International ACM Workshop on Music Information Retrieval with User-centered and Multimodal Strategies. ACM, 31--36. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. 2011. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research 12 (2011), 2825--2830.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Platt. 1998. Fast Training of Support Vector Machines using Sequential Minimal Optimization. In Advances in Kernel Methods - Support Vector Learning, B. Schoelkopf, C. Burges, and A. Smola (Eds.). MIT Press.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. R. Quinlan. 1992. Learning with Continuous Classes. In Proceedings of the 5th Australian Joint Conference on Artificial Intelligence. World Scientific, 343--348.Google ScholarGoogle Scholar
  16. M. Schedl. 2016. The LFM-1b Dataset for Music Retrieval and Recommendation. In Proceedings of the ACM International Conference on Multimedia Retrieval. ACM, 103--110. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. M. Schedl, D. Hauger, K. Farrahi, and M. Tkalčič. 2015. On the Influence of User Characteristics on Music Recommendation. In Proceedings of the 37th European Conference on Information Retrieval. Springer. Google ScholarGoogle ScholarCross RefCross Ref
  18. S.K. Shevade, S.S. Keerthi, C. Bhattacharyya, and K.R.K. Murthy. 1999. Improvements to the SMO Algorithm for SVM Regression. IEEE Transactions on Neural Networks 11 (1999), 1188--1193. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A.J. Smola and B. Schoelkopf. 1998. A tutorial on support vector regression. Technical Report. NeuroCOLT2 Tech. Rep. NC2-TR-1998-030.Google ScholarGoogle Scholar
  20. J. Su, H. Zhang, C. Ling, and S. Matwin. 2008. Discriminative Parameter Learning for Bayesian Networks. In Proceedings of the 25th International Conference on Machine Learning. ACM, 1016--1023. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Y. Wang and I.H. Witten. 1997. Induction of model trees for predicting continuous classes. In Poster papers of the 9th European Conference on Machine Learning. Springer.Google ScholarGoogle Scholar
  22. M. Wu, J. Jang, and C. Lu. 2014. Gender Identification and Age Estimation of Users Based on Music Metadata. In Proceedings of the 15th International Society for Music Information Retrieval Conference. ISMIR, 555--560.Google ScholarGoogle Scholar
  23. W. Youyou, M. Kosinski, and D. Stillwell. 2015. Computer-based personality judgments are more accurate than those made by humans. Proceedings of the National Academy of Sciences 112, 4 (2015), 1036--1040. Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Prediction of User Demographics from Music Listening Habits

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        CBMI '17: Proceedings of the 15th International Workshop on Content-Based Multimedia Indexing
        June 2017
        237 pages
        ISBN:9781450353335
        DOI:10.1145/3095713

        Copyright © 2017 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 19 June 2017

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader