
Music emotion classification and context-based music recommendation

Multimedia Tools and Applications

Abstract

Context-based music recommendation is one of the rapidly emerging applications of the ubiquitous computing era. It requires multidisciplinary effort, including low-level feature extraction and music classification, human emotion description and prediction, ontology-based representation and recommendation, and the establishment of connections among them. In this paper, we make three distinct contributions toward context awareness in the music recommendation field. First, we propose a novel emotion state transition model (ESTM) that models human emotional states and the transitions between them induced by music. The ESTM acts as a bridge between the user's situation and emotion on one side and low-level music features on the other. With the ESTM, we can recommend the music most appropriate for moving the user toward a desired emotional state. Second, we present the context-based music recommendation (COMUS) ontology for modeling a user's musical preferences and context, and for supporting reasoning about the user's desired emotion and preferences. COMUS is a music-dedicated ontology in OWL, constructed by incorporating domain-specific classes for music recommendation, including situation, mood, and musical features, into the Music Ontology. Third, to map low-level features to the ESTM, we collected various high-dimensional music feature data and applied nonnegative matrix factorization (NMF) for dimension reduction; we used a support vector machine (SVM) as the emotional state transition classifier. We built a prototype music recommendation system on these components and carried out various experiments to measure its performance. We report some of the experimental results.
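To make the classification pipeline concrete, the following is a minimal sketch, not the authors' implementation: it assumes scikit-learn, an illustrative feature dimension, and made-up transition labels, none of which come from the paper. NMF compresses a nonnegative feature matrix into a low-dimensional code, a multiclass SVM maps that code to an emotion-state transition, and an ESTM-style check then matches the prediction against the user's desired target state.

    import numpy as np
    from sklearn.decomposition import NMF
    from sklearn.pipeline import make_pipeline
    from sklearn.svm import SVC

    rng = np.random.default_rng(0)

    # Hypothetical training data: 200 clips x 512 nonnegative low-level
    # features (e.g., spectral/chroma statistics). Labels index four
    # illustrative emotion-state transitions; the real classes would come
    # from the ESTM.
    TRANSITIONS = ["sad->calm", "calm->happy", "angry->calm", "calm->excited"]
    X = rng.random((200, 512))
    y = rng.integers(0, len(TRANSITIONS), size=200)

    # NMF requires nonnegative input, which magnitude-based audio features
    # satisfy; the SVM then classifies the reduced representation.
    model = make_pipeline(
        NMF(n_components=20, init="nndsvda", max_iter=500, random_state=0),
        SVC(kernel="rbf", C=1.0),  # one-vs-one multiclass by default
    )
    model.fit(X, y)

    # ESTM-style use: recommend a clip only if its predicted transition
    # ends in the user's desired emotional state.
    desired_state = "calm"
    clip_features = rng.random((1, 512))
    predicted = TRANSITIONS[model.predict(clip_features)[0]]
    if predicted.endswith(desired_state):
        print("recommend this clip:", predicted)

In the paper itself the features are extracted from audio and the transition classes are defined by the ESTM; the random data above merely stands in for both.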


Acknowledgement

This work was supported by the Korea Research Foundation Grant funded by the Korean Government (MOEHRD) (KRF-2007-313-D00758).

Author information

Corresponding author

Correspondence to Eenjun Hwang.


About this article

Cite this article

Han, Bj., Rho, S., Jun, S. et al. Music emotion classification and context-based music recommendation. Multimed Tools Appl 47, 433–460 (2010). https://doi.org/10.1007/s11042-009-0332-6
