Abstract
Song opinion is an important criterion when people organize and access songs. The ever growing amount of song data in the Web, which includes multilingual songs, calls for the development of automatic tools in classifying songs by opinion polarity. Sony lyric is a critical resource for song opinion classification. In this paper, we propose an approach to mine the opinion polarity of songs based on song lyrics in a multilingual environment. This approach is based on classification and translation. Firstly, we build monolingual opinion classifiers using supervised learning techniques for resource-rich languages, i.e., languages that are rich of labeled training data. However, it is difficult to build a classifier for a resource-rare language. In this case, we employ Language Grid, which is an infrastructure that is built on the top of the Internet, and provides easy-to-use services for multilingual translation, to bridge the gap between the resources in different languages. Song lyrics are translated from resource-rare languages into resource-rich languages, then the pre-trained monolingual opinion classifiers can be used to classify the translated unseen lyrics. To build effective monolingual opinion classifiers, we employ statistical information of song lyrics as features rather than individual words in the song lyrics. Experiments show that, our proposed approach performs better than two typical baseline approaches.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
- 2.
- 3.
- 4.
This dictionary contains 17,887 entries, and consists of 12 subsets, i.e., Chinese/English positive/negative feeling, Chines/English positive/negative sentiment, Chinese/English opinion, and Chinese/English degree.
- 5.
- 6.
References
Vignoli, F.: Digital music interaction concepts: a user study. In: Proceedings of the 5th International Conference on Music Information Retrieval (2004)
Lu, L., Liu, D., Zhang, H.J.: Automatic mood detection and tracking of music audio signals. IEEE Trans. Audio Speech Lang. Process. 14, 5–18 (2006)
Trohidis, K., Tsoumakas, G., Kalliris, G., Vlahavas, I.P.: Multi-label classification of music into emotions. In: ISMIR, pp. 325–330 (2008)
He, H., Jin, J., Xiong, Y., Chen, B., Sun, W., Zhao, L.: Language feature mining for music emotion classification via supervised learning from lyrics. In: Kang, L., Cai, Z., Yan, X., Liu, Y. (eds.) ISICA 2008. LNCS, vol. 5370, pp. 426–435. Springer, Heidelberg (2008)
Hu, Y., Chen, X., Yang, D.: Lyric-based song emotion detection with affective lexicon and fuzzy clustering method. In: ISMIR 2009, pp. 123–128 (2009)
Lu, B., Tan, C., Cardie, C., Tsou, B.K.: Joint bilingual sentiment classification with unlabeled parallel corpora. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, HLT 2011, vol. 1, pp. 320–330. Association for Computational Linguistics (2011)
Seki, Y., Evans, D.K., Ku, L.W., Chen, H.H., Kando, N., Lin, C.Y.: Overview of opinion analysis pilot task at NTCIR-6. In: Proceedings of NTICR-6 (2007)
Seki, Y., Evans, D.K., Ku, L.W., Sun, L., Chen, H.H., Kando, N.: Overview of multilingual opinion analysis task at NTCIR-7. In: Proceedings of NTCIR-7 (2008)
Nakagawa, T., Inui, K., Kurohashi, S.: Dependency tree-based sentiment classification using CRFs with hidden variables. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 786–794 (2010)
Schulz, J.M., Womser-Hacker, C., Mandl, T.: Multilingual corpus development for opinion mining. European Language Resources Association (2010)
Xia, Y., Wang, L., Wong, K.F.: Sentiment vector space model for lyric-based song sentiment classification. Int. J. Comput. Process. Lang. 21(4), 309–330 (2008)
Balahur, A., Turchi, M.: Comparative experiments using supervised learning and machine translation for multilingual sentiment analysis. Comput. Speech Lang. 28, 56–75 (2014)
Mihalcea, R., Banea, C., Wiebe, J.: Learning multilingual subjective language via cross-lingual projections. In: Proceedings of ACL (2007)
Banea, C., Mihalcea, R., Wiebe, J.: Multilingual subjectivity: are more languages better? In: Proceedings of the 23rd International Conference on Computational Linguistics, COLING 2010, pp. 28–36. Association for Computational Linguistics (2010)
Banea, C., Mihalcea, R., Wiebe, J., Hassan, S.: Multilingual subjectivity analysis using machine translation. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2008, pp. 127–135. Association for Computational Linguistics (2008)
Prettenhofer, P., Stein, B.: Cross-language text classification using structural correspondence learning. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL 2010, pp. 1118–1127. Association for Computational Linguistics (2010)
Blitzer, J., McDonald, R., Pereira, F.: Domain adaptation with structural correspondence learning. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, EMNLP 2006, pp. 120–128. Association for Computational Linguistics (2006)
Chen, Y., Skiena, S.: Building sentiment lexicons for all major languages. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Short Papers), vol. 2, pp. 383–389. Association for Computational Linguistics (2014)
Cho, Y.H., Lee, K.J.: Automatic affect recognition using natural language processing techniques and manually built affect lexicon. IEICE Trans. Inf. Syst. E89–D(12), 2964–2971 (2006)
Hu, X., Downie, J.S.: Improving mood classification in music digital libraries by combining lyrics and audio. In: Proceedings of the 10th Annual Joint Conference on Digital Libraries, JCDL 2010, pp. 159–168. ACM (2010)
McKay, C., Burgoyne, J.A., Hockman, J., Smith, J.B.L., Vigliensoni, G., Fujinaga, I.: Evaluating the genre classification performance of lyrical features relative to audio, symbolic and cultural features. In: ISMIR 2010, pp. 213–218 (2010)
Kim, S.M., Hovy, E.: Determining the sentiment of opinions. In: Proceedings of the 20th International Conference on Computational Linguistics, COLING 2004. Association for Computational Linguistics (2004)
Turney, P.D., Littman, M.L.: Measuring praise and criticism: inference of semantic orientation from association. ACM Trans. Inf. Syst. 21, 315–346 (2003)
Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, EMNLP 2002, vol. 10, pp. 79–86. Association for Computational Linguistics (2002)
Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, New York (2008)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Commun. ACM 18, 613–620 (1975)
Joachims, T.: Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms. Kluwer Academic Publishers, Norwell (2002)
Ishida, T. (ed.): The Language Grid - Service-Oriented Collective Intelligence for Language Resource Interoperability. Cognitive Technologies. Springer, Heidelberg (2011)
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2(3), 27: 1–27: 27 (2011)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: an update. SIGKDD Explor. Newsl. 11, 10–18 (2009)
Acknowledgments
This work was supported by the National Science Foundation of China under grant 61170165.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Liu, Q., Gao, Z. (2016). Mining Opinion Polarity from Multilingual Song Lyrics. In: Murakami, Y., Lin, D. (eds) Worldwide Language Service Infrastructure. WLSI 2015. Lecture Notes in Computer Science(), vol 9442. Springer, Cham. https://doi.org/10.1007/978-3-319-31468-6_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-31468-6_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-31467-9
Online ISBN: 978-3-319-31468-6
eBook Packages: Computer ScienceComputer Science (R0)