Skip to main content

Mining Opinion Polarity from Multilingual Song Lyrics

  • Conference paper
  • First Online:
  • 478 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9442))

Abstract

Song opinion is an important criterion when people organize and access songs. The ever growing amount of song data in the Web, which includes multilingual songs, calls for the development of automatic tools in classifying songs by opinion polarity. Sony lyric is a critical resource for song opinion classification. In this paper, we propose an approach to mine the opinion polarity of songs based on song lyrics in a multilingual environment. This approach is based on classification and translation. Firstly, we build monolingual opinion classifiers using supervised learning techniques for resource-rich languages, i.e., languages that are rich of labeled training data. However, it is difficult to build a classifier for a resource-rare language. In this case, we employ Language Grid, which is an infrastructure that is built on the top of the Internet, and provides easy-to-use services for multilingual translation, to bridge the gap between the resources in different languages. Song lyrics are translated from resource-rare languages into resource-rich languages, then the pre-trained monolingual opinion classifiers can be used to classify the translated unseen lyrics. To build effective monolingual opinion classifiers, we employ statistical information of song lyrics as features rather than individual words in the song lyrics. Experiments show that, our proposed approach performs better than two typical baseline approaches.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://nlp.stanford.edu/software/tagger.shtml.

  2. 2.

    http://ictclas.org/ictclas_demo.html.

  3. 3.

    http://tartarus.org/martin/PorterStemmer/.

  4. 4.

    This dictionary contains 17,887 entries, and consists of 12 subsets, i.e., Chinese/English positive/negative feeling, Chines/English positive/negative sentiment, Chinese/English opinion, and Chinese/English degree.

  5. 5.

    http://langrid.org/en/index.html.

  6. 6.

    http://music.baidu.com/tag.

References

  1. Vignoli, F.: Digital music interaction concepts: a user study. In: Proceedings of the 5th International Conference on Music Information Retrieval (2004)

    Google Scholar 

  2. Lu, L., Liu, D., Zhang, H.J.: Automatic mood detection and tracking of music audio signals. IEEE Trans. Audio Speech Lang. Process. 14, 5–18 (2006)

    Article  Google Scholar 

  3. Trohidis, K., Tsoumakas, G., Kalliris, G., Vlahavas, I.P.: Multi-label classification of music into emotions. In: ISMIR, pp. 325–330 (2008)

    Google Scholar 

  4. He, H., Jin, J., Xiong, Y., Chen, B., Sun, W., Zhao, L.: Language feature mining for music emotion classification via supervised learning from lyrics. In: Kang, L., Cai, Z., Yan, X., Liu, Y. (eds.) ISICA 2008. LNCS, vol. 5370, pp. 426–435. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  5. Hu, Y., Chen, X., Yang, D.: Lyric-based song emotion detection with affective lexicon and fuzzy clustering method. In: ISMIR 2009, pp. 123–128 (2009)

    Google Scholar 

  6. Lu, B., Tan, C., Cardie, C., Tsou, B.K.: Joint bilingual sentiment classification with unlabeled parallel corpora. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, HLT 2011, vol. 1, pp. 320–330. Association for Computational Linguistics (2011)

    Google Scholar 

  7. Seki, Y., Evans, D.K., Ku, L.W., Chen, H.H., Kando, N., Lin, C.Y.: Overview of opinion analysis pilot task at NTCIR-6. In: Proceedings of NTICR-6 (2007)

    Google Scholar 

  8. Seki, Y., Evans, D.K., Ku, L.W., Sun, L., Chen, H.H., Kando, N.: Overview of multilingual opinion analysis task at NTCIR-7. In: Proceedings of NTCIR-7 (2008)

    Google Scholar 

  9. Nakagawa, T., Inui, K., Kurohashi, S.: Dependency tree-based sentiment classification using CRFs with hidden variables. In: Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 786–794 (2010)

    Google Scholar 

  10. Schulz, J.M., Womser-Hacker, C., Mandl, T.: Multilingual corpus development for opinion mining. European Language Resources Association (2010)

    Google Scholar 

  11. Xia, Y., Wang, L., Wong, K.F.: Sentiment vector space model for lyric-based song sentiment classification. Int. J. Comput. Process. Lang. 21(4), 309–330 (2008)

    Article  Google Scholar 

  12. Balahur, A., Turchi, M.: Comparative experiments using supervised learning and machine translation for multilingual sentiment analysis. Comput. Speech Lang. 28, 56–75 (2014)

    Article  Google Scholar 

  13. Mihalcea, R., Banea, C., Wiebe, J.: Learning multilingual subjective language via cross-lingual projections. In: Proceedings of ACL (2007)

    Google Scholar 

  14. Banea, C., Mihalcea, R., Wiebe, J.: Multilingual subjectivity: are more languages better? In: Proceedings of the 23rd International Conference on Computational Linguistics, COLING 2010, pp. 28–36. Association for Computational Linguistics (2010)

    Google Scholar 

  15. Banea, C., Mihalcea, R., Wiebe, J., Hassan, S.: Multilingual subjectivity analysis using machine translation. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2008, pp. 127–135. Association for Computational Linguistics (2008)

    Google Scholar 

  16. Prettenhofer, P., Stein, B.: Cross-language text classification using structural correspondence learning. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, ACL 2010, pp. 1118–1127. Association for Computational Linguistics (2010)

    Google Scholar 

  17. Blitzer, J., McDonald, R., Pereira, F.: Domain adaptation with structural correspondence learning. In: Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, EMNLP 2006, pp. 120–128. Association for Computational Linguistics (2006)

    Google Scholar 

  18. Chen, Y., Skiena, S.: Building sentiment lexicons for all major languages. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Short Papers), vol. 2, pp. 383–389. Association for Computational Linguistics (2014)

    Google Scholar 

  19. Cho, Y.H., Lee, K.J.: Automatic affect recognition using natural language processing techniques and manually built affect lexicon. IEICE Trans. Inf. Syst. E89–D(12), 2964–2971 (2006)

    Article  Google Scholar 

  20. Hu, X., Downie, J.S.: Improving mood classification in music digital libraries by combining lyrics and audio. In: Proceedings of the 10th Annual Joint Conference on Digital Libraries, JCDL 2010, pp. 159–168. ACM (2010)

    Google Scholar 

  21. McKay, C., Burgoyne, J.A., Hockman, J., Smith, J.B.L., Vigliensoni, G., Fujinaga, I.: Evaluating the genre classification performance of lyrical features relative to audio, symbolic and cultural features. In: ISMIR 2010, pp. 213–218 (2010)

    Google Scholar 

  22. Kim, S.M., Hovy, E.: Determining the sentiment of opinions. In: Proceedings of the 20th International Conference on Computational Linguistics, COLING 2004. Association for Computational Linguistics (2004)

    Google Scholar 

  23. Turney, P.D., Littman, M.L.: Measuring praise and criticism: inference of semantic orientation from association. ACM Trans. Inf. Syst. 21, 315–346 (2003)

    Article  Google Scholar 

  24. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, EMNLP 2002, vol. 10, pp. 79–86. Association for Computational Linguistics (2002)

    Google Scholar 

  25. Manning, C.D., Raghavan, P., Schütze, H.: Introduction to Information Retrieval. Cambridge University Press, New York (2008)

    Book  MATH  Google Scholar 

  26. Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Commun. ACM 18, 613–620 (1975)

    Article  MATH  Google Scholar 

  27. Joachims, T.: Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms. Kluwer Academic Publishers, Norwell (2002)

    Book  Google Scholar 

  28. Ishida, T. (ed.): The Language Grid - Service-Oriented Collective Intelligence for Language Resource Interoperability. Cognitive Technologies. Springer, Heidelberg (2011)

    Google Scholar 

  29. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. Intell. Syst. Technol. 2(3), 27: 1–27: 27 (2011)

    Article  Google Scholar 

  30. Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: an update. SIGKDD Explor. Newsl. 11, 10–18 (2009)

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported by the National Science Foundation of China under grant 61170165.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Qian Liu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Liu, Q., Gao, Z. (2016). Mining Opinion Polarity from Multilingual Song Lyrics. In: Murakami, Y., Lin, D. (eds) Worldwide Language Service Infrastructure. WLSI 2015. Lecture Notes in Computer Science(), vol 9442. Springer, Cham. https://doi.org/10.1007/978-3-319-31468-6_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-31468-6_12

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-31467-9

  • Online ISBN: 978-3-319-31468-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics