Skip to main content
Log in

A corpus-based approach to classifying emotions using Korean linguistic features

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

Recently, social network services have become the popular communication tools among internet and mobile users. And it has been shared various opinions, which could be included various emotions. Emotion analysis aims to extract various emotion information, such as joy, happy, funny, fear, sad, and lonely, and so on, from texts expressed in natural language. Previous studies about emotion analysis on texts written in Korean have focused generally on the basic sentiments such as positive/neutral/negative preferences or 4–10 emotion classes. In this paper, we propose an emotion analysis method based on supervised learning to classify various emotions from messages written in Korean. We had found the feature set optimized to each emotion class through evaluating the combinations of various linguistic features and built a model to classify the emotion using the optimized feature set. To do this, it was constructed the corpus that is manually annotated with 25 emotion classes. We performed a 10-fold cross variation experiment for evaluating the performance of the proposed method. Our method obtained F-value ranged from 73.1 to 98.0% for each of 25 emotion classes. The optimized feature sets for most of emotion classes include commonly word 2-gram, POS 1-gram, and character 1-gram feature.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  1. Kim, S.Y., Park, S.T., Kim, Y.K.: Samsung-Apple patent war case analysis: focus on the strategy to deal with patent litigation. J. Digit. Converg. 13(3), 117–125 (2015)

    Article  Google Scholar 

  2. Choi, S., Choi, K.: Achievement and satisfaction research of the undergraduate orchestra club activities—a convergent aspects of statistical method and opinion mining. J. Korea Converg. Soc. 6(4), 25–31 (2015)

    Article  Google Scholar 

  3. Pang, B., Lee, L.: Opinion mining and sentiment analysis. Found. Trends Inf. Retr. 2(1–2), 1–135 (2008)

    Article  Google Scholar 

  4. Liu, B., Zhang, L.: A survey of opinion mining and sentiment analysis. In Aggarwal, C.C., Zhai, C. (eds.) Mining Text Data, pp. 415–463 (2012)

  5. Goldgerg, L.R.: An alternative “description of personality”: the big-five factor structure. J. Personal. Soc. Psychol. 59, 1216–1229 (1990)

    Article  Google Scholar 

  6. Alhussein, M.: Automatic facial emotion recognition using weber local descriptor for e-Healthcare system. Clust. Comput., 1–10 (2016)

  7. Kim, M.C., Park, S.O.: Trust management on user behavioral patterns for a mobile cloud computing. Clust. Comput. 16(4), 725–731 (2013)

    Article  Google Scholar 

  8. Pak, A., Paroubek, P.: Twitter as a corpus for sentiment analysis and opinion mining. In: Proceedings of the 7th Conference on International Language Resources and Evaluation. LREC ’10, pp. 1320–1326 (2010)

  9. Agarwal, A., Xie, B., Vovsha, I., Rambow, O., Passonneau, R.: Sentiment analysis of twitter data. In: Proceedings of the Workshop on Languages in Social Media. LSM ’11, pp. 30–38 (2011)

  10. Thelwall, M., Buckley, K., Paltoglou, G.: Sentiment in Twitter events. J. Am. Soc. Inf. Sci. Technol. 62(2), 406–418 (2011)

    Article  Google Scholar 

  11. Kiritchenko, S., Zhu, X., Mohammad, S.M.: Sentiment analysis of short informal texts. J. Artif. Intell. Res. 50, 723–762 (2014)

    MATH  Google Scholar 

  12. Lee, C., Choi, D., Kim, S., Kang, J.: Classification and analysis of emotion in korean microblog texts. KIISE 40(3), 159–167 (2013)

    Google Scholar 

  13. Hong, S.R., Chung, Y.O., Lee, J.H.: Semi-supervised learning for sentiment analysis in mass social media. J. Korean Inst. Intell. Syst. 24(5), 482–488 (2014)

    Article  Google Scholar 

  14. Park, K.M., Lim, H.: Acquiring lexical knowledge using raw corpora and unsupervised clustering method. Clust. Comput. 17(3), 901–910 (2014)

    Article  Google Scholar 

  15. Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford (2009)

  16. Kim, M.K., Kim, J.H., Cha, M.H., Chae, S.H.: An emotion scanning system on text documents. Korean J. Sci. Emot Sensib 12(4), 433–442 (2009)

  17. Ortony, A., Clore, G.L., Collins, A.: The cognitive structure of emotions. Cambridge University Press, Cambridge (1990)

  18. Lee, S.W.: Sentiment analysis system using Stanford sentiment treebank. J. Korean Soc. Mar. Eng. 39(3), 274–279 (2015)

    Article  Google Scholar 

  19. Park, I.J., Min, J.K.: Making a list of Korean emotion terms and exploring dimensions underlying them. Korean J. Soc. Personal. Psychol. 19(1), 109–129 (2005)

    Google Scholar 

  20. Lee, D.J., Yeon, J.H., Hwang, I.B., Lee, S.G.: KKMA: a tool for utilizing Sejong corpus based on relational database. KIISE Trans. Comput. Pract. 16(11), 1046–1050 (2010)

    Google Scholar 

  21. Wikipedia: Emoticons. http://ko.wikipedia.org/wiki/

  22. Oh, J.Y., Cha, J.W.: High speed Korean dependency analysis using cascaded chunking. J. Korea Soc. Simul. 19(1), 103–111 (2010)

    Google Scholar 

  23. The National Institute of the Korean Language: Sejong Treebank. http://www.sejong.or.kr

  24. Choi, D., Park, J.Y., Choi, K.S.,: Korean Treebank transformation for parser training. In: Proceedings of the ACL 2012 Joint Workshop on Statistical Parsing and Semantic Processing of Morphologically Rich Languages, pp. 78–88 (2012)

  25. Hahm, Y.G., Lim, K.T., Martin R., Park, J.Y., Choi, K.S.: Korean natural language processing platform for linked data. Annual Conference on Human and Language Technology, p. 24 (2012)

  26. Petrov, S., Klein, D.: Improved inference for unlexicalized parsing. HLT-NAACL, pp. 404–411 (2007)

  27. Bhuyan, H.K., Kamila, N.K.: Privacy preserving sub-feature selection based on fuzzy probabilities. Clust. Comput. 17(4), 1383–1399 (2014)

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2014R1A1A2056200).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Soonyoung Jung.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jung, Y., Park, K., Lee, T. et al. A corpus-based approach to classifying emotions using Korean linguistic features. Cluster Comput 20, 583–595 (2017). https://doi.org/10.1007/s10586-017-0777-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10586-017-0777-8

Keywords

Navigation