Skip to main content

Building a “Corpus of 7 Types Emotion Co-occurrences Words” of Chinese Emotional Words with Big Data Corpus

  • Conference paper
  • First Online:
HCI in Business, Government and Organizations (HCII 2022)

Abstract

Past studies used human rated as the way of establishing a corpus which costs a lot of time and money but contains insufficient words, also the Categorical Approach was seldom used for building corpus, which may also lead to study bias. Therefore, study 1 of present study has used the Spreading Activation Model as the structure, and used big data of text corpus and word co-occurrences to build a corpus that contains more categories of emotions and much more words. First, study 1 selected the words that can clearly describe the meanings or can effectively evoke the feeling of its emotion category for seven emotions, including Happiness, Surprise, Sadness, Anger, Disgust, Fear, and Love. Then study 1 calculated the averages of co-occurrences for selected words and text corpora by seven emotions categories (measure is Baroni-Urbani, unit is chunk), it computes the averages of co-occurrences by emotional categories for 33669 words, it represents the conceptual consonance of words and the emotions. Study 2 has investigated the practical use of the corpus built in study 1, and used C-LIWC dictionary which was built by human rated as a comparison, taking the posts of Happy Board, Sad Board, Hate Board of PTT Bulletin Board System into the analyses of emotions recognition, result showed that Corpus of 7 Types Emotion Co-occurrences Words” built in study 1 had higher correct rate than human rated corpus. Present study has also compared the correct rates between the Corpus of 7 Types Emotion Co-occurrences Words and CLIWC (Chinese Linguistic Inquiry and Word Count), result showed correct rates of two databases were significant different, the corpus of present study has higher correct rate. Present study has built a text corpus for the material of emotion research, and the results also supports a potential of building the corpora of emotional words with big data measures.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Chen, H.-C., Chan, Y.-C., Feng, Y.-J.: Taiwan corpora of Chinese emotions and relevant psychophysiological data-a norm of emotion metaphors in Chinese. Chin. J. Psychol. 55(4), 525–553 (2013). https://doi.org/10.6129/CJP.20130112b

    Article  Google Scholar 

  2. Kiefer, M., Schuch, S., Schenck, W., Fiedler, K.: Mood states modulate activity in semantic brain areas during emotional word encoding. Cereb. Cortex 17(7), 1516–1530 (2007). https://doi.org/10.1093/cercor/bhl062

    Article  Google Scholar 

  3. St-Hilaire, A., Cohen, A.S., Docherty, N.M.: Emotion word use in the conversational speech of schizophrenia patients. Cogn. Neuropsychiatry 13(4), 343–356 (2008). https://doi.org/10.1080/13546800802250560

    Article  Google Scholar 

  4. Russell, J.A., Pratt, G.: A description of the affective quality attributed to environments. J. Pers. Soc. Psychol. 38(2), 311–322 (1980). https://doi.org/10.1037/0022-3514.38.2.311

    Article  Google Scholar 

  5. Darwin, C., Ekman, P., Prodger, P.: The Expression of the Emotions in Man and Animals. Oxford University Press, Oxford (1998)

    Google Scholar 

  6. Eibl-Eibesfeldt, I.: The expressive behavior of the deaf-and-blind born. In: Social Communication and Movement, pp. 163–193. Academic Press (1973)

    Google Scholar 

  7. Plutchik, R.: A general psychoevolutionary theory of emotion. Theories Emotion 1, 3–31 (1980)

    Article  Google Scholar 

  8. Ekman, P., Friesen, W.V.: Measuring facial movement. Environ. Psychol. Nonverbal Behav. 1(1), 56–75 (1976). https://doi.org/10.1007/bf01115465

    Article  Google Scholar 

  9. Ekman, P., Friesen, W.V., Ellsworth, P.: Emotion in the Human Face: Guidelines for Research and an Integration of Findings. Pergamon Press, Oxford (1972)

    Google Scholar 

  10. Fontaine, J.R., Scherer, K.R., Roesch, E.B., Ellsworth, P.C.: The world of emotions is not two-dimensional. Psychol. Sci. 18(12), 1050–1057 (2007). https://doi.org/10.1111/j.1467-9280.2007.02024.x

    Article  Google Scholar 

  11. Lang, P.J., Bradley, M.M., Cuthbert, B.N.: Emotion, attention, and the startle reflex. Psychol. Rev. 97(3), 377–395 (1990). https://doi.org/10.1037/0033-295X.97.3.377

    Article  Google Scholar 

  12. Larsen, R.J., Diener, E.: Promises and Problems with the Circumplex Model of Emotion. In: Emotion, pp. 25–59. Sage Publications Inc, Thousand Oaks (1992)

    Google Scholar 

  13. Osgood, C.E., Suci, G.J., Tannenbaum, P.H.: The Measurement of Meaning. University of Illinois Press, Illinois (1957)

    Google Scholar 

  14. Thayer, R.E.: Activation-deactivation adjective check list: current overview and structural analysis. Psychol. Rep. 58(2), 607–614 (1986). https://doi.org/10.2466/pr0.1986.58.2.607

    Article  Google Scholar 

  15. Bradley, M.M., Lang, P.J.: Affective norms for English Words (ANEW): instruction manual and affective ratings. Technical report C-1, The Center for Research in Psychophysiology, University of Florida (1999). https://pdodds.w3.uvm.edu/teaching/courses/2009-08UVM-300/docs/others/everything/bradley1999a.pdf

  16. Hinojosa, J.A., et al.: Affective norms of 875 Spanish words for five discrete emotional categories and two emotional dimensions. Behav. Res. Methods 48(1), 272–284 (2015). https://doi.org/10.3758/s13428-015-0572-5

    Article  Google Scholar 

  17. Mukherjee, S., Heise, D.R.: Affective meanings of 1,469 Bengali concepts. Behav. Res. Methods 49(1), 184–197 (2016). https://doi.org/10.3758/s13428-016-0704-6

    Article  Google Scholar 

  18. Stadthagen-Gonzalez, H., Imbault, C., Pérez Sánchez, M.A., Brysbaert, M.: Norms of valence and arousal for 14,031 Spanish words. Behav. Res. Methods 49(1), 111–123 (2016). https://doi.org/10.3758/s13428-015-0700-2

    Article  Google Scholar 

  19. Warriner, A.B., Kuperman, V., Brysbaert, M.: Norms of valence, arousal, and dominance for 13,915 English lemmas. Behav. Res. Methods 45(4), 1191–1207 (2013). https://doi.org/10.3758/s13428-012-0314-x

    Article  Google Scholar 

  20. Cho, S.-L., Chen, H.-C., Cheng, C.-M.: Taiwan Corpora of Chinese emotions and relevant psychophysiological data-a study on the norm of Chinese emotional words. [Taiwan Corpora of Chinese Emotions and Relevant Psychophysiological Data-A Study on the Norm of Chinese Emotional Words]. Chin. J. Psychol. 55(4), 493–523 (2013). https://doi.org/10.6129/cjp.20131026

  21. Lee, H.-M., Lee, Y.-S.: Emotionality ratings and free association of 267 common Chinese words. Formosa J. Ment. Health 24(4), 495–524 (2011)

    Google Scholar 

  22. Wang, Y.-N., Zhou, L.-M., Luo, Y.-J.: The pilot establishment and evaluation of Chinese affective words system. Chin. Ment. Health J. 22(8), 608–612 (2008)

    Google Scholar 

  23. Yao, Z., Wu, J., Zhang, Y., Wang, Z.: Norms of valence, arousal, concreteness, familiarity, imageability, and context availability for 1,100 Chinese words. Behav. Res. Methods 49(4), 1374–1385 (2016). https://doi.org/10.3758/s13428-016-0793-2

    Article  Google Scholar 

  24. Brainerd, C.J., Holliday, R.E., Reyna, V.F., Yang, Y., Toglia, M.P.: Developmental reversals in false memory: effects of emotional valence and arousal. J. Exp. Child Psychol. 107(2), 137–154 (2010). https://doi.org/10.1016/j.jecp.2010.04.013

    Article  Google Scholar 

  25. Casasanto, D., de Bruin, A.: Metaphors we learn by: directed motor action improves word learning. Cognition 182, 177–183 (2019). https://doi.org/10.1016/j.cognition.2018.09.015

    Article  Google Scholar 

  26. Pennebaker, J.W., Chung, C.K., Ireland, M., Gonzales, A., Booth, R.J.: The development and psychometric properties of LIWC2007. Austin, TX. LIWC.net (2007)

    Google Scholar 

  27. Pennebaker, J.W., Francis, M.E., Booth, R.J.: Linguistic Inquiry and Word Count: LIWC 2001, vol. 71. Lawrence Erlbaum Associates, Mahway (2001)

    Google Scholar 

  28. Pennebaker, J.W., Booth, R.J., Francis, M.E.: Linguistic Inquiry and Word Count (LIWC 2007) (2007a)

    Google Scholar 

  29. Pennebaker, J.W., Booth, R.J., Francis, M.E.: Operator’s manual: linguistic inquiry and word count: LIWC2007, Austin, Texas. LIWC.net (2007b). http://www.gruberpeplab.com/teaching/psych231_fall2013/documents/231_Pennebaker2007.pdf

  30. Tausczik, Y.R., Pennebaker, J.W.: The psychological meaning of words: LIWC and computerized text analysis methods. J. Lang. Soc. Psychol. 29(1), 24–54 (2009). https://doi.org/10.1177/0261927x09351676

    Article  Google Scholar 

  31. Rude, S., Gortner, E.-M., Pennebaker, J.W.: Language use of depressed and depression-vulnerable college students. Cogn. Emot. 18(8), 1121–1133 (2004). https://doi.org/10.1080/02699930441000030

    Article  Google Scholar 

  32. Mehl, M.R., Gosling, S.D., Pennebaker, J.W.: Personality in its natural habitat: manifestations and implicit folk theories of personality in daily life. J. Pers. Soc. Psychol. 90(5), 862–877 (2006). https://doi.org/10.1037/0022-3514.90.5.862

    Article  Google Scholar 

  33. Jaffe, E.: What big data means for psychological science. APS Observer 27 (2014). https://www.psychologicalscience.org/observer/what-big-data-means-for-psychological-science

  34. Mednick, S.: The associative basis of the creative process. Psychol. Rev. 69(3), 220–232 (1962). https://doi.org/10.1037/h0048850

    Article  Google Scholar 

  35. Bargh, J.A., Chen, M., Burrows, L.: Automaticity of social behavior: direct effects of trait construct and stereotype activation on action. J. Pers. Soc. Psychol. 71(2), 230–244 (1996). https://doi.org/10.1037/0022-3514.71.2.230

    Article  Google Scholar 

  36. Fazio, R.H.: On the automatic activation of associated evaluations: an overview. Cogn. Emot. 15(2), 115–141 (2001). https://doi.org/10.1080/02699930125908

    Article  Google Scholar 

  37. Charles, W.G., Miller, G.A.: Contexts of antonymous adjectives. Appl. Psycholinguist. 10(3), 357–375 (1989). https://doi.org/10.1017/S0142716400008675

  38. Justeson, J.S., Katz, S.M.: Co-occurrences of antonymous adjectives and their contexts. Comput. Linguist. 17(1), 1–19 (1991)

    Google Scholar 

  39. Spence, D.P., Owens, K.C.: Lexical co-occurrence and association strength. J. Psycholinguist. Res. 19(5), 317–330 (1990). https://doi.org/10.1007/bf01074363

    Article  Google Scholar 

  40. Collins, A.M., Loftus, E.F.: A spreading-activation theory of semantic processing. Psychol. Rev. 82(6), 407–428 (1975). https://doi.org/10.1037/0033-295X.82.6.407

    Article  Google Scholar 

  41. Shaver, P., Schwartz, J., Kirson, D., O’Connor, C.: Emotion knowledge: further exploration of a prototype approach. J. Pers. Soc. Psychol. 52(6), 1061–1086 (1987). https://doi.org/10.1037/0022-3514.52.6.1061

    Article  Google Scholar 

  42. Huang, C.-L., et al.: The development of the Chinese linguistic inquiry and word count dictionary. Chin. J. Psychol. 54(2), 185–201 (2012)

    Google Scholar 

  43. Lin, S.-Y., Chen, H.-C., Chang, T.-H., Lee, W.-E., Sung, Y.-T.: CLAD: a corpus-derived Chinese lexical association database. Behav. Res. Methods 51(5), 2310–2336 (2019). https://doi.org/10.3758/s13428-019-01208-2

    Article  Google Scholar 

  44. Baroni-Urbani, C., Buser, M.W.: Similarity of binary data. Syst. Biol. 25(3), 251–259 (1976). https://doi.org/10.2307/2412493

    Article  Google Scholar 

Download references

Acknowledgements

This work was financially supported by the grant MOST-111-2634-F-002-004 from Ministry of Science and Technology (MOST) of Taiwan, the MOST AI Biomedical Research Center, and the “Institute for Research Excellence in Learning Sciences” and “Chinese Language and Technology Center” of National Taiwan Normal University from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education in Taiwan.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hsueh-Chih Chen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chen, CH. et al. (2022). Building a “Corpus of 7 Types Emotion Co-occurrences Words” of Chinese Emotional Words with Big Data Corpus. In: Fui-Hoon Nah, F., Siau, K. (eds) HCI in Business, Government and Organizations. HCII 2022. Lecture Notes in Computer Science, vol 13327. Springer, Cham. https://doi.org/10.1007/978-3-031-05544-7_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-05544-7_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-05543-0

  • Online ISBN: 978-3-031-05544-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics