Abstract
Emotion theories are mainly classified as categorical or dimensional approaches. Given the importance of emotional words in emotion research, researchers have constructed a co-occurrence corpus of 7 types of emotion words through word co-occurrence and big data corpora. However, in addition to the categorical approach, the dimensional approach plays an important role in natural language processing. In particular, valence has an important influence on the study of emotion and language. In this study, the co-occurrence corpus of 7 types of emotion words constructed by Chen et al. [1] was expanded to create a corpus of emotional valences. Then, stepwise multiple regression analysis was performed with the predicted criterion variables and 15 predictor variables. The criterion variables were the emotional valences of 553 frequently occurring stimulus words included in the Chinese Word Association Norms [2]. The predictor variables included the emotion co-occurrences scores for 2 clusters (a cluster of literal emotion words and a cluster of metaphorical emotion words) and 7 types of emotions (happiness, love, surprise, sadness, anger, disgust, and fear) [the emotional words were common words from both the co-occurrence corpus of 7 types of emotion words constructed by Chen et al. [1] and the Chinese Word Association Norms established by Hu et al. [2]] and the virtue word co-occurrences score. The results showed that the scores for literal happiness word co-occurrences, metaphorical happiness word co-occurrences, literal disgust word co-occurrences, literal fear word co-occurrences, and virtue word co-occurrences could predict the valence values of emotion words, with the multiple correlation coefficients of multiple regression analyses reaching .729. Subsequently, the valence values of 33,669 words were established using the formula obtained from the multiple regression analysis of the 553 words. Next, the correlation between the actual valence values and the predicted valence values was analyzed to test the cross-validity of the established valences using the common words in the norm established by Lee and Lee [3] for the emotionality ratings and free associations of 267 common Chinese words. The results showed that the correlation between the 2 was .755, indicating that the predicted values generated by the big data corpora and word co-occurrence had a degree of similarity with the manually determined values. Based on theories and tests, this study used the co-occurrence data of 7 emotions and virtue to construct the corpus of emotional valences for 33,669 Chinese words. The results showed that the combined use of big data corpora and word co-occurrence can effectively expand existing corpora that were established based on emotional categories, improve the efficiency of manual construction of corpora, and establish a larger corpus of emotional words.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Chen, C.H., et al.: Building a “Corpus of 7 types emotion co-occurrences words” of Chinese emotional words with Big Data Corpus (in Chinese) [Paper presentation]. In: 24th International Conference on Human-Computer Interaction (HCII2022), Virtual Only Conference, June 2022 (2022)
Hu, J.-F., Chen, Y.-C., Zhuo, S.-L., Chen, H.-C., Chang, Y.-L., Sung, Y.-T.: Word “association” and “associated” norms for 1200 Chinese two-character words (in Chinese). Bull. Educ. Psychol. 49(1), 137–161 (2017). https://doi.org/10.6251/BEP.20161111
Lee, H.M., Lee, Y.S.: Emotionality ratings and free association of 267 common Chinese words (in Chinese). Formosa J. Mental Health 24(4), 495–524 (2011)
Darwin, C., Ekman, P., Prodger, P.: The Expression of the Emotions in Man and Animals. Oxford University Press, New York (1998)
Russell, J.A., Pratt, G.: A description of the affective quality attributed to environments. J. Pers. Soc. Psychol. 38(2), 311–322 (1980). https://doi.org/10.1037/0022-3514.38.2.311
Eibl-Eibesfeldt, I.: The Expressive Behavior of the Deaf-and-Blind Born Social Comm. and Mov., Ed. I., pp. 163–193. Academic Press (1973)
Plutchik, R.: A general psychoevolutionary theory of emotion. Theor. Emotion 1, 3–31 (1980)
Ekman, P., Friesen, W.V.: Measuring facial movement. Environ. Psychol. Nonverb. Behav. 1(1), 56–75 (1976). https://doi.org/10.1007/bf01115465
Ekman, P., Friesen, W.V., Ellsworth, P.: Emotion in the Human Face: Guidelines for Research and an Integration of Findings. Pergamon Press, Oxford (1972)
Shaver, P., Schwartz, J., Kirson, D., O’Connor, C.: Emotion knowledge: Further exploration of a prototype approach. J. Pers. Soc. Psychol. 52(6), 1061–1086 (1987). https://doi.org/10.1037/0022-3514.52.6.1061
Fontaine, J.R., Scherer, K.R., Roesch, E.B., Ellsworth, P.C.: The world of emotions is not two-dimensional. Psychol. Sci. 18(12), 1050–1057 (2007). https://doi.org/10.1111/j.1467-9280.2007.02024.x
Lang, P.J., Bradley, M.M., Cuthbert, B.N.: Emotion, attention, and the startle reflex. Psychol. Rev. 97(3), 377–395 (1990). https://doi.org/10.1037/0033-295X.97.3.377
Larsen, R.J., Diener, E.: Promises and Problems with the Circumplex Model of Emotion, pp. 25–59. Sage Publications Inc., Thousand Oaks (1992)
Osgood, C.E., Suci, G.J., Tannenbaum, P.H.: The Measurement of Meaning. University of Illinois Press (1957)
Thayer, R.E.: Activation-deactivation adjective check list: current overview and structural analysis. Psychol. Rep. 58(2), 607–614 (1986). https://doi.org/10.2466/pr0.1986.58.2.607
Kuperman, V., Estes, Z., Brysbaert, M., Warriner, A.B.: Emotion and language: valence and arousal affect word recognition. J. Exp. Psychol. Gen. 143(3), 1065 (2014)
Kauschke, C., Bahn, D., Vesker, M., Schwarzer, G.: The role of emotional valence for the processing of facial and verbal stimuli—positivity or negativity bias? Front. Psychol. 10, 1654 (2019)
Gendron, M., Lindquist, K.A., Barsalou, L., Barrett, L.F.: Emotion words shape emotion percepts. Emotion 12(2), 314–325 (2012). https://doi.org/10.1037/a0026007
Ko, Y.H., Cho, S.L.: The relationship between proneness to borderline personality and suicide ideation based on the analysis of mental function and free association of emotion words (in Chinese). Fu-Jen Journal of Medicine 11(2), 59–71 (2013)
Bradley, M.M., Lang, P.J.: Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings, pp. 1–45. The Center for Research in Psychophysiology (1999)
Mukherjee, S., Heise, D.R.: Affective meanings of 1,469 Bengali concepts. Behav. Res. Methods 49(1), 184–197 (2016). https://doi.org/10.3758/s13428-016-0704-6
Stadthagen-Gonzalez, H., Imbault, C., Pérez Sánchez, M.A., Brysbaert, M.: Norms of valence and arousal for 14,031 Spanish words. Behav. Res. Methods 49(1), 111–123 (2016). https://doi.org/10.3758/s13428-015-0700-2
Warriner, A.B., Kuperman, V., Brysbaert, M.: Norms of valence, arousal, and dominance for 13,915 English lemmas. Behav. Res. Methods 45(4), 1191–1207 (2013). https://doi.org/10.3758/s13428-012-0314-x
Wang, Y.N., Zhou, L.M., Luo, Y.J.: The pilot establishment and evaluation of Chinese affective words system (in Chinese). Chin. Ment. Health J. 22(8), 608–612 (2008)
Cho, S.L., Chen, H.C., Cheng, C.M.: Taiwan corpora of Chinese emotions and relevant psychophysiological data-a study on the norms of Chinese emotional words (in Chinese). Chinese J. Psychol. 55(4), 493–523 (2013). https://doi.org/10.6129/cjp.20131026
Chen, H.C., Chan, Y.C., Feng, Y.J.: Taiwan corpora of Chinese emotions and relevant psychophysiological data-a norm of emotion metaphors in Chinese (in Chinese). Chinese J. Psychol. 55(4), 525–553 (2013). https://doi.org/10.6129/CJP.20130112b
Yao, Z., Wu, J., Zhang, Y., Wang, Z.: Norms of valence, arousal, concreteness, familiarity, imageability, and context availability for 1,100 Chinese words. Behav. Res. Methods 49(4), 1374–1385 (2016). https://doi.org/10.3758/s13428-016-0793-2
Haidt, J.: The Moral Emotions Handbook of Affective Sciences, pp. 852–870. Oxford University Press, New York (2003)
Rich, J.M.: Moral education and the emotions. J. Moral Educ. 9(2), 81–87 (1980). https://doi.org/10.1080/0305724800090202
Rudolph, U., Tscharaktschiew, N.: An attributional analysis of moral emotions: naïve scientists and everyday judges. Emot. Rev. 6(4), 344–352 (2014). https://doi.org/10.1177/1754073914534507
Jackson, M.: Emotion and Psyche. John Hunt Publishing Limited (2010)
Tangney, J.P., Stuewig, J., Mashek, D.J.: Moral emotions and moral behavior. Annu. Rev. Psychol. 58, 345–372 (2007). https://doi.org/10.1146/annurev.psych.56.091103.070145
Hinojosa, J.A., et al.: Affective norms of 875 Spanish words for five discrete emotional categories and two emotional dimensions. Behav. Res. Methods 48(1), 272–284 (2015). https://doi.org/10.3758/s13428-015-0572-5
Baccianella, S., Esuli, A., Sebastiani, F.: Sentiwordnet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. Paper Presented at the Lrec (2010)
Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37(2), 267–307 (2011). https://doi.org/10.1162/COLI_a_00049
Ku, L.-W., Chen, H.-H.: Mining opinions from the web: beyond relevance retrieval. J. Am. Soc. Inf. Sci. Technol. 58(12), 1838–1850 (2007). https://doi.org/10.1002/asi.20630
Pennebaker, J.W., Booth, R.J., Francis, M.E.: Linguistic Inquiry and Word Count (LIWC2007) (2007)
Pennebaker, J.W., Booth, R.J., Francis, M.E.: Operator’s manual: linguistic inquiry and word count: LIWC2007, Austin: LIWC. net (2007). http://homepage.psy.utexas.edu/HomePage/Faculty/Pennebaker/Reprints/LIWC2007_OperatorManual.pdf. Accessed 1 Oct 2013
Pennebaker, J.W., Francis, M.E., Booth, R.J.: Linguistic Inquiry and Word Count: LIWC 2001, vol. 71. Lawrence Erlbaum Associates, Mahway (2001)
Jaffe, E.: What Big Data Means for Psychological Science. Observer 27(6) (2014)
Chang Chien, C.Y.: Analysis of Life Curriculum Textbooks in Elementary Schools with Regard to Character Building Education (in Chinese). Master’s thesis in the summer social education program of the Department of Further Education, National Taitung University (2010)
Baroni-Urbani, C., Buser, M.W.: Similarity of binary data. Syst. Biol. 25(3), 251–259 (1976). https://doi.org/10.2307/2412493
Pecina, P.: Lexical association measures and collocation extraction. Lang. Resour. Eval. 44(1–2), 137–158 (2009). https://doi.org/10.1007/s10579-009-9101-4
Acknowledgements
This work was financially supported by the grant MOST-111-2634-F-002-004 from Ministry of Science and Technology (MOST) of Taiwan, the MOST AI Biomedical Research Center, and the “Institute for Research Excellence in Learning Sciences” and “Chinese Language and Technology Center” of National Taiwan Normal University from The Featured Areas Research Center Program within the framework of the Higher Education Sprout Project by the Ministry of Education in Taiwan.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Chang, CY. et al. (2022). The Corpus of Emotional Valences for 33,669 Chinese Words Based on Big Data. In: Fui-Hoon Nah, F., Siau, K. (eds) HCI in Business, Government and Organizations. HCII 2022. Lecture Notes in Computer Science, vol 13327. Springer, Cham. https://doi.org/10.1007/978-3-031-05544-7_11
Download citation
DOI: https://doi.org/10.1007/978-3-031-05544-7_11
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-05543-0
Online ISBN: 978-3-031-05544-7
eBook Packages: Computer ScienceComputer Science (R0)