Abstract
Personality traits are crucial factors that influence individual behavior and responses. In this study, we introduced both personality traits and experience traits in an analysis of the characteristics of citizens using social media posts from different cities. The personality traits referenced some facets of the “Big Five Model” of personality. The experience traits were introduced considering the close relationship between personality traits and behavior. Specifically, we labeled social media posts from each city with two labels, representing a personality trait and an experience trait. We then examined how these traits reflect the tendencies in citizens’ behavior. The personality traits defined in this study were “Altruism,” “Artistic Interest,” “Adventurousness,” “Gregariousness,” and “Activity.” We assigned the labels manually and fine-tuned large language models to assign the personality trait labels to cities automatically. Finally, we analyzed the differences in personality trait trends between cities. Our experimental results showed that the F1 scores of the prediction models for both personality traits and experience traits exceeded 0.8. The analysis of social media posts using the trained models demonstrated that the citizens of a certain city had significantly higher scores for the personality traits “Artistic Interest” and “Gregariousness” than those from other cities, which was consistent with the results of previous questionnaire-based studies.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: pre-training of deep bidirectional transformers for language understanding. In: Burstein, J., Doran, C., Solorio, T. (eds.) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pp. 4171–4186. Association for Computational Linguistics, Minneapolis, Minnesota (2019). https://doi.org/10.18653/v1/N19-1423
El-Demerdash, K., El-Khoribi, R.A., Ismail Shoman, M.A., Abdou, S.: Deep learning based fusion strategies for personality prediction. Egypt. Inf. J. 23(1), 47–53 (2022). https://doi.org/10.1016/j.eij.2021.05.004
Fleiss, J.L.: Measuring nominal scale agreement among many raters. Psychol. Bull. 76(5), 378–382 (1971). https://doi.org/10.1037/h0031619
Furnham, A., Chamorro-Premuzic, T.: Personality, intelligence, and art. Personality Individ. Differ. 36(3), 705–715 (2004). https://doi.org/10.1016/S0191-8869(03)00128-4
Goldberg, L.R.: A broad-bandwidth, public domain, personality inventory measuring the lower-level facets of several five-factor models. Pers. Psychol. Eur. 7(1), 7–28 (1999)
Hirsh, J.B., Kang, S.K., Bodenhausen, G.V.: Personalized persuasion: tailoring persuasive appeals to recipients’ personality traits. Psychol. Sci. 23(6), 578–581 (2012). https://doi.org/10.1177/0956797611436349
John, O.P., Srivastava, S.: The big five trait taxonomy: history, measurement, and theoretical perspectives. In: Pervin, L.A., John, O.P. (eds.) Handbook of personality: Theory and research, 2nd edn., pp. 102–138. Guilford, New York (1999)
Johnson, J.A.: Measuring thirty facets of the five factor model with a 120-item public domain inventory: development of the IPIP-NEO-120. J. Res. Pers. 51, 78–89 (2014). https://doi.org/10.1016/j.jrp.2014.05.003
Kamijo, K., Nasukawa, T., Kitamura, H.: Personality estimation from Japanese text. In: Nissim, M., Patti, V., Plank, B. (eds.) Proceedings of the Workshop on Computational Modeling of People’s Opinions, Personality, and Emotions in Social Media (PEOPLES), pp. 101–109. The COLING 2016 Organizing Committee, Osaka, Japan (2016). https://aclanthology.org/W16-4311
Landis, J.R., Koch, G.G.: The measurement of observer agreement for categorical data. Biometrics 33(1), 159–174 (1977). https://doi.org/10.2307/2529310
Liu, Y., et al.: RoBERTa: a robustly optimized BERT pretraining approach (2019). https://arxiv.org/abs/1907.11692
Mairesse, F., Walker, M., Mehl, M., Moore, R.: Using linguistic cues for the automatic recognition of personality in conversation and text. J. Artif. Intell. Res. (JAIR) 30, 457–500 (2007). https://doi.org/10.1613/jair.2349
Oshio, A., Abe, S., Cutrone, P., Gosling, S.D.: Big Five content representation of the Japanese version of the Ten-Item Personality Inventory. Psychology 4(12), 924–929 (2013). https://doi.org/10.4236/psych.2013.412133
Reimers, N., Gurevych, I.: Sentence-BERT: sentence embeddings using siamese BERT-networks. In: Inui, K., Jiang, J., Ng, V., Wan, X. (eds.) Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp. 3982–3992. Hong Kong, China (2019). https://doi.org/10.18653/v1/D19-1410
Schmitt, D.P., Allik, J., McCrae, R.R., Benet-Martínez, V.: The geographic distribution of big five personality traits: patterns and profiles of human self-description across 56 nations. J. Cross Cult. Psychol. 38(2), 173–212 (2007). https://doi.org/10.1177/0022022106297299
The Harris Poll: The 2023 State of Social Media: AI & Data Take Center Stage (2023). https://sproutsocial.com/insights/data/harris-insights-report-2023/
Yamada, I., Asai, A., Shindo, H., Takeda, H., Matsumoto, Y.: LUKE: deep contextualized entity representations with entity-aware self-attention. In: Webber, B., Cohn, T., He, Y., Liu, Y. (eds.) Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 6442–6454 (2020). https://doi.org/10.18653/v1/2020.emnlp-main.523
Yoshino, S., Oshio, A.: Regional differences in big five personality traits in Japan (in Japanese). Jpn. J. Environ. Psychol. 9(1), 19–33 (2021). https://doi.org/10.20703/jenvpsy.9.1_19
Acknowledgments
This work was partially supported by the JSPS the Grant-in-Aid for Scientific Research (B) (#23K28375), the Grant-in-Aid for Scientific Research (C) (#24K15066), and a Grant-in-Aid for Research ActivityStart-up (#22K21303). We would like to express our sincere gratitude to Tomoya Hiramoto, Shiori Oogu, and Sachiko Tachibana from the National Institute for Japanese Language and Linguistics for their great cooperation in annotating the dataset.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Iwasaki, T., Seki, Y., Kashino, W., Keyaki, A., Kando, N. (2025). Estimating Citizen Personality Traits Using Social Media Posts. In: Oliver, G., Frings-Hessami, V., Du, J.T., Tezuka, T. (eds) Sustainability and Empowerment in the Context of Digital Libraries. ICADL 2024. Lecture Notes in Computer Science, vol 15494. Springer, Singapore. https://doi.org/10.1007/978-981-96-0868-3_10
Download citation
DOI: https://doi.org/10.1007/978-981-96-0868-3_10
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-96-0867-6
Online ISBN: 978-981-96-0868-3
eBook Packages: Computer ScienceComputer Science (R0)