Abstract
To predict personality traits of data-driven personas, we apply an automatic persona generation methodology to generate 15 personas from the social media data of an online news organization. After generating the personas, we aggregate each personas’ YouTube comments and predict the “Big Five” personality traits of each persona from the comments pertaining to that persona. For this, we develop a deep learning classifier using three publicly available datasets. Results indicate an average performance increase of 4.84% in F1 scores relative to the baseline. We then analyze how the personas differ by their detected personality traits and discuss how personality traits could be implemented in data-driven persona profiles, as either scores or narratives.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
The MPD dataset was previously available on the Web (http://mypersonality.org), but at the time of writing it has been withdrawn. The YT dataset is available upon request (https://www.idiap.ch/dataset/youtube-personality), and the essays dataset can be readily downloaded (https://github.com/SenticNet/personality-detection/blob/master/essays.csv).
- 4.
References
Cooper, A.: The Inmates Are Running the Asylum: Why High Tech Products Drive Us Crazy and How to Restore the Sanity. Sams - Pearson Education, Indianapolis (1999)
Pruitt, J., Grudin, J.: Personas: practice and theory. In: Proceedings of the 2003 Conference on Designing for User Experiences, pp. 1–15. ACM, New York (2003). https://doi.org/10.1145/997078.997089
Nielsen, L.: Personas - User Focused Design. Springer, London (2013)
Salminen, J., Jansen, B.J., An, J., Kwak, H., Jung, S.: Are personas done? Evaluating their usefulness in the age of digital analytics. Persona Stud. 4, 47–65 (2018). https://doi.org/10.21153/psj2018vol4no2art737
LeRouge, C., Ma, J., Sneha, S., Tolle, K.: User profiles and personas in the design and development of consumer health technologies. Int. J. Med. Inform. 82, e251–e268 (2013). https://doi.org/10.1016/j.ijmedinf.2011.03.006
Pruitt, J., Adlin, T.: The Persona Lifecycle: Keeping People in Mind Throughout Product Design. Morgan Kaufmann, Boston (2006)
Nielsen, L., Storgaard Hansen, K.: Personas is applicable: a study on the use of personas in Denmark. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. pp. 1665–1674. ACM (2014)
Salminen, J., Jung, S., An, J., Kwak, H., Nielsen, L., Jansen, B.J.: Confusion and information triggered by photos in persona profiles. Int. J. Hum.-Comput. Stud. 129, 1–14 (2019). https://doi.org/10.1016/j.ijhcs.2019.03.005
Nielsen, L.: Personas - User Focused Design. Springer, New York (2019). https://doi.org/10.1007/978-1-4471-4084-9
Nielsen, L., Hansen, K.S., Stage, J., Billestrup, J.: A template for design personas: analysis of 47 persona descriptions from Danish industries and organizations. Int. J. Sociotechnol. Knowl. Dev. 7, 45–61 (2015). https://doi.org/10.4018/ijskd.2015010104
Anvari, F., Richards, D., Hitchens, M., Babar, M.A.: Effectiveness of persona with personality traits on conceptual design. In: Proceedings of the 37th International Conference on Software Engineering, vol. 2, Piscataway, NJ, USA, pp. 263–272. IEEE Press (2015)
Anvari, F., Richards, D., Hitchens, M., Babar, M.A., Tran, H.M.T., Busch, P.: An empirical investigation of the influence of persona with personality traits on conceptual design. J. Syst. Softw. 134, 324–339 (2017). https://doi.org/10.1016/j.jss.2017.09.020
Gosling, S.D., Rentfrow, P.J., Swann, W.B.: A very brief measure of the Big-Five personality domains. J. Res. Pers. 37, 504–528 (2003)
Ardelt, M.: Still stable after all these years? Personality stability theory revisited. Soc. Psychol. Q. 392–405 (2000)
Leong, L.-Y., Jaafar, N.I., Sulaiman, A.: Understanding impulse purchase in Facebook commerce: does Big Five matter? Internet Res. 27, 786–818 (2017)
Hoffman, L.R.: Homogeneity of member personality and its effect on group problem-solving. J. Abnorm. Soc. Psychol. 58, 27 (1959)
Barrick, M.R., Mount, M.K.: The Big Five personality dimensions and job performance: a meta-analysis. Personnel Psychol. 44, 1–26 (1991)
Schoen, H., Schumann, S.: Personality traits, partisan attitudes, and voting behavior. Evidence from Germany. Polit. Psychol. 28, 471–498 (2007)
Haugtvedt, C.P., Petty, R.E., Cacioppo, J.T.: Need for cognition and advertising: understanding the role of personality variables in consumer behavior. J. Consum. Psychol. 1, 239–260 (1992)
Salminen, J., Guan, K., Jung, S.-G., Chowdhury, S.A., Jansen, B.J.: A literature review of quantitative persona creation. In: Proceedings of the ACM Conference of Human Factors in Computing Systems (CHI 2020), Honolulu, Hawaii, USA. ACM (2020)
An, J., Kwak, H., Salminen, J., Jung, S., Jansen, B.J.: Imaginary people representing real numbers: generating personas from online social media data. ACM Trans. Web (TWEB) 12, 1–26 (2018)
Alam, F., Riccardi, G.: Predicting personality traits using multimodal information. In: Proceedings of the 2014 ACM Multi Media on Workshop on Computational Personality Recognition, pp. 15–18. ACM (2014)
Bleidorn, W., Hopwood, C.J.: Using machine learning to advance personality assessment and theory. Pers. Soc. Psychol. Rev. 1088868318772990 (2018)
Majumder, N., Poria, S., Gelbukh, A., Cambria, E.: Deep learning-based document modeling for personality detection from text. IEEE Intell. Syst. 32, 74–79 (2017). https://doi.org/10.1109/MIS.2017.23
Kim, J.H., Kim, Y.: Instagram user characteristics and the color of their photos: colorfulness, color diversity, and color harmony. Inf. Process. Manag. 56, 1494–1505 (2019). https://doi.org/10.1016/j.ipm.2018.10.018
Carducci, G., Rizzo, G., Monti, D., Palumbo, E., Morisio, M.: TwitPersonality: computing personality traits from tweets using word embeddings and supervised learning. Information 9, 127 (2018)
An, J., Kwak, H., Jung, S., Salminen, J., Jansen, B.J.: Customer segmentation using online platforms: isolating behavioral and demographic segments for persona creation via aggregated user data. Soc. Netw. Anal. Min. 8 (2018). https://doi.org/10.1007/s13278-018-0531-0
Salminen, J., et al.: From 2,772 segments to five personas: summarizing a diverse online audience by generating culturally adapted personas. First Monday 23 (2018). https://doi.org/10.5210/fm.v23i6.8415
Cambria, E.: Affective computing and sentiment analysis. IEEE Intell. Syst. 31, 102–107 (2016)
Pennebaker, J.W., King, L.A.: Linguistic styles: language use as an individual difference. J. Pers. Soc. Psychol. 77, 1296 (1999)
Tausczik, Y.R., Pennebaker, J.W.: The psychological meaning of words: LIWC and computerized text analysis methods. J. Lang. Soc. Psychol. 29, 24–54 (2010). https://doi.org/10.1177/0261927X09351676
Tskhay, K.O., Rule, N.O.: Perceptions of personality in text-based media and OSN: a meta-analysis. J. Res. Personal. 49, 25–30 (2014)
Xue, D., et al.: Deep learning-based personality recognition from text posts of online social networks. Appl. Intell. 48, 4232–4246 (2018). https://doi.org/10.1007/s10489-018-1212-4
Howlader, P., Pal, K.K., Cuzzocrea, A., Kumar, S.D.: Predicting Facebook-users’ personality based on status and linguistic features via flexible regression analysis techniques. In: Proceedings of the 33rd Annual ACM Symposium on Applied Computing, pp. 339–345. ACM (2018)
Luyckx, K., Daelemans, W.: Using syntactic features to predict author personality from text. Proc. Digital Human. 2008, 146–149 (2008)
Mairesse, F., Walker, M.A., Mehl, M.R., Moore, R.K.: Using linguistic cues for the automatic recognition of personality in conversation and text. J. Artif. Intell. Res. 30, 457–500 (2007)
Rammstedt, B., John, O.P.: Measuring personality in one minute or less: a 10-item short version of the Big Five inventory in English and German. J. Res. Pers. 41, 203–212 (2007)
Fang, J., Wen, C., Prybutok, V.: An assessment of equivalence between paper and social media surveys: the role of social desirability and satisficing. Comput. Hum. Behav. 30, 335–343 (2014)
Kozinets, R.V., Dolbec, P.-Y., Earley, A.: Netnographic analysis: understanding culture through social media data. The SAGE Handbook of Qualitative Data Analysis, pp. 262–276 (2014)
Plank, B., Hovy, D.: Personality traits on Twitter—or—how to get 1,500 personality tests in a week. In: Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 92–98 (2015)
Pratama, B.Y., Sarno, R.: Personality classification based on Twitter text using Naive Bayes, KNN and SVM. In: 2015 International Conference on Data and Software Engineering (ICoDSE), pp. 170–174 (2015). https://doi.org/10.1109/ICODSE.2015.7436992
Sewwandi, D., Perera, K., Sandaruwan, S., Lakchani, O., Nugaliyadde, A., Thelijjagoda, S.: Linguistic features based personality recognition using social media data. In: 2017 6th National Conference on Technology and Management (NCTM), pp. 63–68. IEEE (2017)
Mitrou, L., Kandias, M., Stavrou, V., Gritzalis, D.: Social media profiling: a Panopticon or Omniopticon tool? In: Proceedings of the 6th Conference of the Surveillance Studies Network, Barcelona, Spain (2014)
Darliansyah, A., Naeem, M.A., Mirza, F., Pears, R.: SENTIPEDE: a smart system for sentiment-based personality detection from short texts. J. Univ. Comput. Sci. 25, 1323–1352 (2019)
Tandera, T., Suhartono, D., Wongso, R., Prasetio, Y.L.: Personality prediction system from Facebook users. Procedia Comput. Sci. 116, 604–611 (2017)
Yılmaz, T., Ergil, A., İlgen, B.: Deep learning-based document modeling for personality detection from Turkish texts. In: Arai, K., Bhatia, R., Kapoor, S. (eds.) FTC 2019. AISC, vol. 1069, pp. 729–736. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-32520-6_53
Agarwal, B.: Personality detection from text: a review. Int. J. Comput. Syst. 1, 1–4 (2014)
Stillwell, D.J., Kosinski, M.: myPersonality project: example of successful utilization of online social networks for large-scale social research. Presented at the International Conference on Mobile Systems (MobiSys) (2012)
Biel, J.-I., Gatica-Perez, D., Dines, J., Tsminiaki, V.: Hi YouTube! personality impressions and verbal content in social video. https://infoscience.epfl.ch/record/196978. https://doi.org/10.1145/2522848.2522877. Accessed 07 Jan 2020
Jones, M., Marsden, G.: Mobile Interaction Design. Wiley (2006)
Negru, S., Buraga, S.: A knowledge-based approach to the user-centered design process. In: Fred, A., Dietz, Jan L.G., Liu, K., Filipe, J. (eds.) IC3K 2012. CCIS, vol. 415, pp. 165–178. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-54105-6_11
Pichler, R.: A template for writing great personas (2012)
Salminen, J., Jansen, B.J., An, J., Kwak, H., Jung, S.-G.: Automatic persona generation for online content creators: conceptual rationale and a research agenda. Personas - User Focused Design. HIS, pp. 135–160. Springer, London (2019). https://doi.org/10.1007/978-1-4471-7427-1_8
Anvari, F., Tran, H.M.T.: Persona ontology for user centred design professionals. In: The ICIME 4th International Conference on Information Management and Evaluation, Ho Chi Minh City, Vietnam, pp. 35–44 (2013)
Câmara, M., Signoretti, A., Costa, C., Soares, S.C.: Business Affective Persona (BAP): a methodology to create personas to enhance customer relationship with trust and empathy. Revista Turismo Desenvolvimento, pp. 85–97 (2018)
Salminen, J., Jung, S., Chowdhury, S.A., Sengün, S., Jansen, B.J.: Personas and analytics: a comparative user study of efficiency and effectiveness for a user identification task. In: Proceedings of the ACM Conference of Human Factors in Computing Systems (CHI 2020), Honolulu, Hawaii, USA. ACM (2020). https://doi.org/10.1145/3313831.3376770
Jung, S., Salminen, J., Kwak, H., An, J., Jansen, B.J.: Automatic Persona Generation (APG): a rationale and demonstration. In: Proceedings of the 2018 Conference on Human Information Interaction & Retrieval, New Brunswick, NJ, USA, pp. 321–324. ACM (2018). https://doi.org/10.1145/3176349.3176893
Jung, S., Salminen, J., An, J., Kwak, H., Jansen, B.J.: Automatically conceptualizing social media analytics data via personas. Presented at the International AAAI Conference on Web and Social Media (ICWSM 2018), San Francisco, California, USA, 25 June 2018 (2018)
Jung, S., Salminen, J., Jansen, B.J.: Personas Changing over time: analyzing variations of data-driven personas during a two-year period. In: Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems, New York, NY, USA, pp. LBW2714:1–LBW2714:6. ACM (2019). https://doi.org/10.1145/3290607.3312955
Salminen, J., et al.: Generating cultural personas from social data: a perspective of middle eastern users. In: Proceedings of the Fourth International Symposium on Social Networks Analysis, Management and Security (SNAMS-2017), Prague, Czech Republic. IEEE (2017). https://doi.org/10.1109/FiCloudW.2017.97
Lee, D.D., Seung, S.H.: Learning the parts of objects by non-negative matrix factorization. Nature 401, 788–791 (1999)
Norman, W.T.: Toward an adequate taxonomy of personality attributes: replicated factor structure in peer nomination personality ratings. J. Abnorm. Soc. Psychol. 66, 574 (1963)
Ashton, M.C., Lee, K.: How well do Big Five measures capture HEXACO scale variance? J. Pers. Assess. 101, 567–573 (2019)
Goldberg, L.R.: The development of markers for the Big-Five factor structure. Psychol. Assess. 4, 26 (1992)
Yin, C., Zhang, X., Liu, L.: Reposting negative information on microblogs: do personality traits matter? Inf. Process. Manag. 57, 102106 (2020). https://doi.org/10.1016/j.ipm.2019.102106
Sun, X., Liu, B., Cao, J., Luo, J., Shen, X.: Who am I? Personality detection based on deep learning for texts. In: 2018 IEEE International Conference on Communications (ICC), pp. 1–6 (2018). https://doi.org/10.1109/ICC.2018.8422105
Cawley, G.C., Talbot, N.L.: Efficient leave-one-out cross-validation of kernel fisher discriminant classifiers. Pattern Recogn. 36, 2585–2592 (2003)
Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31st International Conference on Machine Learning (ICML 2014), pp. 1188–1196 (2014)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521, 436 (2015)
Goodman-Deane, J., Waller, S., Demin, D., González-de-Heredia, A., Bradley, M., Clarkson, J.P.: Evaluating inclusivity using quantitative personas. Presented at the Design Research Society Conference, 28 June 2018 (2018). https://doi.org/10.21606/drs.2018.400
Tu, N., et al.: Combine qualitative and quantitative methods to create persona. In: 2010 3rd International Conference on Information Management, Innovation Management and Industrial Engineering, pp. 597–603 (2010). https://doi.org/10.1109/ICIII.2010.463
Salminen, J., Jung, S.G., Jansen, B.J.: The future of data-driven personas: a marriage of online analytics numbers and human attributes. In: ICEIS 2019 - Proceedings of the 21st International Conference on Enterprise Information Systems, Heraklion, Greece, pp. 596–603. SciTePress (2019)
Jung, S., An, J., Kwak, H., Ahmad, M., Nielsen, L., Jansen, B.J.: Persona generation from aggregated social media data. In: Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems, Denver, Colorado, USA, pp. 1748–1755. ACM (2017)
Salminen, J., Liu, Y.-H., Sengun, S., Santos, J.M., Jung, S.-G., Jansen, B.J.: The effect of numerical and textual information on visual engagement and perceptions of ai-driven persona interfaces. In: Proceedings of the ACM Intelligent User Interfaces (IUI 2020), Cagliary, Italy. ACM (2020)
Salminen, J., Jung, S.-G., Jansen, B.J.: Detecting demographic bias in automatically generated personas. In: Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems, pp. LBW0122:1–LBW0122:6. ACM, New York (2019). https://doi.org/10.1145/3290607.3313034
Phillips, M.J.: Ethics and Manipulation in Advertising: Answering a Flawed Indictment. Greenwood Publishing Group (1997)
Acknowledgments
We thank Dr. Lene Nielsen for discussions and inspiration on how to potentially display the automatically inferred personality traits in data-driven personas. We thank Al Jazeera Media Network for sharing the data that made this research possible.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Salminen, J., Rao, R.G., Jung, Sg., Chowdhury, S.A., Jansen, B.J. (2020). Enriching Social Media Personas with Personality Traits: A Deep Learning Approach Using the Big Five Classes. In: Degen, H., Reinerman-Jones, L. (eds) Artificial Intelligence in HCI. HCII 2020. Lecture Notes in Computer Science(), vol 12217. Springer, Cham. https://doi.org/10.1007/978-3-030-50334-5_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-50334-5_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-50333-8
Online ISBN: 978-3-030-50334-5
eBook Packages: Computer ScienceComputer Science (R0)