Skip to main content

Enriching Social Media Personas with Personality Traits: A Deep Learning Approach Using the Big Five Classes

  • Conference paper
  • First Online:
Artificial Intelligence in HCI (HCII 2020)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12217))

Included in the following conference series:

Abstract

To predict personality traits of data-driven personas, we apply an automatic persona generation methodology to generate 15 personas from the social media data of an online news organization. After generating the personas, we aggregate each personas’ YouTube comments and predict the “Big Five” personality traits of each persona from the comments pertaining to that persona. For this, we develop a deep learning classifier using three publicly available datasets. Results indicate an average performance increase of 4.84% in F1 scores relative to the baseline. We then analyze how the personas differ by their detected personality traits and discuss how personality traits could be implemented in data-driven persona profiles, as either scores or narratives.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.youtube.com/channel/UCV3Nm3T-XAgVhKH9jT0ViRg.

  2. 2.

    https://developers.google.com/youtube/analytics/.

  3. 3.

    The MPD dataset was previously available on the Web (http://mypersonality.org), but at the time of writing it has been withdrawn. The YT dataset is available upon request (https://www.idiap.ch/dataset/youtube-personality), and the essays dataset can be readily downloaded (https://github.com/SenticNet/personality-detection/blob/master/essays.csv).

  4. 4.

    https://github.com/senticnet/personality-detection.

References

  1. Cooper, A.: The Inmates Are Running the Asylum: Why High Tech Products Drive Us Crazy and How to Restore the Sanity. Sams - Pearson Education, Indianapolis (1999)

    Book  Google Scholar 

  2. Pruitt, J., Grudin, J.: Personas: practice and theory. In: Proceedings of the 2003 Conference on Designing for User Experiences, pp. 1–15. ACM, New York (2003). https://doi.org/10.1145/997078.997089

  3. Nielsen, L.: Personas - User Focused Design. Springer, London (2013)

    Book  Google Scholar 

  4. Salminen, J., Jansen, B.J., An, J., Kwak, H., Jung, S.: Are personas done? Evaluating their usefulness in the age of digital analytics. Persona Stud. 4, 47–65 (2018). https://doi.org/10.21153/psj2018vol4no2art737

  5. LeRouge, C., Ma, J., Sneha, S., Tolle, K.: User profiles and personas in the design and development of consumer health technologies. Int. J. Med. Inform. 82, e251–e268 (2013). https://doi.org/10.1016/j.ijmedinf.2011.03.006

    Article  Google Scholar 

  6. Pruitt, J., Adlin, T.: The Persona Lifecycle: Keeping People in Mind Throughout Product Design. Morgan Kaufmann, Boston (2006)

    Google Scholar 

  7. Nielsen, L., Storgaard Hansen, K.: Personas is applicable: a study on the use of personas in Denmark. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. pp. 1665–1674. ACM (2014)

    Google Scholar 

  8. Salminen, J., Jung, S., An, J., Kwak, H., Nielsen, L., Jansen, B.J.: Confusion and information triggered by photos in persona profiles. Int. J. Hum.-Comput. Stud. 129, 1–14 (2019). https://doi.org/10.1016/j.ijhcs.2019.03.005

    Article  Google Scholar 

  9. Nielsen, L.: Personas - User Focused Design. Springer, New York (2019). https://doi.org/10.1007/978-1-4471-4084-9

    Book  Google Scholar 

  10. Nielsen, L., Hansen, K.S., Stage, J., Billestrup, J.: A template for design personas: analysis of 47 persona descriptions from Danish industries and organizations. Int. J. Sociotechnol. Knowl. Dev. 7, 45–61 (2015). https://doi.org/10.4018/ijskd.2015010104

    Article  Google Scholar 

  11. Anvari, F., Richards, D., Hitchens, M., Babar, M.A.: Effectiveness of persona with personality traits on conceptual design. In: Proceedings of the 37th International Conference on Software Engineering, vol. 2, Piscataway, NJ, USA, pp. 263–272. IEEE Press (2015)

    Google Scholar 

  12. Anvari, F., Richards, D., Hitchens, M., Babar, M.A., Tran, H.M.T., Busch, P.: An empirical investigation of the influence of persona with personality traits on conceptual design. J. Syst. Softw. 134, 324–339 (2017). https://doi.org/10.1016/j.jss.2017.09.020

    Article  Google Scholar 

  13. Gosling, S.D., Rentfrow, P.J., Swann, W.B.: A very brief measure of the Big-Five personality domains. J. Res. Pers. 37, 504–528 (2003)

    Article  Google Scholar 

  14. Ardelt, M.: Still stable after all these years? Personality stability theory revisited. Soc. Psychol. Q. 392–405 (2000)

    Google Scholar 

  15. Leong, L.-Y., Jaafar, N.I., Sulaiman, A.: Understanding impulse purchase in Facebook commerce: does Big Five matter? Internet Res. 27, 786–818 (2017)

    Article  Google Scholar 

  16. Hoffman, L.R.: Homogeneity of member personality and its effect on group problem-solving. J. Abnorm. Soc. Psychol. 58, 27 (1959)

    Google Scholar 

  17. Barrick, M.R., Mount, M.K.: The Big Five personality dimensions and job performance: a meta-analysis. Personnel Psychol. 44, 1–26 (1991)

    Article  Google Scholar 

  18. Schoen, H., Schumann, S.: Personality traits, partisan attitudes, and voting behavior. Evidence from Germany. Polit. Psychol. 28, 471–498 (2007)

    Article  Google Scholar 

  19. Haugtvedt, C.P., Petty, R.E., Cacioppo, J.T.: Need for cognition and advertising: understanding the role of personality variables in consumer behavior. J. Consum. Psychol. 1, 239–260 (1992)

    Article  Google Scholar 

  20. Salminen, J., Guan, K., Jung, S.-G., Chowdhury, S.A., Jansen, B.J.: A literature review of quantitative persona creation. In: Proceedings of the ACM Conference of Human Factors in Computing Systems (CHI 2020), Honolulu, Hawaii, USA. ACM (2020)

    Google Scholar 

  21. An, J., Kwak, H., Salminen, J., Jung, S., Jansen, B.J.: Imaginary people representing real numbers: generating personas from online social media data. ACM Trans. Web (TWEB) 12, 1–26 (2018)

    Article  Google Scholar 

  22. Alam, F., Riccardi, G.: Predicting personality traits using multimodal information. In: Proceedings of the 2014 ACM Multi Media on Workshop on Computational Personality Recognition, pp. 15–18. ACM (2014)

    Google Scholar 

  23. Bleidorn, W., Hopwood, C.J.: Using machine learning to advance personality assessment and theory. Pers. Soc. Psychol. Rev. 1088868318772990 (2018)

    Google Scholar 

  24. Majumder, N., Poria, S., Gelbukh, A., Cambria, E.: Deep learning-based document modeling for personality detection from text. IEEE Intell. Syst. 32, 74–79 (2017). https://doi.org/10.1109/MIS.2017.23

    Article  Google Scholar 

  25. Kim, J.H., Kim, Y.: Instagram user characteristics and the color of their photos: colorfulness, color diversity, and color harmony. Inf. Process. Manag. 56, 1494–1505 (2019). https://doi.org/10.1016/j.ipm.2018.10.018

    Article  Google Scholar 

  26. Carducci, G., Rizzo, G., Monti, D., Palumbo, E., Morisio, M.: TwitPersonality: computing personality traits from tweets using word embeddings and supervised learning. Information 9, 127 (2018)

    Article  Google Scholar 

  27. An, J., Kwak, H., Jung, S., Salminen, J., Jansen, B.J.: Customer segmentation using online platforms: isolating behavioral and demographic segments for persona creation via aggregated user data. Soc. Netw. Anal. Min. 8 (2018). https://doi.org/10.1007/s13278-018-0531-0

  28. Salminen, J., et al.: From 2,772 segments to five personas: summarizing a diverse online audience by generating culturally adapted personas. First Monday 23 (2018). https://doi.org/10.5210/fm.v23i6.8415

  29. Cambria, E.: Affective computing and sentiment analysis. IEEE Intell. Syst. 31, 102–107 (2016)

    Article  Google Scholar 

  30. Pennebaker, J.W., King, L.A.: Linguistic styles: language use as an individual difference. J. Pers. Soc. Psychol. 77, 1296 (1999)

    Article  Google Scholar 

  31. Tausczik, Y.R., Pennebaker, J.W.: The psychological meaning of words: LIWC and computerized text analysis methods. J. Lang. Soc. Psychol. 29, 24–54 (2010). https://doi.org/10.1177/0261927X09351676

    Article  Google Scholar 

  32. Tskhay, K.O., Rule, N.O.: Perceptions of personality in text-based media and OSN: a meta-analysis. J. Res. Personal. 49, 25–30 (2014)

    Article  Google Scholar 

  33. Xue, D., et al.: Deep learning-based personality recognition from text posts of online social networks. Appl. Intell. 48, 4232–4246 (2018). https://doi.org/10.1007/s10489-018-1212-4

    Article  Google Scholar 

  34. Howlader, P., Pal, K.K., Cuzzocrea, A., Kumar, S.D.: Predicting Facebook-users’ personality based on status and linguistic features via flexible regression analysis techniques. In: Proceedings of the 33rd Annual ACM Symposium on Applied Computing, pp. 339–345. ACM (2018)

    Google Scholar 

  35. Luyckx, K., Daelemans, W.: Using syntactic features to predict author personality from text. Proc. Digital Human. 2008, 146–149 (2008)

    Google Scholar 

  36. Mairesse, F., Walker, M.A., Mehl, M.R., Moore, R.K.: Using linguistic cues for the automatic recognition of personality in conversation and text. J. Artif. Intell. Res. 30, 457–500 (2007)

    Article  Google Scholar 

  37. Rammstedt, B., John, O.P.: Measuring personality in one minute or less: a 10-item short version of the Big Five inventory in English and German. J. Res. Pers. 41, 203–212 (2007)

    Article  Google Scholar 

  38. Fang, J., Wen, C., Prybutok, V.: An assessment of equivalence between paper and social media surveys: the role of social desirability and satisficing. Comput. Hum. Behav. 30, 335–343 (2014)

    Article  Google Scholar 

  39. Kozinets, R.V., Dolbec, P.-Y., Earley, A.: Netnographic analysis: understanding culture through social media data. The SAGE Handbook of Qualitative Data Analysis, pp. 262–276 (2014)

    Google Scholar 

  40. Plank, B., Hovy, D.: Personality traits on Twitter—or—how to get 1,500 personality tests in a week. In: Proceedings of the 6th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 92–98 (2015)

    Google Scholar 

  41. Pratama, B.Y., Sarno, R.: Personality classification based on Twitter text using Naive Bayes, KNN and SVM. In: 2015 International Conference on Data and Software Engineering (ICoDSE), pp. 170–174 (2015). https://doi.org/10.1109/ICODSE.2015.7436992

  42. Sewwandi, D., Perera, K., Sandaruwan, S., Lakchani, O., Nugaliyadde, A., Thelijjagoda, S.: Linguistic features based personality recognition using social media data. In: 2017 6th National Conference on Technology and Management (NCTM), pp. 63–68. IEEE (2017)

    Google Scholar 

  43. Mitrou, L., Kandias, M., Stavrou, V., Gritzalis, D.: Social media profiling: a Panopticon or Omniopticon tool? In: Proceedings of the 6th Conference of the Surveillance Studies Network, Barcelona, Spain (2014)

    Google Scholar 

  44. Darliansyah, A., Naeem, M.A., Mirza, F., Pears, R.: SENTIPEDE: a smart system for sentiment-based personality detection from short texts. J. Univ. Comput. Sci. 25, 1323–1352 (2019)

    Google Scholar 

  45. Tandera, T., Suhartono, D., Wongso, R., Prasetio, Y.L.: Personality prediction system from Facebook users. Procedia Comput. Sci. 116, 604–611 (2017)

    Article  Google Scholar 

  46. Yılmaz, T., Ergil, A., İlgen, B.: Deep learning-based document modeling for personality detection from Turkish texts. In: Arai, K., Bhatia, R., Kapoor, S. (eds.) FTC 2019. AISC, vol. 1069, pp. 729–736. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-32520-6_53

    Chapter  Google Scholar 

  47. Agarwal, B.: Personality detection from text: a review. Int. J. Comput. Syst. 1, 1–4 (2014)

    Google Scholar 

  48. Stillwell, D.J., Kosinski, M.: myPersonality project: example of successful utilization of online social networks for large-scale social research. Presented at the International Conference on Mobile Systems (MobiSys) (2012)

    Google Scholar 

  49. Biel, J.-I., Gatica-Perez, D., Dines, J., Tsminiaki, V.: Hi YouTube! personality impressions and verbal content in social video. https://infoscience.epfl.ch/record/196978. https://doi.org/10.1145/2522848.2522877. Accessed 07 Jan 2020

  50. Jones, M., Marsden, G.: Mobile Interaction Design. Wiley (2006)

    Google Scholar 

  51. Negru, S., Buraga, S.: A knowledge-based approach to the user-centered design process. In: Fred, A., Dietz, Jan L.G., Liu, K., Filipe, J. (eds.) IC3K 2012. CCIS, vol. 415, pp. 165–178. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-54105-6_11

    Chapter  Google Scholar 

  52. Pichler, R.: A template for writing great personas (2012)

    Google Scholar 

  53. Salminen, J., Jansen, B.J., An, J., Kwak, H., Jung, S.-G.: Automatic persona generation for online content creators: conceptual rationale and a research agenda. Personas - User Focused Design. HIS, pp. 135–160. Springer, London (2019). https://doi.org/10.1007/978-1-4471-7427-1_8

    Chapter  Google Scholar 

  54. Anvari, F., Tran, H.M.T.: Persona ontology for user centred design professionals. In: The ICIME 4th International Conference on Information Management and Evaluation, Ho Chi Minh City, Vietnam, pp. 35–44 (2013)

    Google Scholar 

  55. Câmara, M., Signoretti, A., Costa, C., Soares, S.C.: Business Affective Persona (BAP): a methodology to create personas to enhance customer relationship with trust and empathy. Revista Turismo Desenvolvimento, pp. 85–97 (2018)

    Google Scholar 

  56. Salminen, J., Jung, S., Chowdhury, S.A., Sengün, S., Jansen, B.J.: Personas and analytics: a comparative user study of efficiency and effectiveness for a user identification task. In: Proceedings of the ACM Conference of Human Factors in Computing Systems (CHI 2020), Honolulu, Hawaii, USA. ACM (2020). https://doi.org/10.1145/3313831.3376770

  57. Jung, S., Salminen, J., Kwak, H., An, J., Jansen, B.J.: Automatic Persona Generation (APG): a rationale and demonstration. In: Proceedings of the 2018 Conference on Human Information Interaction & Retrieval, New Brunswick, NJ, USA, pp. 321–324. ACM (2018). https://doi.org/10.1145/3176349.3176893

  58. Jung, S., Salminen, J., An, J., Kwak, H., Jansen, B.J.: Automatically conceptualizing social media analytics data via personas. Presented at the International AAAI Conference on Web and Social Media (ICWSM 2018), San Francisco, California, USA, 25 June 2018 (2018)

    Google Scholar 

  59. Jung, S., Salminen, J., Jansen, B.J.: Personas Changing over time: analyzing variations of data-driven personas during a two-year period. In: Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems, New York, NY, USA, pp. LBW2714:1–LBW2714:6. ACM (2019). https://doi.org/10.1145/3290607.3312955

  60. Salminen, J., et al.: Generating cultural personas from social data: a perspective of middle eastern users. In: Proceedings of the Fourth International Symposium on Social Networks Analysis, Management and Security (SNAMS-2017), Prague, Czech Republic. IEEE (2017). https://doi.org/10.1109/FiCloudW.2017.97

  61. Lee, D.D., Seung, S.H.: Learning the parts of objects by non-negative matrix factorization. Nature 401, 788–791 (1999)

    Article  Google Scholar 

  62. Norman, W.T.: Toward an adequate taxonomy of personality attributes: replicated factor structure in peer nomination personality ratings. J. Abnorm. Soc. Psychol. 66, 574 (1963)

    Article  Google Scholar 

  63. Ashton, M.C., Lee, K.: How well do Big Five measures capture HEXACO scale variance? J. Pers. Assess. 101, 567–573 (2019)

    Article  Google Scholar 

  64. Goldberg, L.R.: The development of markers for the Big-Five factor structure. Psychol. Assess. 4, 26 (1992)

    Article  Google Scholar 

  65. Yin, C., Zhang, X., Liu, L.: Reposting negative information on microblogs: do personality traits matter? Inf. Process. Manag. 57, 102106 (2020). https://doi.org/10.1016/j.ipm.2019.102106

    Article  Google Scholar 

  66. Sun, X., Liu, B., Cao, J., Luo, J., Shen, X.: Who am I? Personality detection based on deep learning for texts. In: 2018 IEEE International Conference on Communications (ICC), pp. 1–6 (2018). https://doi.org/10.1109/ICC.2018.8422105

  67. Cawley, G.C., Talbot, N.L.: Efficient leave-one-out cross-validation of kernel fisher discriminant classifiers. Pattern Recogn. 36, 2585–2592 (2003)

    Article  Google Scholar 

  68. Le, Q., Mikolov, T.: Distributed representations of sentences and documents. In: Proceedings of the 31st International Conference on Machine Learning (ICML 2014), pp. 1188–1196 (2014)

    Google Scholar 

  69. Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)

  70. LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521, 436 (2015)

    Article  Google Scholar 

  71. Goodman-Deane, J., Waller, S., Demin, D., González-de-Heredia, A., Bradley, M., Clarkson, J.P.: Evaluating inclusivity using quantitative personas. Presented at the Design Research Society Conference, 28 June 2018 (2018). https://doi.org/10.21606/drs.2018.400

  72. Tu, N., et al.: Combine qualitative and quantitative methods to create persona. In: 2010 3rd International Conference on Information Management, Innovation Management and Industrial Engineering, pp. 597–603 (2010). https://doi.org/10.1109/ICIII.2010.463

  73. Salminen, J., Jung, S.G., Jansen, B.J.: The future of data-driven personas: a marriage of online analytics numbers and human attributes. In: ICEIS 2019 - Proceedings of the 21st International Conference on Enterprise Information Systems, Heraklion, Greece, pp. 596–603. SciTePress (2019)

    Google Scholar 

  74. Jung, S., An, J., Kwak, H., Ahmad, M., Nielsen, L., Jansen, B.J.: Persona generation from aggregated social media data. In: Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems, Denver, Colorado, USA, pp. 1748–1755. ACM (2017)

    Google Scholar 

  75. Salminen, J., Liu, Y.-H., Sengun, S., Santos, J.M., Jung, S.-G., Jansen, B.J.: The effect of numerical and textual information on visual engagement and perceptions of ai-driven persona interfaces. In: Proceedings of the ACM Intelligent User Interfaces (IUI 2020), Cagliary, Italy. ACM (2020)

    Google Scholar 

  76. Salminen, J., Jung, S.-G., Jansen, B.J.: Detecting demographic bias in automatically generated personas. In: Extended Abstracts of the 2019 CHI Conference on Human Factors in Computing Systems, pp. LBW0122:1–LBW0122:6. ACM, New York (2019). https://doi.org/10.1145/3290607.3313034

  77. Phillips, M.J.: Ethics and Manipulation in Advertising: Answering a Flawed Indictment. Greenwood Publishing Group (1997)

    Google Scholar 

Download references

Acknowledgments

We thank Dr. Lene Nielsen for discussions and inspiration on how to potentially display the automatically inferred personality traits in data-driven personas. We thank Al Jazeera Media Network for sharing the data that made this research possible.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Joni Salminen .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Salminen, J., Rao, R.G., Jung, Sg., Chowdhury, S.A., Jansen, B.J. (2020). Enriching Social Media Personas with Personality Traits: A Deep Learning Approach Using the Big Five Classes. In: Degen, H., Reinerman-Jones, L. (eds) Artificial Intelligence in HCI. HCII 2020. Lecture Notes in Computer Science(), vol 12217. Springer, Cham. https://doi.org/10.1007/978-3-030-50334-5_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-50334-5_7

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-50333-8

  • Online ISBN: 978-3-030-50334-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics