Social profiling through image understanding: Personality inference using convolutional neural networks

doi:10.1016/j.cviu.2016.10.013

Computer Vision and Image Understanding

Volume 156, March 2017, Pages 34-50

https://doi.org/10.1016/j.cviu.2016.10.013 Get rights and content

Highlights

•
Linking OCEAN personality traits and preferred images in the Flickr social network.
•
Classification of personality traits by novel image features, designed by CNN.
•
Interpretation of visual features by an ad-hoc deconvolution strategy.
•
Online demo application.

Abstract

The role of images in the last ten years has changed radically due to the advent of social networks: from media objects mainly used to communicate visual information, images have become personal, associated with the people that create or interact with them (for example, giving a “like”). Therefore, in the same way that a post reveals something of its author, so now the images associated to a person may embed some of her individual characteristics, such as her personality traits. In this paper, we explore this new level of image understanding with the ultimate goal of relating a set of image preferences to personality traits by using a deep learning framework. In particular, our problem focuses on inferring both self-assessed (how the personality traits of a person can be guessed from her preferred image) and attributed traits (what impressions in terms of personality traits these images trigger in unacquainted people), learning a sort of wisdom of the crowds. Our characterization of each image is locked within the layers of a CNN, allowing us to discover more entangled attributes (aesthetic patterns and semantic information) and to better generalize the patterns that identify a trait. The experimental results show that the proposed method outperforms state-of-the-art results and captures what visually characterizes a certain trait: using a deconvolution strategy we found a clear distinction of features, patterns and content between low and high values in a given trait.

Introduction

Two directions have shaped the image understanding field of the last 30 years (Liu et al., 2007): the first is the one of the low-level processing, where basic information is extracted from the pixel values in the form of color histograms, frequency responses etc., and used to create a representation in a vectorial space, where tasks of clustering or classification can be carried out (Carson, Thomas, Belongie, Hellerstein, Malik, 1999, Vailaya, Figueiredo, Jain, Zhang, 2001). In the second direction, the semantic content of the image is extracted by means of segmentation, classification and detection approaches, and used for tasks such as content-based indexing and retrieval (Li, Su, Fei-Fei, Xing, 2010, Smeulders, Worring, Santini, Gupta, Jain, 2000).

The advent of Internet, the capability of dealing with big data, and the diffusion of social media, gave rise to a third way of dealing with images (Jin, Wang, Luo, Yu, Han, 2011, Vinciarelli, Pentland, 2015); specifically, images started being associated with people: in facts, images are now digital objects that could be easily uploaded by a certain user into social platforms such as Facebook, Flickr, and the like. Images can be also tagged as “preferred”, highlighting those shots that naturally meet expectations of one in terms of aesthetical preferences and/or semantic content.

Both of these activities (uploading and tagging pictures) indicate a substantial revolution in how images are used: from means to represent visual aspects of reality, where the ownership of the photo is neglected, they have become personal messages, from the sender (the subject which uploads the photos into a social network, or that selects some shots as favorite) to his receiver(s) (the user of the social network that sees the uploaded or the preferred pictures). In this fresh new perspective, uploading or “preferring” images will communicate something, that is, personal messages as the kind of subjects that one may like (cars, landscapes, people) or the life experiences one is going through. But images communicate more than this, and this fact does represent a true revolution in the image understanding field, with a new layer of image interpretation which has started to be unveiled; to explain this new perspective, the sender/receiver communication perspective discussed above becomes invaluable.

In dyadic face-to-face communications, people share their opinions, experiences and impressions of life by using explicit verbal signals (that is, spoken sentences) and non-verbal signals (for example, by how they deliver the sentences, or by assuming bodily expressions) (Vinciarelli, Mohammadi, 2014, Vinciarelli, Pentland, 2015). Many social psychology studies highlight the fundamental importance of both aspects, the verbal content and the non-verbal signals, for the successful exchange of messages. This two-body communication paradigm is modeled by the Brunswick lens, in the field of social psychology (Brunswik, 1956).

Very recently, the Brunswick lens model has been customized for this new kind of communication by images: in this new setting, personality traits have been considered as the social signals sent with the uploaded images, and whose inference is one of the most intriguing challenges. In this respect, the works of Cristani et al. (2013); Segalin et al. (2016) focused on inferring with a regressor the real personality traits of the sender (collected by self-assessed tests), but also those traits that unacquainted people (the assessors) associate with the sender by looking at her images. In particular, Segalin et al. (2016) showed that the assessors’ evaluations were 1) consistently similar, 2) in partial disagreement with the self-assessed evaluations, and 3) more easily predicted by machine learning techniques. In other words, the act of sharing images online may evoke a common psychological response in the receiving crowd, and this can be reasonably predicted by automatic approaches. Thus, it is possible to build a wisdom of the crowds model of personality profiles from collections of images, based on the impressions these may generate on a general hypothetical audience.

A limitation of the approach in Segalin et al. (2016) is that the features used to describe the images are taken from the computational aesthetics (CA) literature; in practice, CA often focuses on designing features that explain how a particular image has been captured, discarding the content of the images. In addition, given the wide spectrum of subjects appearing in database images, standard object recognition and feature extraction techniques might not be sufficient to capture significant dependencies between the pictures and the personality traits of their owner. This leads to the development of more advanced techniques such as feature learning, carried out in this paper by convolutional neural networks.

Computer vision with convolutional neural networks (CNNs) has received much attention in recent years, as it is well suited for processing large amounts of data and providing outstanding performances in classical problems like object (Krizhevsky et al., 2012) and image style (Karayev et al., 2013) recognition. In fact, our approach fine-tunes CNNs pre-trained for image classification with the intention of co-opting their effective representational power to indirectly capture the aesthetic attributes of photographs, with the ultimate goal of predicting the personality traits associated with them. This allows us to discover more entangled attributes and to better generalize the patterns that identify a trait. In practice, whereas CA features are explicitly crafted to reveal information about the style of an image, remaining agnostic w.r.t. the content of the image, CNNs exhibit no such limitation, capturing both the aesthetic patterns in the pictures and their content, unveiling semantic information (for example, capturing possible recurrent objects preferred by a user).

Experiments have been focused on the PsychoFlickr corpus (Segalin et al., 2016): the dataset provides 200 “favored” images from 300 Flickr users for a total of 60,000 images. Additionally, the personality profile of each user is described in terms of the Big Five traits (Rammstedt and John, 2007) extensively used in psychology: Openness to experience (O), Conscientiousness (C), Extraversion (E), Agreeableness (A) and Neuroticism (N). This information is collected both through a self-assessment questionnaire and an independent group of 12 assessors, rating the image sets of each user. This allows the corpus to supply two different evaluation criteria for the same data.

The experimental results show that the proposed method sufficiently captures what characterizes a certain trait: on a quantitative level, it performs around 10% better on attributed traits than on self-assessed ones, with a best accuracy of 68% on attributed Neuroticism; on a qualitative level, ranking the test images by confidence shows a clear distinction of features, patterns and content between low and high values in a given trait. These results also outperform (Segalin et al., 2016) when suitably re-casted from regression to classification. Finally, we also introduce an online application demo that uses our trained classifiers to predict personality traits given a proposed set of pictures liked by a subject.

In the following sections, we first describe some related work in computer vision and computational aesthetics; we then introduce our approach based on processing the PsychoFlickr corpus using convolutional neural networks, followed by a section discussing the results. Finally, we briefly present our demo and provide some concluding remarks.

Section snippets

Related work

The idea that aesthetic values are connected to features goes back at least to Birkoff in the 1930s (Birkhoff, 1933). Hoenig (Hoenig, 2005) in 2005 comprehensively defined computational aesthetics (CA) as a field of study with many emphasis on three important factors: computational methods, the human aesthetic point of view and the need to focus on objective approaches. CA is an inter-disciplinary area at the crossroad between computer vision and pattern recognition (CVPR), psychology, visual

Our approach

The groundbreaking success of the Convolutional Neural Networks (CNNs) in the ILSVRC challenges (Krizhevsky et al., 2012) have clearly demonstrated the aptitude of these classifiers at deconstructing the elements and features contained within photographs. Most importantly, they share some basic primitive components analyzed in the first works of personality inference (which essentially applied standard CA features): color, composition, textural properties, etc. More in the detail, CNNs

Experiments and results

In this section, we first describe a set of baseline experiments where CNNs are used as feature extractors and classification is performed with linear SVMs; we then apply our approach of fine-tuning pre-trained nets and compare these results against the literature and the baseline ones; in addition, we examine a few experiments on the original regression problem in the PsychoFlickr dataset; finally, we analyze the attributes learned by our models. All experiments were performed on a linux pc

Demo

As a matter of proof that our proposed method effectively works, we developed a web interface where a subject can upload an image or a set of images, paste a web address linked to a picture or select an image form a list already stored in the server that he/she likes¹. The proposed demo loads the models of the aesthetic preferences related to the attributed traits and classifies the pictures assigning them to the low or high

Conclusions

In this paper, we examine the problem of relating a set of image preferences to personality traits by using a deep learning framework. We cast this recently introduced application problem as a new level of image understanding that enhances the role of images through considerations on the social aspects of contemporary online activities. The role of social platforms like Flickr, Facebook, Instagram, etc., in building online social personas where most activities are shared to a wide audience

Acknowledgment

Dong Seon Cheng was supported by the Hankuk University of Foreign Studies Research Fund of 2015.

References (47)

A. Furnham et al.
Personality and preference for surreal paintings
Pers. Individual Differ.
(1997)
D.J. Hughes et al.
A tale of two sites: twitter vs. facebook and the personality predictors of social media usage
Comput. Human Behav.
(2012)
X. Lu et al.
Rapid: Rating pictorial aesthetics using deep learning
Proceedings of the ACM International Conference on Multimedia
(2014)
D. Rawlings et al.
Personality, creativity and aesthetic preference: comparing psychoticism, sensation seeking, schizotypy and openness to experience
Empirical Stud. Arts
(1998)
C. Bauckhage et al.
Can computers learn from the aesthetic wisdom of the crowd?
KI-Künstliche Intelligenz
(2013)
G.D. Birkhoff
Aesthetic measure
(1933)
E. Brunswik
Perception and the Representative Design of Psychological Experiments
(1956)
A. Campbell et al.
Feature discovery by deep learning for aesthetic analysis of evolved abstract images
Evolutionary and Biologically Inspired Music, Sound, Art and Design
(2015)
C. Carson et al.
Blobworld: A system for region-based image indexing and retrieval
Visual Information and Information Systems
(1999)
V. Ciesielski et al.
Finding Image Features Associated with High Aesthetic Value by Machine Learning
(2013)

P.T. Costa et al.

Revised NEO Personality Inventory (NEO PI-R) and NEO Five-Factor Inventory (NEO FFI): Professional Manual

(1992)

M. Cristani et al.

Unveiling the multimedia unconscious: Implicit cognitive processes and multimedia content analysis

Proceedings of the ACM international conference on Multimedia

(2013)

R. Datta

Semantics and Aesthetics Inference for Image Search: Statistical Learning Approaches. Ph.D. thesis

(2009)

R. Datta et al.

Studying aesthetics in photographic images using a computational approach

ECCV

(2006)

S. Dhar et al.

High level describable attributes for predicting aesthetics and interestingness

CVPR

(2011)

Furnham, A., Walker, J., 2001. The influence of personality traits, previous experience of art, and demographic...

P. Galanter

Computational aesthetic evaluation: past and future

Computers and Creativity

(2012)

J. Golbeck et al.

Predicting personality from twitter

Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference on

(2011)

Gong, E.,. Deep aesthetic...

S.D. Gosling et al.

Personality impressions based on facebook profiles.

ICWSM

(2007)

F. Hoenig

Defining computational aesthetics

Proceedings of the Eurographics Conference on Computational Aesthetics in Graphics, Visualization and Imaging

(2005)

Y. Jia et al.

Caffe: convolutional architecture for fast feature embedding

arXiv preprint arXiv:1408.5093

(2014)

X. Jin et al.

Likeminer: a system for mining the power of ‘like’ in social media networks

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining

(2011)

Cited by (49)

Personality modeling from image aesthetic attribute-aware graph representation learning
2022, Journal of Visual Communication and Image Representation
Citation Excerpt :
To address this issue, an end-to-end weakly supervised dual convolutional network was proposed to simultaneously model users’ five personality traits, leveraging their local attention mechanism on liked images [14]. In addition, Segalin et al. adopted CNN to train a personality model that maps an image to the personality traits of people who like the image [22]. Although advanced progress has been made in existing works, since not all liked images can reveal users’ personality traits (as shown in Fig. 1), it is unreasonable to directly take users’ personality traits as the supervised labels of all their liked images for model learning.
Recently, inferring users’ personality traits on social media has attracted extensive attention. Existing studies have shown that users’ personality traits can be inferred from their preferences for images. However, since users’ preferences on images are often affected by multiple factors, some liked images cannot effectively reflect their personality traits. To handle this issue, this paper proposes a personality modeling approach based on image aesthetic attribute-aware graph representation learning, which can leverage aesthetic attributes to refine the liked images that are consistent with users’ personality traits. Specifically, we first utilize a Convolutional Neural Network (CNN) to train an aesthetic attribute prediction module. Then, attribute-aware graph representation learning is introduced to refine the images with similar aesthetic attributes from users’ liked images. Finally, the aesthetic attributes of all refined images are combined to predict personality traits through a Multi-Layer Perceptron (MLP). Experimental results and visual analysis have shown that the proposed method is superior to state-of-the-art personality modeling methods.
Fuzzy and genetic algorithm based approach for classification of personality traits oriented social media images
2022, Knowledge-Based Systems
Citation Excerpt :
This is because images uploaded on social media are associated with the personality traits of the people that interact with them. Furthermore, a person’s personality traits reflect on their uploading and receiving images [4]. This is understandable because people use social media networks to share their daily activities, expressing emotions, sharing happiness, etc.
In recent years, the usage of social media has been increasing exponentially because of its various real world applications in digital communication such as content sharing, entertainment, creating awareness, sending alerts, etc. One such task is to upload images/videos, write comments and post user reactions to express feedback, which can then be used to study human personality traits. Classifying images according to different personality traits, like Agreeableness, Conscientiousness, Extraversion, Neuroticism, Openness, etc., is challenging and essential because of several real-world applications mentioned above. This paper proposes a new personality-traits based method for classifying social images using Fuzzy and genetic algorithms. For each user, the proposed approach extracts profile picture, banners and descriptions to construct a set of vocabularies with the help of text detection, recognition and image annotation. For each word in the vocabulary, we employ a fuzzy logic-based method for obtaining a fuzzy co-occurrence matrix by defining the relationship between the words, which results in a fuzzy co-occurrence matrix for each input data point. We also propose a genetic algorithm based fusion method to generate a feature matrix, which is ultimately fed to the fully connected neural network for classification. The effectiveness of the proposed approach is demonstrated on our dataset with five classes containing 5000 images along with four benchmark datasets, namely, (i) five classes of Liu et al.’s dataset (33556 images) (ii) five classes of PERS dataset (28434 images), (iii) ten classes of Krishnani et al.’s dataset (2000 images), and (iv) two classes of facial emotions of FERPlus dataset (26398 images). The results show that the proposed approach outperforms the existing methods for all the datasets in terms of classification rate.
Multimodal assessment of apparent personality using feature attention and error consistency constraint
2021, Image and Vision Computing
Citation Excerpt :
These studies indicate that there is a strong correlation between users' behavior on social networks and their personality [32]. Additionally, the exploitation of images and words used in public profiles in social networks is a way of obtaining an effective personality trait model, as shown in [33,34]. There are methods for recognition based on combinations of speaking style and body movements.
Personality computing and affective computing, where the recognition of personality traits is essential, have gained increasing interest and attention in many research areas recently. We propose a novel approach to recognize the Big Five personality traits of people from videos. To this end, we use four different modalities, namely, ambient appearance (scene), facial appearance, voice, and transcribed speech. Through a specialized subnetwork for each of these modalities, our model learns reliable modality-specific representations and fuse them using an attention mechanism that re-weights each dimension of these representations to obtain an optimal combination of multimodal information. A novel loss function is employed to enforce the proposed model to give an equivalent importance for each of the personality traits to be estimated through a consistency constraint that keeps the trait-specific errors as close as possible. To further enhance the reliability of our model, we employ (pre-trained) state-of-the-art architectures (i.e., ResNet, VGGish, ELMo) as the backbones of the modality-specific subnetworks, which are complemented by multilayered Long Short-Term Memory networks to capture temporal dynamics. To minimize the computational complexity of multimodal optimization, we use two-stage modeling, where the modality-specific subnetworks are first trained individually, and the whole network is then fine-tuned to jointly model multimodal data. On the large scale ChaLearn First Impressions V2 challenge dataset, we evaluate the reliability of our model as well as investigating the informativeness of the considered modalities. Experimental results show the effectiveness of the proposed attention mechanism and the error consistency constraint. While the best performance is obtained using facial information among individual modalities, with the use of all four modalities, our model achieves a mean accuracy of 91.8%, improving the state of the art in automatic personality analysis.
Psychological targeting in the age of Big Data
2021, Measuring and Modeling Persons and Situations
Advances in the collection, storage, and processing of large amounts of user data have given rise to psychological targeting, which we define as the process of extracting individuals’ psychological characteristics from their digital footprints in order to target them with psychologically-informed interventions at scale. In this chapter, we introduce a two-stage framework of psychological targeting consisting of (1) psychological profiling and (2) psychologically-informed interventions. We summarize the most important research findings in relation to the two stages and discuss important methodological opportunities and pitfalls. To help researchers make the most of the opportunities, we also provide practical advice on how to deal with some of the potential pitfalls. Finally, we highlight ethical opportunities and challenges and offer some suggestions for addressing these challenges. If done right, psychological targeting has the potential to advance our scientific understanding of human nature and to enhance the well-being of individuals and society at large.
A social image recommendation system based on deep reinforcement learning
2024, PLoS ONE
Seeing the Intangible: Surveying Automatic High-Level Visual Understanding from Still Images
2023, arXiv

View all citing articles on Scopus

View full text

Social profiling through image understanding: Personality inference using convolutional neural networks

Highlights

Abstract

Introduction

Section snippets

Related work

Our approach

Experiments and results

Demo

Conclusions

Acknowledgment

Pers. Individual Differ.

Comput. Human Behav.

Empirical Stud. Arts

Can computers learn from the aesthetic wisdom of the crowd?

KI-Künstliche Intelligenz

Aesthetic measure

Perception and the Representative Design of Psychological Experiments

Feature discovery by deep learning for aesthetic analysis of evolved abstract images

Evolutionary and Biologically Inspired Music, Sound, Art and Design

Blobworld: A system for region-based image indexing and retrieval

Visual Information and Information Systems

Finding Image Features Associated with High Aesthetic Value by Machine Learning

Revised NEO Personality Inventory (NEO PI-R) and NEO Five-Factor Inventory (NEO FFI): Professional Manual

Unveiling the multimedia unconscious: Implicit cognitive processes and multimedia content analysis

Proceedings of the ACM international conference on Multimedia

Semantics and Aesthetics Inference for Image Search: Statistical Learning Approaches. Ph.D. thesis

Studying aesthetics in photographic images using a computational approach

ECCV

High level describable attributes for predicting aesthetics and interestingness

CVPR

Computational aesthetic evaluation: past and future

Computers and Creativity

Predicting personality from twitter

Privacy, Security, Risk and Trust (PASSAT) and 2011 IEEE Third Inernational Conference on Social Computing (SocialCom), 2011 IEEE Third International Conference on

Personality impressions based on facebook profiles.

ICWSM

Defining computational aesthetics

Proceedings of the Eurographics Conference on Computational Aesthetics in Graphics, Visualization and Imaging

Caffe: convolutional architecture for fast feature embedding

arXiv preprint arXiv:1408.5093

Likeminer: a system for mining the power of ‘like’ in social media networks

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining