Abstract
Users of social media are not only producers and consumers of online content, they also evaluate each other’s content. Some social media include the possibility to down vote or dislike the content posted by other users, posing the risk that users who receive dislikes might be more likely to become inactive, especially if the disliked content is about a person. We analyzed the data on more than 150,000 YouTube videos to understand how video impact and user incentives can be related to the possibility to dislike user content. We processed images related to videos to identify faces and quantify if evaluating content related to people is connected to disliking patterns. We found that videos with faces on their images tend to have less dislikes if they are posted by male users, but the effect is not present for female users. On the contrary, videos with faces and posted by female users attract more views and likes. Analyzing the probability of users to become inactive, we find that receiving dislikes is associated with users becoming inactive. This pattern is stronger when dislikes are given to videos with faces, showing that negative evaluations about people have a stronger association with user inactivity. Our results show that user evaluations in social media are a multi-faceted phenomenon that requires large-scale quantitative analyses, identifying under which conditions users disencourage other users from being active in social media.
Notes
- 1.
- 2.
We replicated the analysis with alternative intervals of one and three months to determine inactivity, and regression models were qualitatively unchanged.
References
Abisheva, A., Garcia, D., Schweitzer, F.: When the filter bubble bursts: collective evaluation dynamics in online communities. In: Proceedings of the 8th ACM Conference on Web Science, pp. 307–308 (2016)
Abisheva, A., Garimella, V.R.K., Garcia, D., Weber, I.: Who watches (and shares) what on YouTube? and when? using Twitter to understand YouTube viewership. In: Proceedings of the 7th ACM International Conference on Web Search and Data Mining, pp. 593–602 (2014)
Alstott, J., Bullmore, E., Plenz, D.: powerlaw: a python package for analysis of heavy-tailed distributions. PLoS ONE 9(1), e85777 (2014)
Aragón, P., Gómez, V., Kaltenbrunner, A.: To thread or not to thread: the impact of conversation threading on online discussion. In: ICWSM, pp. 12–21 (2017)
Bates, D., Mchler, M., Bolker, B., Walker, S.: Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67(1), 1–48 (2015)
Cashmore, P.: Should Facebook add a dislike button? In: CNN articles (2010). http://cnn.it/2t7tu2h
Castells, M.: The Rise of the Network Society, vol. 1. Wiley, Hoboken (1996)
Clauset, A., Shalizi, C.R., Newman, M.E.J.: Power-law distributions in empirical data. SIAM Rev. 51(4), 661 (2009)
Crane, R., Sornette, D.: Robust dynamic classes revealed by measuring the response function of a social system. Proc. Natl. Acad. Sci. 105(41), 15649–15653 (2008)
Del Vicario, M., Bessi, A., Zollo, F., Petroni, F., Scala, A., Caldarelli, G., Stanley, H.E., Quattrociocchi, W.: The spreading of misinformation online. Proc. Natl. Acad. Sci. 113(3), 554–559 (2016)
Garcia, D., Mavrodiev, P., Casati, D., Schweitzer, F.: Understanding popularity, reputation, and social influence in the Twitter society. Policy Internet (2017)
Garcia, D., Mavrodiev, P., Schweitzer, F.: Social resilience in online communities: the autopsy of friendster. In: Proceedings of the 1st ACM Conference in Online Social Networks (COSN 2013), pp. 39–50 (2013)
Garcia, D., Mendez, F., Serdult, U., Schweitzer, F.: Political polarization and popularity in online participatory media: an integrated approach. In: Proceedings of the 1st Workshop on Politics, Elections and Data - PLEAD 2012, pp. 3–10 (2012)
Garcia, D., Strohmaier, M.: The qwerty effect on the web: how typing shapes the meaning of words in online human-computer interaction. In: Proceedings of the 25th International Conference on World Wide Web, pp. 661–670 (2016)
Greenwald, A.G., Banaji, M.R.: Implicit social cognition: attitudes, self-esteem, and stereotypes. Psychol. Rev. 102(1), 4 (1995)
Hannák, A., Wagner, C., Garcia, D., Mislove, A., Strohmaier, M., Wilson, C.: Bias in online freelance marketplaces: evidence from taskrabbit and fiverr. In: CSCW, pp. 1914–1933 (2017)
Huang, T.K., Ribeiro, B., Madhyastha, H.V., Faloutsos, M.: The socio-monetary incentives of online social network malware campaigns. In: Proceedings of the Second ACM Conference on Online Social Networks, pp. 259–270 (2014)
Karimi, F., Wagner, C., Lemmerich, F., Jadidi, M., Strohmaier, M.: Inferring gender from names on the web: a comparative evaluation of gender detection methods. In: Proceedings of the 25th International Conference Companion on World Wide Web, pp. 53–54 (2016)
Karnstedt, M., Hennessy, T., Chan, J., Hayes, C.: Churn in social networks: a discussion boards case study. In: 2010 IEEE Second International Conference on Social Computing (SocialCom), pp. 233–240. IEEE (2010)
Kooti, F., Magno, G., Weber, I.: The social name-letter effect on online social networks. In: International Conference on Social Informatics, pp. 216–227 (2014)
Lakkaraju, H., McAuley, J.J., Leskovec, J.: What’s in a name? understanding the interplay between titles, content, and communities in social media. ICWSM 1(2), 3 (2013)
Magno, G., Weber, I.: International gender differences and gaps in online social networks. In: International Conference on Social Informatics, pp. 121–138 (2014)
McAuley, J., Leskovec, J.: Hidden factors and hidden topics: understanding rating dimensions with review text. In: Proceedings of the 7th ACM Conference on Recommender Systems, RecSys 2013, pp. 165–172. ACM (2013)
Mieghem, P.V.: Human psychology of common appraisal: the reddit score. IEEE Trans. Multimed. 13(6), 1404–1406 (2011)
Newman, M.E.: Power laws, pareto distributions and zipf’s law. Contemp. Phys. 46(5), 323–351 (2005)
Nilizadeh, S., Groggel, A., Lista, P., Das, S., Ahn, Y.Y., Kapadia, A., Rojas, F.: Twitter’s glass ceiling: the effect of perceived gender on online visibility. In: ICWSM, pp. 289–298 (2016)
Oksanen, A., Garcia, D., Sirola, A., Nsi, M., Kaakinen, M., Keipi, T., Rsnen, P.: Pro-anorexia and anti-pro-anorexia videos on YouTube: sentiment analysis of user responses. J. Med. Internet Res. 11(17), e2560 (2015)
Palmer, C.: Risk perception: another look at the ‘white male’ effect. Health, Risk Soc. 5(1), 71–83 (2003)
Ribeiro, B.: Modeling and predicting the growth and death of membership-based websites. In: Proceedings of the 23rd International Conference on World Wide Web, WWW 2014, pp. 653–664. ACM (2014)
Simmel, G.: Fashion. Am. J. Sociol. 62(6), 541–558 (1957)
Stommel, S., Garcia, D., Abisheva, A., Schweitzer, F.: Anticipated shocks in online activity: response functions of attention and word-of-mouth processes. In: Proceedings of the 8th ACM Conference on Web Science, pp. 274–275 (2016)
Szabo, G., Huberman, B.A.: Predicting the popularity of online content. Commun. ACM 53(8), 80 (2010)
Szell, M., Thurner, S.: Measuring social dynamics in a massive multiplayer online game. Soc. Netw. 32(4), 313–329 (2010)
Tassi, P.: Facebook Didn’t Kill Digg, Reddit Did. In: Forbes (2012). http://bit.ly/2tx8e5C
Tufekci, Z.: Big questions for social media big data: representativeness, validity and other methodological pitfalls. In: ICWSM (2014)
Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 417–424 (2002)
Walker, J., Ante, S.E.: Once a social media star, Digg sells for $500,000. The Wall Street J. (2012). http://on.wsj.com/2uv1AAS
Yasseri, T., Sumi, R., Rung, A., Kornai, A., Kertsz, J.: Dynamics of conflicts in Wikipedia. PLOS ONE 7(6), 1–12 (2012)
Zhou, E., Cao, Z., Yin, Q.: Naive-deep face recognition: touching the limit of LFW benchmark or not? arXiv preprint arXiv:1501.04690 (2015)
Acknowledgements
This research was funded by the Swiss NSF (Grant number: CR21I1_146499)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
As a preliminary step to fitting models and testing hypotheses, we survey descriptive statistics to guide the models explained in the previous section. The distributions of views, likes, and dislikes per video are shown on Fig. 3. The histogram of the left panel confirms our observations over the mean and median values of Table 1: all variables are right skewed. This skewness presents heavy right tails that, when eyeballing the plots, suggest the possibility that views, likes, and dislikes follow power-law distributions. Nevertheless, this possibility seems less plausible on the Complementary Cumulative Density Function (CCDF) shown on the right panel of Fig. 3, where the right tails decay faster than it would be expected for a power-law.
To have a better idea on whether the distributions of amount of views, likes, and dislikes might have scaling properties or diverging moments [25], we applied the method explained in [3] to verify that they do not follow a power-law distribution. We fitted power-law and log-normal distributions to the empirical data, comparing the fits in a log-likelihood ratio test. The results lend very strong evidence favoring the log-normal distribution over the power-law in all three cases: views (\(LLR=449.97, p<0.01\)), likes (\(LLR=275.7, p<0.01\)), and dislikes (\(LLR=159.99, p<0.01\)). This is an example of how informal statistics can be misleading in deciding whether distributions follow a power-law [3, 8], suggesting that we should assume the distributions as log-normally distributed instead.
To ensure that we analyze the evaluative tendencies of videos and not their intrinsic correlation with video popularity, we divide likes and dislikes by the amount of views in the variables \(L_R=likes/views\) and \(D_R=dislikes/views\). These two variables and the amount of views are all roughly log-normally distributed, as it can be appreciated on the histograms of log-transformed values shown on the upper panels of Fig. 4. Some minor skewness can be attributed to integer approximations and boundary values. To cope with these possible deviations from normality in our models, we perform regression diagnostics to model fits to check that residuals are approximately normally distributed.
Figure 4D shows the distribution of the logarithm of the time between videos of the same user \(log(\Delta t)\). A clear bimodality is present, but it disappears when normalizing over the average time between videos of each user \( \langle t \rangle \). Figure 4E shows the distribution of \(log(\Delta t/\langle t \rangle )\), where no bimodality can be observed. This points to the source of bimodality being a variable at the user level, i.e. the activity rate of each user, as the distribution of time intervals collapses to a unimodal distribution after normalization. In our mixed effects regression models of P(I), we include random effects in the form of an intercept for each user that correct for this pattern, ensuring that our results are not a confound with idiographic properties of the users.
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Garcia, D., Abisheva, A., Schweitzer, F. (2017). Evaluative Patterns and Incentives in YouTube. In: Ciampaglia, G., Mashhadi, A., Yasseri, T. (eds) Social Informatics. SocInfo 2017. Lecture Notes in Computer Science(), vol 10540. Springer, Cham. https://doi.org/10.1007/978-3-319-67256-4_24
Download citation
DOI: https://doi.org/10.1007/978-3-319-67256-4_24
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67255-7
Online ISBN: 978-3-319-67256-4
eBook Packages: Computer ScienceComputer Science (R0)