Skip to main content

Evaluative Patterns and Incentives in YouTube

  • Conference paper
  • First Online:
Social Informatics (SocInfo 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10540))

Included in the following conference series:

Abstract

Users of social media are not only producers and consumers of online content, they also evaluate each other’s content. Some social media include the possibility to down vote or dislike the content posted by other users, posing the risk that users who receive dislikes might be more likely to become inactive, especially if the disliked content is about a person. We analyzed the data on more than 150,000 YouTube videos to understand how video impact and user incentives can be related to the possibility to dislike user content. We processed images related to videos to identify faces and quantify if evaluating content related to people is connected to disliking patterns. We found that videos with faces on their images tend to have less dislikes if they are posted by male users, but the effect is not present for female users. On the contrary, videos with faces and posted by female users attract more views and likes. Analyzing the probability of users to become inactive, we find that receiving dislikes is associated with users becoming inactive. This pattern is stronger when dislikes are given to videos with faces, showing that negative evaluations about people have a stronger association with user inactivity. Our results show that user evaluations in social media are a multi-faceted phenomenon that requires large-scale quantitative analyses, identifying under which conditions users disencourage other users from being active in social media.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Notes

  1. 1.

    https://www.faceplusplus.com/.

  2. 2.

    We replicated the analysis with alternative intervals of one and three months to determine inactivity, and regression models were qualitatively unchanged.

References

  1. Abisheva, A., Garcia, D., Schweitzer, F.: When the filter bubble bursts: collective evaluation dynamics in online communities. In: Proceedings of the 8th ACM Conference on Web Science, pp. 307–308 (2016)

    Google Scholar 

  2. Abisheva, A., Garimella, V.R.K., Garcia, D., Weber, I.: Who watches (and shares) what on YouTube? and when? using Twitter to understand YouTube viewership. In: Proceedings of the 7th ACM International Conference on Web Search and Data Mining, pp. 593–602 (2014)

    Google Scholar 

  3. Alstott, J., Bullmore, E., Plenz, D.: powerlaw: a python package for analysis of heavy-tailed distributions. PLoS ONE 9(1), e85777 (2014)

    Article  Google Scholar 

  4. Aragón, P., Gómez, V., Kaltenbrunner, A.: To thread or not to thread: the impact of conversation threading on online discussion. In: ICWSM, pp. 12–21 (2017)

    Google Scholar 

  5. Bates, D., Mchler, M., Bolker, B., Walker, S.: Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67(1), 1–48 (2015)

    Article  Google Scholar 

  6. Cashmore, P.: Should Facebook add a dislike button? In: CNN articles (2010). http://cnn.it/2t7tu2h

  7. Castells, M.: The Rise of the Network Society, vol. 1. Wiley, Hoboken (1996)

    MATH  Google Scholar 

  8. Clauset, A., Shalizi, C.R., Newman, M.E.J.: Power-law distributions in empirical data. SIAM Rev. 51(4), 661 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  9. Crane, R., Sornette, D.: Robust dynamic classes revealed by measuring the response function of a social system. Proc. Natl. Acad. Sci. 105(41), 15649–15653 (2008)

    Article  Google Scholar 

  10. Del Vicario, M., Bessi, A., Zollo, F., Petroni, F., Scala, A., Caldarelli, G., Stanley, H.E., Quattrociocchi, W.: The spreading of misinformation online. Proc. Natl. Acad. Sci. 113(3), 554–559 (2016)

    Article  Google Scholar 

  11. Garcia, D., Mavrodiev, P., Casati, D., Schweitzer, F.: Understanding popularity, reputation, and social influence in the Twitter society. Policy Internet (2017)

    Google Scholar 

  12. Garcia, D., Mavrodiev, P., Schweitzer, F.: Social resilience in online communities: the autopsy of friendster. In: Proceedings of the 1st ACM Conference in Online Social Networks (COSN 2013), pp. 39–50 (2013)

    Google Scholar 

  13. Garcia, D., Mendez, F., Serdult, U., Schweitzer, F.: Political polarization and popularity in online participatory media: an integrated approach. In: Proceedings of the 1st Workshop on Politics, Elections and Data - PLEAD 2012, pp. 3–10 (2012)

    Google Scholar 

  14. Garcia, D., Strohmaier, M.: The qwerty effect on the web: how typing shapes the meaning of words in online human-computer interaction. In: Proceedings of the 25th International Conference on World Wide Web, pp. 661–670 (2016)

    Google Scholar 

  15. Greenwald, A.G., Banaji, M.R.: Implicit social cognition: attitudes, self-esteem, and stereotypes. Psychol. Rev. 102(1), 4 (1995)

    Article  Google Scholar 

  16. Hannák, A., Wagner, C., Garcia, D., Mislove, A., Strohmaier, M., Wilson, C.: Bias in online freelance marketplaces: evidence from taskrabbit and fiverr. In: CSCW, pp. 1914–1933 (2017)

    Google Scholar 

  17. Huang, T.K., Ribeiro, B., Madhyastha, H.V., Faloutsos, M.: The socio-monetary incentives of online social network malware campaigns. In: Proceedings of the Second ACM Conference on Online Social Networks, pp. 259–270 (2014)

    Google Scholar 

  18. Karimi, F., Wagner, C., Lemmerich, F., Jadidi, M., Strohmaier, M.: Inferring gender from names on the web: a comparative evaluation of gender detection methods. In: Proceedings of the 25th International Conference Companion on World Wide Web, pp. 53–54 (2016)

    Google Scholar 

  19. Karnstedt, M., Hennessy, T., Chan, J., Hayes, C.: Churn in social networks: a discussion boards case study. In: 2010 IEEE Second International Conference on Social Computing (SocialCom), pp. 233–240. IEEE (2010)

    Google Scholar 

  20. Kooti, F., Magno, G., Weber, I.: The social name-letter effect on online social networks. In: International Conference on Social Informatics, pp. 216–227 (2014)

    Google Scholar 

  21. Lakkaraju, H., McAuley, J.J., Leskovec, J.: What’s in a name? understanding the interplay between titles, content, and communities in social media. ICWSM 1(2), 3 (2013)

    Google Scholar 

  22. Magno, G., Weber, I.: International gender differences and gaps in online social networks. In: International Conference on Social Informatics, pp. 121–138 (2014)

    Google Scholar 

  23. McAuley, J., Leskovec, J.: Hidden factors and hidden topics: understanding rating dimensions with review text. In: Proceedings of the 7th ACM Conference on Recommender Systems, RecSys 2013, pp. 165–172. ACM (2013)

    Google Scholar 

  24. Mieghem, P.V.: Human psychology of common appraisal: the reddit score. IEEE Trans. Multimed. 13(6), 1404–1406 (2011)

    Article  Google Scholar 

  25. Newman, M.E.: Power laws, pareto distributions and zipf’s law. Contemp. Phys. 46(5), 323–351 (2005)

    Article  Google Scholar 

  26. Nilizadeh, S., Groggel, A., Lista, P., Das, S., Ahn, Y.Y., Kapadia, A., Rojas, F.: Twitter’s glass ceiling: the effect of perceived gender on online visibility. In: ICWSM, pp. 289–298 (2016)

    Google Scholar 

  27. Oksanen, A., Garcia, D., Sirola, A., Nsi, M., Kaakinen, M., Keipi, T., Rsnen, P.: Pro-anorexia and anti-pro-anorexia videos on YouTube: sentiment analysis of user responses. J. Med. Internet Res. 11(17), e2560 (2015)

    Google Scholar 

  28. Palmer, C.: Risk perception: another look at the ‘white male’ effect. Health, Risk Soc. 5(1), 71–83 (2003)

    Article  Google Scholar 

  29. Ribeiro, B.: Modeling and predicting the growth and death of membership-based websites. In: Proceedings of the 23rd International Conference on World Wide Web, WWW 2014, pp. 653–664. ACM (2014)

    Google Scholar 

  30. Simmel, G.: Fashion. Am. J. Sociol. 62(6), 541–558 (1957)

    Article  Google Scholar 

  31. Stommel, S., Garcia, D., Abisheva, A., Schweitzer, F.: Anticipated shocks in online activity: response functions of attention and word-of-mouth processes. In: Proceedings of the 8th ACM Conference on Web Science, pp. 274–275 (2016)

    Google Scholar 

  32. Szabo, G., Huberman, B.A.: Predicting the popularity of online content. Commun. ACM 53(8), 80 (2010)

    Article  Google Scholar 

  33. Szell, M., Thurner, S.: Measuring social dynamics in a massive multiplayer online game. Soc. Netw. 32(4), 313–329 (2010)

    Article  Google Scholar 

  34. Tassi, P.: Facebook Didn’t Kill Digg, Reddit Did. In: Forbes (2012). http://bit.ly/2tx8e5C

  35. Tufekci, Z.: Big questions for social media big data: representativeness, validity and other methodological pitfalls. In: ICWSM (2014)

    Google Scholar 

  36. Turney, P.D.: Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 417–424 (2002)

    Google Scholar 

  37. Walker, J., Ante, S.E.: Once a social media star, Digg sells for $500,000. The Wall Street J. (2012). http://on.wsj.com/2uv1AAS

  38. Yasseri, T., Sumi, R., Rung, A., Kornai, A., Kertsz, J.: Dynamics of conflicts in Wikipedia. PLOS ONE 7(6), 1–12 (2012)

    Article  Google Scholar 

  39. Zhou, E., Cao, Z., Yin, Q.: Naive-deep face recognition: touching the limit of LFW benchmark or not? arXiv preprint arXiv:1501.04690 (2015)

Download references

Acknowledgements

This research was funded by the Swiss NSF (Grant number: CR21I1_146499)

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David Garcia .

Editor information

Editors and Affiliations

Appendix

Appendix

As a preliminary step to fitting models and testing hypotheses, we survey descriptive statistics to guide the models explained in the previous section. The distributions of views, likes, and dislikes per video are shown on Fig. 3. The histogram of the left panel confirms our observations over the mean and median values of Table 1: all variables are right skewed. This skewness presents heavy right tails that, when eyeballing the plots, suggest the possibility that views, likes, and dislikes follow power-law distributions. Nevertheless, this possibility seems less plausible on the Complementary Cumulative Density Function (CCDF) shown on the right panel of Fig. 3, where the right tails decay faster than it would be expected for a power-law.

Fig. 3.
figure 3

Video impact distributions. Left: histogram of amount of views, likes, and dislikes over the videos of the dataset. Right: Complementary Cumulative Density Function (CCDF) of counts of videos with more than a certain amount of views, likes, and dislikes. While the histograms show right-skewness, the CCDF of counts show a decay faster than a power-law.

To have a better idea on whether the distributions of amount of views, likes, and dislikes might have scaling properties or diverging moments [25], we applied the method explained in [3] to verify that they do not follow a power-law distribution. We fitted power-law and log-normal distributions to the empirical data, comparing the fits in a log-likelihood ratio test. The results lend very strong evidence favoring the log-normal distribution over the power-law in all three cases: views (\(LLR=449.97, p<0.01\)), likes (\(LLR=275.7, p<0.01\)), and dislikes (\(LLR=159.99, p<0.01\)). This is an example of how informal statistics can be misleading in deciding whether distributions follow a power-law [3, 8], suggesting that we should assume the distributions as log-normally distributed instead.

Fig. 4.
figure 4

Histograms of log-transformed video metrics. The upper panels (A,B,C) show the histogram of log-transformed amount of views, likes ratio (\(L_R\)), and dislikes ratio (\(D_R\)) over the videos of the dataset. Panel D shows the histograms of log-transformed time intervals between videos of the same user (\(\Delta t\)), in seconds. The vertical red line shows the threshold of inactivity of 2 months. Panel E shows the histogram of time intervals normalized over the average time between videos of the user.

To ensure that we analyze the evaluative tendencies of videos and not their intrinsic correlation with video popularity, we divide likes and dislikes by the amount of views in the variables \(L_R=likes/views\) and \(D_R=dislikes/views\). These two variables and the amount of views are all roughly log-normally distributed, as it can be appreciated on the histograms of log-transformed values shown on the upper panels of Fig. 4. Some minor skewness can be attributed to integer approximations and boundary values. To cope with these possible deviations from normality in our models, we perform regression diagnostics to model fits to check that residuals are approximately normally distributed.

Figure 4D shows the distribution of the logarithm of the time between videos of the same user \(log(\Delta t)\). A clear bimodality is present, but it disappears when normalizing over the average time between videos of each user \( \langle t \rangle \). Figure 4E shows the distribution of \(log(\Delta t/\langle t \rangle )\), where no bimodality can be observed. This points to the source of bimodality being a variable at the user level, i.e. the activity rate of each user, as the distribution of time intervals collapses to a unimodal distribution after normalization. In our mixed effects regression models of P(I), we include random effects in the form of an intercept for each user that correct for this pattern, ensuring that our results are not a confound with idiographic properties of the users.

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Garcia, D., Abisheva, A., Schweitzer, F. (2017). Evaluative Patterns and Incentives in YouTube. In: Ciampaglia, G., Mashhadi, A., Yasseri, T. (eds) Social Informatics. SocInfo 2017. Lecture Notes in Computer Science(), vol 10540. Springer, Cham. https://doi.org/10.1007/978-3-319-67256-4_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-67256-4_24

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-67255-7

  • Online ISBN: 978-3-319-67256-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics