DOI: 10.1145/3308558.3313729
research-article

Generalists and Specialists: Using Community Embeddings to Quantify Activity Diversity in Online Platforms

Published: 13 May 2019

ABSTRACT

In many online platforms, people must choose how broadly to allocate their energy. Should one concentrate on a narrow area of focus, and become a specialist, or apply oneself more broadly, and become a generalist? In this work, we propose a principled measure of how generalist or specialist a user is, and study behavior in online platforms through this lens. To do this, we construct highly accurate community embeddings that represent communities in a high-dimensional space. We develop sets of community analogies and use them to optimize our embeddings so that they encode community relationships extremely well. Based on these embeddings, we introduce a natural measure of activity diversity, the GS-score.
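The abstract names the GS-score but does not give its formula. The following is a minimal sketch, assuming the score is the activity-weighted average cosine similarity between each community a user contributes to and that user's center of mass in the embedding space. The function names `unit` and `gs_score`, the input layout, and the toy vectors are hypothetical and chosen only for illustration.

```python
import math


def unit(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def gs_score(community_vectors, weights):
    """Assumed GS-score: weighted mean cosine similarity between a user's
    communities and the user's activity-weighted center of mass.

    community_vectors: one embedding per community the user is active in.
    weights: the user's activity level in each community (same order).
    """
    if not community_vectors or len(community_vectors) != len(weights):
        raise ValueError("need one weight per community vector")
    units = [unit(v) for v in community_vectors]
    dim = len(units[0])
    # Activity-weighted center of mass, rescaled to unit length.
    # unit() raises if the weighted vectors cancel out exactly.
    center = unit([
        sum(w * u[d] for w, u in zip(weights, units)) for d in range(dim)
    ])
    total = float(sum(weights))
    # All vectors are unit length, so a dot product is a cosine similarity.
    return sum(
        w * sum(a * b for a, b in zip(u, center))
        for w, u in zip(weights, units)
    ) / total


# Toy example: a user active in two near-identical communities
# versus a user split evenly across two unrelated ones.
specialist = gs_score([[1.0, 0.0], [2.0, 0.0]], [3, 1])
generalist = gs_score([[1.0, 0.0], [0.0, 1.0]], [1, 1])
```

Under this assumed definition, a user whose communities all point in the same direction scores close to 1 (a specialist), while a user spread across dissimilar communities scores lower (a generalist).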

Applying our embedding-based measure to online platforms, we observe a broad spectrum of user activity styles, from extreme specialists to extreme generalists, in both community membership on Reddit and programming contributions on GitHub. We find that activity diversity is related to many important phenomena of user behavior. For example, specialists are much more likely to stay in communities they contribute to, but generalists are much more likely to remain on platforms as a whole. We also find that generalists engage with significantly more diverse sets of users than specialists do. Furthermore, our methodology leads to a simple algorithm for community recommendation, matching state-of-the-art methods like collaborative filtering. Our methods and results introduce an important new dimension of online user behavior and shed light on many aspects of online platform use.
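The abstract says the methodology leads to a simple community recommendation algorithm but does not describe it. One plausible reading, sketched here purely as an assumption, is to rank the communities a user has not yet joined by cosine similarity to the user's activity-weighted center of mass. The `recommend` function, the dictionary layout, and the community names and vectors are all hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def recommend(embeddings, user_activity, k=3):
    """Return up to k communities closest to the user's center of mass.

    embeddings: dict mapping community name -> embedding vector.
    user_activity: dict mapping community name -> activity weight (> 0).
    Assumes every embedding is non-zero and the weighted sum does not
    cancel out to the zero vector.
    """
    dim = len(next(iter(embeddings.values())))
    center = [0.0] * dim
    for name, weight in user_activity.items():
        vec = embeddings[name]
        norm = math.sqrt(sum(x * x for x in vec))
        for d in range(dim):
            # Normalize each community so only direction and weight matter.
            center[d] += weight * vec[d] / norm
    # Only communities the user is not already active in are candidates.
    candidates = [name for name in embeddings if name not in user_activity]
    candidates.sort(key=lambda name: cosine(embeddings[name], center),
                    reverse=True)
    return candidates[:k]


# Toy two-dimensional embeddings, invented for illustration only.
toy_embeddings = {
    "python": [1.0, 0.1],
    "rust": [0.9, 0.2],
    "cooking": [0.0, 1.0],
    "baking": [0.1, 1.0],
}
suggestions = recommend(toy_embeddings, {"python": 5}, k=2)
```

With these toy vectors, a user active only in the hypothetical "python" community is pointed first to "rust" and then to "baking", since those lie closest to the user's center of mass.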

  • Published in

    WWW '19: The World Wide Web Conference
    May 2019
    3620 pages
    ISBN: 9781450366748
    DOI: 10.1145/3308558

    Copyright © 2019 ACM

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate: 1,899 of 8,196 submissions, 23%
