skip to main content
10.1145/2786451.2786484acmconferencesArticle/Chapter ViewAbstractPublication PageswebsciConference Proceedingsconference-collections
short-paper

Taming a Menagerie of Heavy Tails with Skew Path Analysis

Published: 28 June 2015 Publication History

Abstract

The discovery of stable, heavy-tailed distributions of activity on the web has inspired many researchers to search for simple mechanisms that can cut through the complexity of countless social interactions to yield powerful new theories about human behavior. A dominant mode of investigation involves fitting a mathematical model to an observed distribution, and then inferring the behaviors that generate the modeled distribution. Yet, distributions of activity are not always stable, and the process of fitting a mathematical model to empirical distributions can be highly uncertain, especially for smaller and highly variable datasets.
In this paper, we introduce an approach called skew-path analysis, which measures how concentrated information production is along different dimensions in community-generated data. The approach scales from small to large datasets, and is suitable for investigating the dynamics of online behavior. We offer a preliminary demonstration of the approach by using it to analyze six years of data from an online health community, and show that the technique offers interesting insights into the dynamics of information production. In particular, we find evidence for two distinct point attractors within a subset of the forums analyzed, demonstrating the utility of the approach.

References

[1]
Holly Arrow, Joseph E. McGrath, and Jennifer L. Berdahl. 2000. Small Groups as Complex Systems: Formation, Coordination, Development, and Adaptation. SAGE.
[2]
Albert-László Barabási. 2005. The origin of bursts and heavy tails in human dynamics. Nature 435, 7039, 207--211. http://doi.org/10.1038/nature03459
[3]
Albert-László Barabási and Réka Albert. 1999. Emergence of Scaling in Random Networks. Science 286, 5439, 509--512. http://doi.org/10.1126/science.286.5439.509
[4]
Maurice C. Bryson. 1974. Heavy-Tailed Distributions: Properties and Tests. Technometrics 16, 1, 61--68. http://doi.org/10.1080/00401706.1974.10489150
[5]
Brian S. Butler. 2001. Membership Size, Communication Activity, and Sustainability: A Resource-Based Model of Online Social Structures. Information Systems Research 12, 4, 346--362. http://doi.org/10.1287/isre.12.4.346.9703
[6]
A. Clauset, C. Shalizi, and M. Newman. 2009. Power-Law Distributions in Empirical Data. SIAM Review 51, 4, 661--703. http://doi.org/10.1137/070710111
[7]
Mark E. Crovella, Murad S. Taqqu, and Azer Bestavros. 1998. A Practical Guide to Heavy Tails. In Robert J. Adler, Raisa E. Feldman and Murad S. Taqqu (eds.). Birkhauser Boston Inc., Cambridge, MA, USA, 3--25. Retrieved March 18, 2015 from http://dl.acm.org/citation.cfm?id=292595.292596
[8]
Connie J. G. Gersick. 1991. Revolutionary Change Theories: A Multilevel Exploration of the Punctuated Equilibrium Paradigm. Academy of Management Review 16, 1, 10--36. http://doi.org/10.5465/AMR.1991.4278988
[9]
Lei Guo, Enhua Tan, Songqing Chen, Xiaodong Zhang, and Yihong (Eric) Zhao. 2009. Analyzing Patterns of User Content Generation in Online Social Networks. Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM, 369--378. http://doi.org/10.1145/1557019.1557064
[10]
Bernardo A. Huberman and Lada A. Adamic. 1999. Internet: Growth dynamics of the World-Wide Web. Nature 401, 6749, 131--131. http://doi.org/10.1038/43604
[11]
Jina Huh. forthcoming. Clinical Questions in Online Health Communities: The Case of "See your doctor" Threads. Proceeding of the 2015 conference on Computer Supported Cooperative Work, ACM Press.
[12]
Lou Jost. 2006. Entropy and diversity. Oikos 113, 2, 363--375. http://doi.org/10.1111/j.2006.0030-1299.14714.x
[13]
Andrey Kan, Jeffrey Chan, Conor Hayes, Bernie Hogan, James Bailey, and Christopher Leckie. 2013. A time decoupling approach for studying forum dynamics. World Wide Web 16, 5-6, 595--620. http://doi.org/10.1007/s11280-012-0169-1
[14]
Thomas S. Kuhn. 2012. The Structure of Scientific Revolutions: 50th Anniversary Edition. University of Chicago Press.
[15]
R. Dean Malmgren, Daniel B. Stouffer, Adilson E. Motter, and Luís A. N. Amaral. 2008. A Poissonian explanation for heavy tails in e-mail communication. Proceedings of the National Academy of Sciences 105, 47, 18153--18158. http://doi.org/10.1073/pnas.0800332105
[16]
Diane Maloney-Krichmar and Jenny Preece. 2005. A Multilevel Analysis of Sociability, Usability, and Community Dynamics in an Online Health Community. ACM Trans. Comput.-Hum. Interact. 12, 2, 201--232. http://doi.org/10.1145/1067860.1067864
[17]
Daphne Ruth Raban and Eyal Rabin. 2007. Statistical Inference from Power Law Distributed Web-Based Social Interactions. Social Science Research Network, Rochester, NY. Retrieved March 18, 2015 from http://papers.ssrn.com/abstract=999286
[18]
Catherine Ridings and Molly McLure Wasko. 2010. Online discussion group sustainability: Investigating the interplay between structural dynamics and social dynamics over time. Journal of the Association for Information Systems 11, 2. Retrieved from http://aisel.aisnet.org/jais/vol11/iss2/1
[19]
Peter J. Rousseeuw. 1987. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics 20, 53--65. http://doi.org/10.1016/0377-0427(87)90125-7
[20]
Hanna Tuomisto. 2010. A consistent terminology for quantifying species diversity? Yes, it does exist. Oecologia 164, 4, 853--860. http://doi.org/10.1007/s00442-010-1812-0
[21]
F.B. Viegas and M. Smith. 2004. Newsgroup Crowds and AuthorLines: visualizing the activity of individuals in conversational cyberspaces. Proceedings of the 37th Annual Hawaii International Conference on System Sciences, 2004, 10 pp.--. http://doi.org/10.1109/HICSS.2004.1265288
[22]
Xiaohan Zhao, Alessandra Sala, Christo Wilson, et al. 2012. Multi-scale Dynamics in a Massive Online Social Network. Proceedings of the 2012 ACM Conference on Internet Measurement Conference, ACM, 171--184. http://doi.org/10.1145/2398776.2398795
[23]
Alexa - Top Sites by Category: Health. Retrieved January 24, 2015 from http://www.alexa.com/topsites/category/Top/Health

Cited By

View all
  • (2018)How People Weave Online Information Into PseudoknowledgeSocial Media + Society10.1177/20563051187856394:3Online publication date: 19-Jul-2018
  • (2017)The Collaborative Construction and Evolution of Pseudo-knowledge in Online ConversationsProceedings of the 8th International Conference on Social Media & Society10.1145/3097286.3097297(1-10)Online publication date: 28-Jul-2017

Index Terms

  1. Taming a Menagerie of Heavy Tails with Skew Path Analysis

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      WebSci '15: Proceedings of the ACM Web Science Conference
      June 2015
      366 pages
      ISBN:9781450336727
      DOI:10.1145/2786451
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 28 June 2015

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. Diversity
      2. Dynamics
      3. Power Law
      4. Social Media

      Qualifiers

      • Short-paper
      • Research
      • Refereed limited

      Conference

      WebSci '15
      Sponsor:
      WebSci '15: ACM Web Science Conference
      June 28 - July 1, 2015
      Oxford, United Kingdom

      Acceptance Rates

      Overall Acceptance Rate 245 of 933 submissions, 26%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)3
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 23 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2018)How People Weave Online Information Into PseudoknowledgeSocial Media + Society10.1177/20563051187856394:3Online publication date: 19-Jul-2018
      • (2017)The Collaborative Construction and Evolution of Pseudo-knowledge in Online ConversationsProceedings of the 8th International Conference on Social Media & Society10.1145/3097286.3097297(1-10)Online publication date: 28-Jul-2017

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media