skip to main content
10.1145/1830252.1830253acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Time-based sampling of social network activity graphs

Published:24 July 2010Publication History

ABSTRACT

While most research in online social networks (OSNs) in the past has focused on static friendship networks, social network activity graphs are quite important as well. However, characterizing social network activity graphs is computationally intensive; reducing the size of these graphs using sampling algorithms is critical. There are two important requirements---the sampling algorithm must be able to preserve core graph characteristics and be amenable to a streaming implementation since activity graphs are naturally evolving in a streaming fashion. Existing approaches satisfy either one or the other requirement, but not both. In this paper, we propose a novel sampling algorithm called Streaming Time Node Sampling (STNS) that exploits temporal clustering often found in real social networks. Using real communication data collected from Facebook and Twitter, we show that STNS significantly out-performs state-of-the-art sampling mechanisms such as node sampling and Forest Fire sampling, across both averages and distributions of several graph properties.

References

  1. Facebook. http://www.facebook.com/.Google ScholarGoogle Scholar
  2. Myspace. http://www.myspace.com/.Google ScholarGoogle Scholar
  3. Twitter. http://www.twitter.com/.Google ScholarGoogle Scholar
  4. D. Achlioptas, A. Clauset, D. Kempe, and C. Moore. On the bias of traceroute sampling: or, power-law degree distributions in regular graphs. In ACM STOC, pages 694--703, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Y. Ahn, S. Han, H. Kwak, S. Moon, and H. Jeong. Analysis of topological characteristics of huge online social networking services. In WWW, pages 835--844, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. H. Chun, H. Kwak, Y. Eom, Y. Ahn, S. Moon, and H. Jeong. Comparison of online social relations in volume vs interaction: a case study of cyworld. In ACM/USENIX IMC, pages 57--70, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. L. Dall 'Asta, I. Alvarez-Hamelin, A. Barrat, A. Vázquez, and A. Vespignani. Exploring networks with traceroute-like probes: Theory and simulations. Theoretical Computer Science, 355(1):6--24, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Datta and H. Kargupta. Uniform data sampling from a peer-to-peer network. In Proceedings of ICDCS'02, page 50, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. H. Eldardiry and J. Neville. A resampling technique for relational data graphs. In SNA-KDD'08: Proceedings of the second workshop on Social Network Mining and Analysis, 2008.Google ScholarGoogle Scholar
  10. Facebook. Chat reaches 1 billion messages sent per day. http://www.facebook.com/note.php?note_id=91351698919, 2009.Google ScholarGoogle Scholar
  11. C. Gkantsidis, M. Mihail, and A. Saberi. Random walks in peer-to-peer networks. In IEEE INFOCOM, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. Hubler, H.-P. Kriegel, K. M. Borgwardt, and Z. Ghahramani. Metropolis algorithms for representative subgraph sampling. In ICDM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. I. Kahanda and J. Neville. Using transactional information to predict link strength in online social networks. In AAAI Conference on Weblogs and Social Media, 2009.Google ScholarGoogle Scholar
  14. J. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604--632, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. V. Krishnamurthy, M. Faloutsos, M. Chrobak, J. Cui, L. Lao, and A. Percus. Sampling large Internet topologies for simulation purposes. Computer Networks, 51(15):4284--4302, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. R. Kumar, J. Novak, and A. Tomkins. Structure and evolution of online social networks. In SIGKDD, pages 611--617, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. S. Lee, P. Kim, and H. Jeong. Statistical properties of sampled networks. Physical Review E, 73:016102, 2006.Google ScholarGoogle ScholarCross RefCross Ref
  18. J. Leskovec, L. Backstrom, R. Kumar, and A. Tomkins. Microscopic evolution of social networks. In SIGKDD, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Leskovec and C. Faloutsos. Sampling from large graphs. In SIGKDD, pages 631--636, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. Leskovec and E. Horvitz. Worldwide Buzz: Planetary-Scale Views on an Instant-Messaging Network. In WWW, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Leskovec, J. M. Kleinberg, and C. Faloutsos. Graphs over time: densification laws, shrinking diameters and possible explanations. In SIGKDD, pages 177--187, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. A. Mislove, M. Marcon, K. P. Gummadi, P. Druschel, and B. Bhattacharjee. Measurement and analysis of online social networks. In ACM/USENIX IMC, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. 1998.Google ScholarGoogle Scholar
  24. M. Stumpf, C. Wiuf, and R. May. Subnets of scale-free networks are not scale-free: Sampling properties of networks. Proceedings of the National Academy of Sciences, 102(12):4221--4224, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  25. D. Stutzbach, R. Rejaie, N. Duffield, S. Sen, and W. Willinger. On unbiased sampling for unstructured peer-to-peer networks. In IMC, pages 27--40, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. B. Viswanath, A. Mislove, M. Cha, and K. P. Gummadi. On the evolution of user interaction in facebook. In WOSN, August 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. C. Wilson, B. Boe, A. Sala, K. P. Puttaswamy, and B. Y. Zhao. User interactions in social networks and their implications. In EuroSys, pages 205--218, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. S. Yoon, S. Lee, S.-H. Yook, and Y. Kim. Statistical properties of sampled networks by random walks. Phys. Rev. E, 75(4):046114, Apr 2007.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Time-based sampling of social network activity graphs

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          MLG '10: Proceedings of the Eighth Workshop on Mining and Learning with Graphs
          July 2010
          185 pages
          ISBN:9781450302142
          DOI:10.1145/1830252

          Copyright © 2010 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 24 July 2010

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Upcoming Conference

          KDD '24

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader