skip to main content
10.1145/2661829.2661862acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

Distributed Graph Summarization

Published:03 November 2014Publication History

ABSTRACT

Graph has been a ubiquitous and essential data representation to model real world objects and their relationships. Today, large amounts of graph data have been generated by various applications. Graph summarization techniques are crucial in uncovering useful insights about the patterns hidden in the underlying data. However, all existing works in graph summarization are single-process solutions, and as a result cannot scale to large graphs. In this paper, we introduce three distributed graph summarization algorithms to address this problem. Experimental results show that the proposed algorithms can produce good quality summaries and scale well with increasing data sizes. To the best of our knowledge, this is the first work to study distributed graph summarization methods.

References

  1. D. A. Bader and K. Madduri. Gtgraph: A suite of synthetic graph generators. www.cse.psu.edu/~madduri/software/GTgraph.Google ScholarGoogle Scholar
  2. A. Z. Broder, M. Charikar, A. M. Frieze, and M. Mitzenmacher. Min-wise independent permutations (extended abstract). In STOC '98, pages 327--336. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. Chakrabarti, Y. Zhan, and C. Faloutsos. R-mat: A recursive model for graph mining. In SDM'04.Google ScholarGoogle Scholar
  4. P. Indyk and R. Motwani. Approximate nearest neighbors: towards removing the curse of dimensionality. In STOC'98, pages 604--613. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. Lin and C. Dyer. Data-Intensive Text Processing with MapReduce, volume 3. 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. X. Liu, Q. He, Y. Tian, W. Lee, J. McPherson, and J. Han. Event-based social networks: Linking the online and offline social worlds. In SIGKDD'12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Y. Low, D. Bickson, J. Gonzalez, C. Guestrin, A. Kyrola, and J. M. Hellerstein. Distributed GraphLab: a framework for machine learning and data mining in the cloud. PVLDB, 5(8):716--727, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. G. Malewicz, M. H. Austern, A. J. Bik, J. C. Dehnert, I. Horn, N. Leiser, and G. Czajkowski. Pregel: a system for large-scale graph processing. In SIGMOD'10, pages 135--146. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. S. Navlakha, R. Rastogi, and N. Shrivastava. Graph summarization with bounded error. In SIGMOD'08, pages 567--580. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. Rissanen. Modeling by shortest data description. Automatica, 14:465--471, 1978. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. B. Shao, H. Wang, and Y. Li. Trinity: A Distributed Graph Engine on a Memory Cloud. In SIGMOD'13, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Y. Tian, R. Hankins, and J. M. Patel. Efficient aggregation for graph summarization. In SIGMOD'08, pages 419--432. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. N. Zhang, Y. Tian, and J. M. Patel. Discovery-driven graph summarization. In ICDE'10, pages 880--891.Google ScholarGoogle Scholar

Index Terms

  1. Distributed Graph Summarization

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          CIKM '14: Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management
          November 2014
          2152 pages
          ISBN:9781450325981
          DOI:10.1145/2661829

          Copyright © 2014 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 3 November 2014

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          CIKM '14 Paper Acceptance Rate175of838submissions,21%Overall Acceptance Rate1,861of8,427submissions,22%

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader