skip to main content
10.1145/2764947.2764955acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Constructing Bisimulation Summaries on a Multi-Core Graph Processing Framework

Published:31 May 2015Publication History

ABSTRACT

Bisimulation summaries of graph data have multiple applications, including facilitating graph exploration and enabling query optimization techniques, but efficient, scalable, summary construction is challenging. The literature describes parallel construction algorithms using message-passing, and these have been recently adapted to MapReduce environments. The fixpoint nature of bisimulation is well suited to iterative graph processing, but the existing MapReduce solutions do not drastically decrease per-iteration times as the computation progresses.

In this paper, we focus on leveraging parallel multi-core graph frameworks with the goal of constructing summaries in roughly the same amount of time that it takes to input the data into the framework (for a range of real world data graphs) and output the summary. To achieve our goal we introduce a singleton optimization that significantly reduces per-iteration times after only a few iterations. We present experimental results validating that our scalable GraphChi implementation achieves our goal with bisimulation summaries of million to billion edge graphs.

References

  1. Big data: The next frontier for innovation, competition, and productivity. McKinsey & Company, May 2011.Google ScholarGoogle Scholar
  2. R. Appuswamy, C. Gkantsidis, D. Narayanan, O. Hodson, and A. Rowstron. Scale-up vs Scale-out for Hadoop: Time to rethink? SOCC, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. Blom and S. Orzan. A distributed algorithm for strong bisimulation reduction of state spaces. Electr. Notes Theor. Comput. Sci., 68(4):523--538, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  4. A. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins, and J. Wiener. Graph structure in the web. Comput. Netw., 33(1-6):309--320, June 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. P. Consens, V. Fionda, S. Khatchadourian, and G. Pirrò. Rewriting Queries over Summaries of Big Data Graphs. AMW, 2014.Google ScholarGoogle Scholar
  6. J. Dean and S. Ghemawat. MapReduce: simplified data processing on large clusters. Commun. ACM, 51(1):107--113, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Dovier, C. Piazza, and A. Policriti. An efficient algorithm for computing bisimulation equivalence. Theor. Comput. Sci., 311(1-3):221--256, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. R. Goldman and J. Widom. DataGuides: Enabling query formulation and optimization in semistructured databases. In VLDB, pages 436--445, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. O. Hassanzadeh and M. P. Consens. Linked movie data base. In LDOW, 2009.Google ScholarGoogle Scholar
  10. M. R. Henzinger, T. A. Henzinger, and P. W. Kopke. Computing simulations on finite and infinite graphs. In Foundations of Computer Science, pages 453--462, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. P. C. Kanellakis and S. A. Smolka. CCS expressions, finite state processes, and three problems of equivalence. Information and Computation, 86(1):43--68, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. S. Khatchadourian and M. P. Consens. ExpLOD: Summary-Based Exploration of Interlinking and RDF Usage in the Linked Open Data Cloud. In ESWC, pages 272--287, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Kumar, J. Gluck, A. Deshpande, and J. Lin. Hone: "Scaling Down" Hadoop on Shared-Memory Systems. PVLDB, 6(12):1354--1357, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. H. Kwak, C. Lee, H. Park, and S. Moon. What is Twitter, a social network or a news media? In WWW, pages 591--600. ACM, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. A. Kyrola, G. Blelloch, and C. Guestrin. GraphChi: Large-scale graph computation on just a PC. In OSDI, pages 31--46, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Y. Luo, G. H. L. Fletcher, J. Hidders, P. De Bra, and Y. Wu. Regularities and dynamics in bisimulation reductions of big graphs. GRADES, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Y. Luo, Y. Lange, G. H. Fletcher, P. Bra, J. Hidders, and Y. Wu. Bisimulation Reduction of Big Graphs on MapReduce. In BNCOD, volume 7968, pages 189--203. 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. A. Mislove, M. Marcon, K. P. Gummadi, P. Druschel, and B. Bhattacharjee. Measurement and analysis of online social networks. In SIGCOMM Conference on Internet Measurement, pages 29--42, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. R. Paige and R. E. Tarjan. Three partition refinement algorithms. SIAM Journal on Computing, 16(6):973--989, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. M. Rudolf, M. Paradies, C. Bornhövd, and W. Lehner. Synopsys: large graph analytics in the SAP HANA database through summarization. In GRADES, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. D. Sangiorgi. On the origins of bisimulation and coinduction. Trans. Program. Lang. Syst., 31(4), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. A. Schätzle, A. Neu, G. Lausen, and M. Przyjaciel-Zablocki. Large-scale bisimulation of RDF graphs. SWIM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. L. G. Valiant. A bridging model for parallel computation. Commun. ACM, 33(8):103--111, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Constructing Bisimulation Summaries on a Multi-Core Graph Processing Framework

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        GRADES'15: Proceedings of the GRADES'15
        May 2015
        54 pages
        ISBN:9781450336116
        DOI:10.1145/2764947

        Copyright © 2015 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 31 May 2015

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited

        Acceptance Rates

        Overall Acceptance Rate29of61submissions,48%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader