skip to main content
10.1145/3295500.3356182acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article

Slim graph: practical lossy graph compression for approximate graph processing, storage, and analytics

Published:17 November 2019Publication History

ABSTRACT

We propose Slim Graph: the first programming model and framework for practical lossy graph compression that facilitates high-performance approximate graph processing, storage, and analytics. Slim Graph enables the developer to express numerous compression schemes using small and programmable compression kernels that can access and modify local parts of input graphs. Such kernels are executed in parallel by the underlying engine, isolating developers from complexities of parallel programming. Our kernels implement novel graph compression schemes that preserve numerous graph properties, for example connected components, minimum spanning trees, or graph spectra. Finally, Slim Graph uses statistical divergences and other metrics to analyze the accuracy of lossy graph compression. We illustrate both theoretically and empirically that Slim Graph accelerates numerous graph algorithms, reduces storage used by graph datasets, and ensures high accuracy of results. Slim Graph may become the common ground for developing, executing, and analyzing emerging lossy graph compression schemes.

References

  1. [n.d.]. Hyperlink Graph 2012. http://webdatacommons.org/hyperlinkgraph/2012-08/download.html.Google ScholarGoogle Scholar
  2. Kook Jin Ahn, Sudipto Guha, and Andrew McGregor. 2012. Analyzing graph structure via linear measurements. In Proceedings of the twenty-third annual ACM-SIAM symposium on Discrete Algorithms. SIAM, 459--467.Google ScholarGoogle ScholarCross RefCross Ref
  3. Kook Jin Ahn, Sudipto Guha, and Andrew McGregor. 2012. Graph sketches: sparsification, spanners, and subgraphs. In Proceedings of the 31st ACM SIGMOD-SIGACT-SIGAI symposium on Principles of Database Systems. ACM, 5--14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Stephen Alstrup, Søren Dahlgaard, Arnold Filtser, Morten Stöckel, and Christian Wulff-Nilsen. 2017. Constructing light spanners deterministically in near-linear time. arXiv preprint arXiv.1709.01960 (2017).Google ScholarGoogle Scholar
  5. Ingo Althöfer, Gautam Das, David Dobkin, Deborah Joseph, and José Soares. 1993. On sparse spanners of weighted graphs. Discrete & Computational Geometry 9, 1 (1993), 81--100.Google ScholarGoogle ScholarCross RefCross Ref
  6. David G Anderson, Ming Gu, and Christopher Melgaard. 2014. An efficient algorithm for unweighted spectral graph sparsification. arXiv preprint arXiv:1410A273 (2014).Google ScholarGoogle Scholar
  7. David A Bader, Shiva Kintali, Kamesh Madduri, and Milena Mihail. 2007. Approximating betweenness centrality. In International Workshop on Algorithms and Models for the Web-Graph. Springer, 124--137.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Michèle Basseville. 2010. Divergence measures for statistical data processing. (2010).Google ScholarGoogle Scholar
  9. Surender Baswana, Telikepalli Kavitha, Kurt Mehlhorn, and Seth Pettie. 2010. Additive spanners and (α, β)-spanners. ACM Transactions on Algorithms (TALG) 7, 1 (2010), 5.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Surender Baswana and Sandeep Sen. 2007. A simple and linear time randomized algorithm for computing sparse spanners in weighted graphs. Random Structures & Algorithms 30, 4 (2007), 532--563.Google ScholarGoogle ScholarCross RefCross Ref
  11. Omar Batarfi, Radwa El Shawi, Ayman G Fayoumi, Reza Nouri, Ahmed Barnawi, Sherif Sakr, et al. 2015. Large scale graph processing systems: survey and an experimental evaluation. Cluster Computing 18, 3 (2015), 1189--1213.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Joshua Batson, Daniel A Spielman, Nikhil Srivastava, and Shang-Hua Teng. 2013. Spectral sparsification of graphs: theory and algorithms. Commun. ACM 56, 8 (2013), 87--94.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Scott Beamer, Krste Asanović, and David Patterson. 2015. The GAP benchmark suite. arXiv preprint arXiv.1508.03619 (2015).Google ScholarGoogle Scholar
  14. Maham Anwar Beg, Muhammad Ahmad, Arif Zaman, and Imdadullah Khan. 2018. Scalable Approximation Algorithm for Graph Summarization. pacific-asia conference on knowledge discovery and data mining (2018), 502--514.Google ScholarGoogle Scholar
  15. András A Benczúr and David R Karger. 1996. Approximating st minimum cuts in O (n 2) time. In Proceedings of the twenty-eighth annual ACM symposium on Theory of computing. ACM, 47--55.Google ScholarGoogle Scholar
  16. Suman K Bera and Amit Chakrabarti. 2017. Towards tighter space bounds for counting triangles and other substructures in graph streams. In 34th Symposium on Theoretical Aspects of Computer Science (STACS 2017). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.Google ScholarGoogle Scholar
  17. Maciej Besta and Torsten Hoefler. 2018. Survey and Taxonomy of Lossless Graph Compression and Space-Efficient Graph Representations. arXiv preprint arXiv:1806.01799 (2018).Google ScholarGoogle Scholar
  18. Maciej Besta, Michał Podstawski, Linus Groner, Edgar Solomonik, and Torsten Hoefler. 2017. To push or to pull: On reducing communication and synchronization in graph computations. In Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing. ACM, 93--104.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Maciej Besta, Dimitri Stanojevic, Tijana Zivic, Jagpreet Singh, Maurice Hoerold, and Torsten Hoefler. 2018. Log (graph): a near-optimal high-performance graph representation. In Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques. ACM, 7.Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Paolo Boldi, Marco Rosa, and Sebastiano Vigna. 2011. HyperANF: Approximating the neighbourhood function of very large graphs on a budget. In Proceedings of the 20th international conference on World wide web. ACM, 625--634.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Paolo Boldi and Sebastiano Vigna. 2004. The webgraph framework I: compression techniques. In Proceedings of the 13th international conference on World Wide Web. ACM, 595--602.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Michele Borassi and Emanuele Natale. 2016. KADABRA is an adaptive algorithm for betweenness via random approximation. arXiv preprint arXiv:1604.08553 (2016).Google ScholarGoogle Scholar
  23. Stéphane Boucheron, Gábor Lugosi, and Pascal Massart. 2013. Concentration inequalities: A nonasymptotic theory of independence. Oxford university press.Google ScholarGoogle Scholar
  24. Ulrik Brandes. 2001. A faster algorithm for betweenness centrality. Journal of mathematical sociology 25, 2 (2001), 163--177.Google ScholarGoogle ScholarCross RefCross Ref
  25. Nieves R Brisaboa, Susana Ladra, and Gonzalo Navarro. 2009. k2-Trees for Compact Web Graph Representation.. In SPIRE, Vol. 9. Springer, 18--30.Google ScholarGoogle Scholar
  26. Luciana S Buriol, Gereon Frahling, Stefano Leonardi, Alberto Marchetti-Spaccamela, and Christian Sohler. 2006. Counting triangles in data streams. In Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. ACM, 253--262.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Daniele Calandriello, Ioannis Koutis, Alessandro Lazaric, and Michal Valko. 2018. Improved large-scale graph learning through ridge spectral sparsification. In International Conference on Machine Learning.Google ScholarGoogle Scholar
  28. Stéphane Campinas, Renaud Delbru, and Giovanni Tummarello. 2013. Efficiency and precision trade-offs in graph summary algorithms. In Proceedings of the 17th International Database Engineering and Applications Symposium on. 38--47.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Keren Censor-Hillel and Michal Dory. 2018. Distributed spanner approximation. In Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing. ACM, 139--148.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Keren Censor-Hillel, Ami Paz, and Noam Ravid. 2018. The Sparsest Additive Spanner via Multiple Weighted BFS Trees. arXiv preprint arXiv:1811.01997 (2018).Google ScholarGoogle Scholar
  31. Sung-Hyuk Cha. 2007. Comprehensive survey on distance/similarity measures between probability density functions. City 1, 2 (2007), 1.Google ScholarGoogle Scholar
  32. Bernard Chazelle, Ronitt Rubinfeld, and Luca Trevisan. 2005. Approximating the minimum spanning tree weight in sublinear time. SIAM Journal on computing 34, 6 (2005), 1370--1379.Google ScholarGoogle Scholar
  33. Shiri Chechik, Daniel H Larkin, Liam Roditty, Grant Schoenebeck, Robert E Tarjan, and Virginia Vassilevska Williams. 2014. Better approximation algorithms for the graph diameter. In Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms. Society for Industrial and Applied Mathematics, 1041--1052.Google ScholarGoogle ScholarCross RefCross Ref
  34. Mostafa Haghir Chehreghani, Albert Bifet, and Talel Abdessalem. 2018. Efficient Exact and Approximate Algorithms for Computing Betweenness Centrality in Directed Graphs. In Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 752--764.Google ScholarGoogle Scholar
  35. Chen Chen, Cindy Xide Lin, Matt Fredrikson, Mihai Christodorescu, Xifeng Yan, and Jiawei Han. 2009. Mining graph patterns efficiently via randomized summaries. very large data bases 2, 1 (2009), 742--753.Google ScholarGoogle Scholar
  36. Avery Ching, Sergey Edunov, Maja Kabiljo, Dionysios Logothetis, and Sambavi Muthukrishnan. 2015. One trillion edges: Graph processing at facebook-scale. Proceedings of the VLDB Endowment 8, 12 (2015), 1804--1815.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Nicos Christofides. 1976. Worst-case analysis of a new heuristic for the travelling salesman problem. Technical Report. Carnegie-Mellon Univ Pittsburgh Pa Management Sciences Research Group.Google ScholarGoogle Scholar
  38. Timothy Chu, Yu Gao, Richard Peng, Sushant Sachdeva, Saurabh Sawlani, and Junxing Wang. 2018. Graph sparsification, spectral sketches, and faster resistance computation, via short cycle decompositions. In 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS). IEEE, 361--372.Google ScholarGoogle ScholarCross RefCross Ref
  39. Thomas H Cormen, Charles E Leiserson, Ronald L Rivest, and Clifford Stein. 2009. Introduction to algorithms. MIT press.Google ScholarGoogle Scholar
  40. Graham Cormode and Hossein Jowhari. 2017. A second look at counting triangles in graph streams (corrected). Theoretical Computer Science 683 (2017), 22--30.Google ScholarGoogle ScholarCross RefCross Ref
  41. Thomas M Cover and Joy A Thomas. 2012. Elements of information theory. John Wiley & Sons.Google ScholarGoogle Scholar
  42. Ketan Date, Keven Feng, Rakesh Nagi, Jinjun Xiong, Nam Sung Kim, and Wen-Mei Hwu. 2017. Collaborative (cpu+ gpu) algorithms for triangle counting and truss decomposition on the minsky architecture: Static graph challenge: Subgraph isomorphism. In 2017 IEEE High Performance Extreme Computing Conference (HPEC). IEEE, 1--7.Google ScholarGoogle ScholarCross RefCross Ref
  43. Etienne de Klerk, Dmitrii V Pasechnik, and Joost P Warners. 2004. On approximate graph colouring and max-k-cut algorithms based on the θ-function. Journal of Combinatorial Optimization 8, 3 (2004), 267--294.Google ScholarGoogle ScholarCross RefCross Ref
  44. Camil Demetrescu, Andrew V Goldberg, and David S Johnson. 2009. The Shortest Path Problem: Ninth DIMLACS Implementation Challenge. Vol. 74. American Math. Soc.Google ScholarGoogle Scholar
  45. Niels Doekemeijer and Ana Lucia Varbanescu. 2014. A survey of parallel graph processing frameworks. Delft University of Technology (2014), 21.Google ScholarGoogle Scholar
  46. Michal Dory. 2018. Distributed Approximation of Minimum k-edge-connected Spanning Subgraphs. In Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing. ACM, 149--158.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Stefania Dumbrava, Angela Bonifati, Amaia Nazabal Ruiz Diaz, and Romain Vuillemot. 2018. Approximate Evaluation of Label-Constrained Reachability Queries. arXiv preprint arXiv.1811.11561 (2018).Google ScholarGoogle Scholar
  48. Cody Dunne and Ben Shneiderman. 2013. Motif simplification: improving network visualization readability with fan, connector, and clique glyphs. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 3247--3256.Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. Ghizlane ECHBARTHI and Hamamache KHEDDOUCI. 2017. Lasas: an aggregated search based graph matching approach. In The 29th International Conference on Software Engineering and Knowledge Engineering.Google ScholarGoogle Scholar
  50. Talya Eden, Amit Levi, Dana Ron, and C Seshadhri. 2017. Approximately counting triangles in sublinear time. SIAM J. Comput. 46, 5 (2017), 1603--1646.Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Michael Elkin and Ofer Neiman. 2018. Efficient algorithms for constructing very sparse spanners and emulators. ACM Transactions on Algorithms (TALG) 15, 1 (2018), 4.Google ScholarGoogle Scholar
  52. Guy Even, Joseph Naor, Satish Rao, and Baruch Schieber. 1999. Fast approximate graph partitioning algorithms. SIAM J. Comput. 28, 6 (1999), 2187--2214.Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Wenfei Fan, Jianzhong Li, Xin Wang, and Yinghui Wu. 2012. Query preserving graph compression. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data. 157--168.Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. Arash Farzan and J Ian Munro. 2008. Succinct representations of arbitrary graphs. In European Symposium on Algorithms. Springer, 393--404.Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. Zhuo Feng. 2016. Spectral graph sparsification in nearly-linear time leveraging efficient spectral perturbation analysis. In Proceedings of the 53rd Annual Design Automation Conference. ACM, 57.Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. Jacob Fox, Tim Roughgarden, C Seshadhri, Fan Wei, and Nicole Wein. 2018. Finding cliques in social networks: A new distribution-free model. arXiv preprint arXiv:1804.07431 (2018).Google ScholarGoogle Scholar
  57. Haohuan Fu, Junfeng Liao, Jinzhe Yang, Lanning Wang, Zhenya Song, Xiaomeng Huang, Chao Yang, Wei Xue, Fangfang Liu, Fangli Qiao, et al. 2016. The Sunway TaihuLight supercomputer: system and applications. Science China Information Sciences 59, 7 (2016), 072001.Google ScholarGoogle ScholarCross RefCross Ref
  58. Robert Geisberger, Peter Sanders, and Dominik Schultes. 2008. Better approximation of betweenness centrality. In Proceedings of the Meeting on Algorithm Engineering & Expermiments. Society for Industrial and Applied Mathematics, 90--100.Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. Robert Gerstenberger, Maciej Besta, and Torsten Hoefler. 2014. Enabling highly-scalable remote memory access programming with MPI-3 one sided. Scientific Programming 22, 2 (2014), 75--91.Google ScholarGoogle ScholarDigital LibraryDigital Library
  60. Oded Green, Pavan Yalamanchili, and Lluís-Miquel Munguía. 2014. Fast triangle counting on the GPU. In Proceedings of the 4th Workshop on Irregular Applications: Architectures and Algorithms. IEEE Press, 1--8.Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. Sudipto Guha and Andrew McGregor. 2012. Graph synopses, sketches, and streams: A survey. Proceedings of the VLDB Endowment 5, 12 (2012), 2030--2031.Google ScholarGoogle ScholarDigital LibraryDigital Library
  62. Magnús M Halldórsson. 1993. A still better performance guarantee for approximate graph coloring. Inform. Process. Lett. 45, 1 (1993), 19--23.Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. Safiollah Heidari, Yogesh Simmhan, Rodrigo N Calheiros, and Rajkumar Buyya. 2018. Scalable graph processing frameworks: A taxonomy and open challenges. ACM Computing Surveys (CSUR) 51, 3 (2018), 60.Google ScholarGoogle ScholarDigital LibraryDigital Library
  64. Wilko Henecka and Matthew Roughan. 2015. Lossy compression of dynamic, weighted graphs. In 2015 3rd International Conference on Future Internet of Things and Cloud. IEEE, 427--434.Google ScholarGoogle ScholarDigital LibraryDigital Library
  65. Pili Hu and Wing Cheong Lau. 2013. A survey and taxonomy of graph sampling. arXiv preprint arXiv:1308.5865 (2013).Google ScholarGoogle Scholar
  66. Yang Hu, Hang Liu, and H Howie Huang. 2018. High-Performance Triangle Counting on GPUs. In 2018 IEEE High Performance extreme Computing Conference (HPEC). IEEE, 1--5.Google ScholarGoogle Scholar
  67. Anand Padmanabha Iyer, Zaoxing Liu, Xin Jin, Shivaram Venkataraman, Vladimir Braverman, and Ion Stoica. 2018. {ASAP}: Fast, Approximate Graph Pattern Mining at Scale. In 13th {USENIX} Symposium on Operating Systems Design and Implementation ({OSDI} 18). 745--761.Google ScholarGoogle Scholar
  68. Anand Padmanabha Iyer, Aurojit Panda, Shivaram Venkataraman, Mosharaf Chowdhury, Aditya Akella, Scott Shenker, and Ion Stoica. 2018. Bridging the GAP: towards approximate graph analytics. In Proceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA). ACM, 10.Google ScholarGoogle ScholarDigital LibraryDigital Library
  69. Arun Jambulapati and Aaron Sidford. 2018. Efficient O (n/epsilon) Spectral Sketches for the Laplacian and its Pseudoinverse. In Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms. SIAM, 2487--2503.Google ScholarGoogle ScholarCross RefCross Ref
  70. Madhav Jha, Ali Pinar, and C Seshadhri. 2015. Counting triangles in real-world graph streams: Dealing with repeated edges and time windows. In 2015 49th Asilomar Conference on Signals, Systems and Computers. IEEE, 1507--1514.Google ScholarGoogle ScholarCross RefCross Ref
  71. Madhav Jha, C Seshadhri, and Ali Pinar. 2015. A space-efficient streaming algorithm for estimating transitivity and triangle counts using the birthday paradox. ACM Transactions on Knowledge Discovery from Data (TKDD) 9, 3 (2015), 15.Google ScholarGoogle Scholar
  72. Hossein Jowhari and Mohammad Ghodsi. 2005. New streaming algorithms for counting triangles in graphs. In International Computing and Combinatorics Conference. Springer, 710--716.Google ScholarGoogle ScholarCross RefCross Ref
  73. Vasiliki Kalavri, Tiago Simas, and Dionysios Logothetis. 2016. The shortest path is not always a straight line: leveraging semi-metricity in graph analysis. Proceedings of the VLDB Endowment 9, 9 (2016), 672--683.Google ScholarGoogle ScholarDigital LibraryDigital Library
  74. David R. Karger. 2000. Minimum cuts in near-linear time. J. ACM 47, 1 (2000), 46--76. Google ScholarGoogle ScholarDigital LibraryDigital Library
  75. Jonathan A Kelner and Alex Levin. 2013. Spectral sparsification in the semi-streaming setting. Theory of Computing Systems 53, 2 (2013), 243--262.Google ScholarGoogle ScholarCross RefCross Ref
  76. Subhash Khot and Oded Regev. 2008. Vertex cover might be hard to approximate to within 2- ε. J. Comput. System Sci 74, 3 (2008), 335--349.Google ScholarGoogle ScholarDigital LibraryDigital Library
  77. Ioannis Koutis and Shen Chen Xu. 2016. Simple parallel and distributed algorithms for spectral graph sparsification. ACM Transactions on Parallel Computing (TOPC) 3, 2 (2016), 14.Google ScholarGoogle Scholar
  78. Solomon Kullback. 1997. Information theory and statistics. Courier Corporation.Google ScholarGoogle Scholar
  79. K. Ashwin Kumar and Petros Efstathopoulos. 2018. Utility-driven graph summarization. very large data bases 12, 4 (2018), 335--347.Google ScholarGoogle Scholar
  80. Jérôme Kunegis. 2013. Konect: the koblenz network collection. In Proc. of Intl. Conf. on World Wide Web (WWW). ACM, 1343--1350.Google ScholarGoogle ScholarDigital LibraryDigital Library
  81. Rasmus Kyng and Zhao Song. 2018. A Matrix Chernoff Bound for Strongly Rayleigh Distributions and Spectral Sparsifiers from a few Random Spanning Trees. In 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS). IEEE, 373--384.Google ScholarGoogle ScholarCross RefCross Ref
  82. Kyong-Ha Lee, Yoon-Joon Lee, Hyunsik Choi, Yon Dohn Chung, and Bongki Moon. 2012. Parallel data processing with MapReduce: a survey. AcM sIGMoD Record 40, 4 (2012), 11--20.Google ScholarGoogle Scholar
  83. Yin Tat Lee and He Sun. 2018. Constructing linear-sized spectral sparsification in almost-linear time. SIAM J. Comput. 47, 6 (2018), 2315--2336.Google ScholarGoogle ScholarCross RefCross Ref
  84. Christoph Lenzen and Reut Levi. 2018. A Centralized Local Algorithm for the Sparse Spanning Graph Problem. In 45th International Colloquium on Automata, Languages, and Programming (ICALP 2018). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.Google ScholarGoogle Scholar
  85. Jure Leskovec and Christos Faloutsos. 2006. Sampling from large graphs. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 631--636.Google ScholarGoogle ScholarDigital LibraryDigital Library
  86. Jure Leskovec and Andrej Krevl. 2014. SNAP Datasets: Stanford Large Network Dataset Collection. http://snap.stanford.edu/data.Google ScholarGoogle Scholar
  87. Heng Lin, Xiaowei Zhu, Bowen Yu, Xiongchao Tang, Wei Xue, Wenguang Chen, Lufei Zhang, Torsten Hoefler, Xiaosong Ma, Xin Liu, et al. 2018. ShenTu: processing multi-trillion edge graphs on millions of cores in seconds. In Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis. IEEE Press, 56.Google ScholarGoogle ScholarDigital LibraryDigital Library
  88. Wei Liu, Andrey Kan, Jeffrey Chan, James Bailey, Christopher Leckie, Jian Pei, and Ramamohanarao Kotagiri. 2012. On compressing weighted time-evolving graphs. In Proceedings of the 21st ACM international conference on Information and knowledge management. ACM, 2319--2322.Google ScholarGoogle ScholarDigital LibraryDigital Library
  89. Xingjie Liu, Yuanyuan Tian, Qi He, Wang-Chien Lee, and John McPherson. 2014. Distributed Graph Summarization. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. 799--808.Google ScholarGoogle ScholarDigital LibraryDigital Library
  90. Yike Liu, Tara Safavi, Abhilash Dighe, and Danai Koutra. 2018. Graph Summarization Methods and Applications: A Survey. Comput. Surveys 51, 3 (2018), 62.Google ScholarGoogle ScholarDigital LibraryDigital Library
  91. Yi Lu, James Cheng, Da Yan, and Huanhuan Wu. 2014. Large-scale distributed graph computing systems: An experimental evaluation. Proceedings of the VLDB Endowment 8, 3 (2014), 281--292.Google ScholarGoogle ScholarDigital LibraryDigital Library
  92. Andrew Lumsdaine, Douglas Gregor, Bruce Hendrickson, and Jonathan W. Berry. 2007. Challenges in Parallel Graph Processing. Par. Proc. Let. 17, 1 (2007), 5--20.Google ScholarGoogle ScholarCross RefCross Ref
  93. Grzegorz Malewicz, Matthew H. Austern, Aart J.C Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski. 2010. Pregel: a system for large-scale graph processing. In Proc. of the ACM SIGMOD Intl. Conf. on Manag. of Data (SIGMOD '10). ACM, New York, NY, USA, 135--146. Google ScholarGoogle ScholarDigital LibraryDigital Library
  94. Hossein Maserrat and Jian Pei. 2012. Community Preserving Lossy Compression of Social Networks. In 2012 IEEE 12th International Conference on Data Mining. 509--518.Google ScholarGoogle Scholar
  95. Robert Ryan McCune, Tim Weninger, and Greg Madey. 2015. Thinking like a vertex: a survey of vertex-centric frameworks for large-scale distributed graph processing. ACM Computing Surveys (CSUR) 48, 2 (2015), 25.Google ScholarGoogle ScholarDigital LibraryDigital Library
  96. Andrew McGregor, Sofya Vorotnikova, and Hoa T Vu. 2016. Better algorithms for counting triangles in data streams. In Proceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems. ACM, 401--411.Google ScholarGoogle ScholarDigital LibraryDigital Library
  97. Gary L Miller, Richard Peng, Adrian Vladu, and Shen Chen Xu. 2015. Improved parallel algorithms for spanners and hopsets. In Proceedings of the 27th ACM Symposium on Parallelism in Algorithms and Architectures. ACM, 192--201.Google ScholarGoogle ScholarDigital LibraryDigital Library
  98. Richard C Murphy, Kyle B Wheeler, Brian W Barrett, and James A Ang. 2010. Introducing the graph 500. Cray User's Group (CUG) (2010).Google ScholarGoogle Scholar
  99. Moni Naor. 1990. Succinct representation of general unlabeled graphs. Discrete Applied Mathematics 28, 3 (1990), 303--307.Google ScholarGoogle ScholarDigital LibraryDigital Library
  100. Saket Navlakha, Rajeev Rastogi, and Nisheeth Shrivastava. 2008. Graph summarization with bounded error. In Proceedings of the 2008 ACM SIGMOD international conference on Management of data. ACM, 419--432.Google ScholarGoogle ScholarDigital LibraryDigital Library
  101. Donald Nguyen, Andrew Lenharth, and Keshav Pingali. 2013. A lightweight infrastructure for graph analytics. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles. ACM, 456--471.Google ScholarGoogle ScholarDigital LibraryDigital Library
  102. Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. 1999. The PageRank citation ranking: Bringing order to the web. Technical Report. Stanford InfoLab.Google ScholarGoogle Scholar
  103. Merav Parter, Ronitt Rubinfeld, Ali Vakilian, and Anak Yodpinyanee. 2018. Local Computation Algorithms for Spanners. In 10th Innovations in Theoretical Computer Science Conference (ITCS 2019). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.Google ScholarGoogle Scholar
  104. Aduri Pavan, Srikanta Tirthapura, et al. 2013. Counting and sampling triangles from a graph stream. (2013).Google ScholarGoogle Scholar
  105. David Peleg and Alejandro A Schäffer. 1989. Graph spanners. Journal of graph theory 13, 1 (1989), 99--116.Google ScholarGoogle ScholarCross RefCross Ref
  106. David Peleg and Jeffrey D Ullman. 1989. An optimal synchronizer for the hypercube. SIAM Journal on computing 18, 4 (1989), 740--747.Google ScholarGoogle Scholar
  107. Seth Pettie. 2010. Distributed algorithms for ultrasparse spanners and linear size skeletons. Distributed Computing 22, 3 (2010), 147--166.Google ScholarGoogle ScholarDigital LibraryDigital Library
  108. Adam Polak. 2016. Counting triangles in large graphs on GPU. In 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). IEEE, 740--746.Google ScholarGoogle ScholarCross RefCross Ref
  109. Rajeev Raman, Venkatesh Raman, and Srinivasa Rao Satti. 2007. Succinct indexable dictionaries with applications to encoding k-ary trees, prefix sums and multisets. ACM Transactions on Algorithms (TALG) 3, 4 (2007), 43.Google ScholarGoogle ScholarDigital LibraryDigital Library
  110. Raimundo Real and Juan M Vargas. 1996. The probabilistic basis of Jaccard's index of similarity. Systematic biology 45, 3 (1996), 380--385.Google ScholarGoogle Scholar
  111. Matteo Riondato, David García-Soriano, and Francesco Bonchi. 2017. Graph summarization with quality guarantees. Data Mining and Knowledge Discovery 31, 2 (2017), 314--349.Google ScholarGoogle ScholarDigital LibraryDigital Library
  112. Matteo Riondato and Evgenios M Kornaropoulos. 2016. Fast approximation of betweenness centrality through sampling. Data Mining and Knowledge Discovery 30, 2 (2016), 438--475.Google ScholarGoogle ScholarDigital LibraryDigital Library
  113. Matteo Riondato and Eli Upfal. 2018. ABRA: Approximating betweenness centrality in static and dynamic graphs with rademacher averages. ACM Transactions on Knowledge Discovery from Data (TKDD) 12, 5 (2018), 61.Google ScholarGoogle ScholarDigital LibraryDigital Library
  114. Liam Roditty and Virginia Vassilevska Williams. 2013. Fast approximation algorithms for the diameter and radius of sparse graphs. In Proceedings of the forty-fifth annual ACM symposium on Theory of computing. ACM, 515--524.Google ScholarGoogle ScholarDigital LibraryDigital Library
  115. Amin Sadri, Flora D. Salim, Yongli Ren, Masoomeh Zameni, Jeffrey Chan, and Timos Sellis. 2017. Shrink: Distance preserving graph compression. Information Systems 69 (2017), 180--193.Google ScholarGoogle ScholarDigital LibraryDigital Library
  116. Semih Salihoglu and Jennifer Widom. 2014. Optimizing graph algorithms on Pregel-like systems. Proceedings of the VLDB Endowment 7, 7 (2014), 577--588.Google ScholarGoogle ScholarDigital LibraryDigital Library
  117. Ahmet Erdem Sariyüce, Kamer Kaya, Erik Saule, and Ümit V Çatalyürek. 2013. Betweenness centrality on GPUs and heterogeneous architectures. In Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units. ACM, 76--85.Google ScholarGoogle ScholarDigital LibraryDigital Library
  118. Berkant Savas and Inderjit S Dhillon. 2011. Clustered low rank approximation of graphs in information science applications. In Proceedings of the 2011 SIAM International Conference on Data Mining. SIAM, 164--175.Google ScholarGoogle ScholarCross RefCross Ref
  119. C Seshadhri. 2015. A simpler sublinear algorithm for approximating the triangle count. arXiv preprint arXiv:1505.01927 (2015).Google ScholarGoogle Scholar
  120. Comandur Seshadhri, Ali Pinar, and Tamara G Kolda. 2013. Fast triangle counting through wedge sampling. In Proceedings of the SIAM Conference on Data Mining, Vol. 4. 5.Google ScholarGoogle Scholar
  121. C Seshadhri, Ali Pinar, and Tamara G Kolda. 2014. Wedge sampling for computing clustering coefficients and triangle counts on large graphs. Statistical Analysis and Data Mining: The ASA Data Science Journal 7, 4 (2014), 294--307.Google ScholarGoogle ScholarDigital LibraryDigital Library
  122. Zechao Shang and Jeffrey Xu Yu. 2014. Auto-approximation of graph computing. Proceedings of the VLDB Endowment 7, 14 (2014), 1833--1844.Google ScholarGoogle ScholarDigital LibraryDigital Library
  123. Xuanhua Shi, Zhi gao Zheng, Yongluan Zhou, Hai Jin, Ligang He, Bo Liu, and Qiang-Sheng Hua. 2018. Graph processing on GPUs: A survey. ACM Computing Surveys (CSUR) 50, 6 (2018), 81.Google ScholarGoogle ScholarDigital LibraryDigital Library
  124. Kijung Shin, Amol Ghoting, Myunghwan Kim, and Hema Raghavan. 2019. Sweg: Lossless and lossy summarization of web-scale graphs. In Proceedings of the 28th International Conference on World Wide Web. ACM, Vol. 1. 1--2.Google ScholarGoogle ScholarDigital LibraryDigital Library
  125. Julian Shun and Kanat Tangwongsan. 2015. Multicore triangle computations without tuning. In 2015 IEEE 31st International Conference on Data Engineering. IEEE, 149--160.Google ScholarGoogle ScholarCross RefCross Ref
  126. Somesh Singh and Rupesh Nasre. 2018. Scalable and Performant Graph Processing on GPUs Using Approximate Computing. IEEE Transactions on Multi-Scale Computing Systems 4, 3 (2018), 190--203.Google ScholarGoogle ScholarCross RefCross Ref
  127. George M Slota and Kamesh Madduri. 2014. Complex network analysis using parallel approximate motif counting. In Parallel and Distributed Processing Symposium, 2014 IEEE 28th International. IEEE, 405--414.Google ScholarGoogle ScholarDigital LibraryDigital Library
  128. Tasuku Soma and Yuichi Yoshida. 2019. Spectral Sparsification of Hypergraphs. In Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms. SIAM, 2570--2581.Google ScholarGoogle ScholarCross RefCross Ref
  129. Daniel A Spielman and Nikhil Srivastava. 2011. Graph sparsification by effective resistances. SIAM J. Comput. 40, 6 (2011), 1913--1926.Google ScholarGoogle ScholarDigital LibraryDigital Library
  130. Daniel A Spielman and Shang-Hua Teng. 2011. Spectral sparsification of graphs. SIAM J. Comput. 40, 4 (2011), 981--1025.Google ScholarGoogle ScholarDigital LibraryDigital Library
  131. Xin Sui, Tsung-Hsien Lee, Joyce Jiyoung Whang, Berkant Savas, Saral Jain, Keshav Pingali, and Inderjit Dhillon. 2012. Parallel clustered low-rank approximation of graphs and its application to link prediction. In International Workshop on Languages and Compilers for Parallel Computing. Springer, 76--95.Google ScholarGoogle Scholar
  132. Daniel Yasumasa Takahashi, Joao Ricardo Sato, Carlos Eduardo Ferreira, and André Fujita. 2012. Discriminating different classes of biological networks by analyzing the graphs spectra distribution. PLoS One 7, 12 (2012), e49949.Google ScholarGoogle ScholarCross RefCross Ref
  133. Hannu Toivonen, Fang Zhou, Aleksi Hartikainen, and Atte Hinkka. 2011. Compression of weighted graphs. In Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. 965--973.Google ScholarGoogle ScholarDigital LibraryDigital Library
  134. Hannu Toivonen, Fang Zhou, Aleksi Hartikainen, and Atte Hinkka. 2012. Network compression by node and edge mergers. Bisociative Knowledge Discovery (2012), 199--217.Google ScholarGoogle Scholar
  135. Ioanna Tsalouchidou, Francesco Bonchi, Gianmarco De Francisci Morales, and Ricardo Baeza-Yates. 2018. Scalable Dynamic Graph Summarization. IEEE Transactions on Knowledge and Data Engineering (2018), 1--1.Google ScholarGoogle Scholar
  136. Charalampos E Tsourakakis. [n.d.]. Fast Counting of Triangles in Large Real Networks: Algorithms and Laws. cis. temple. edu ([n.d.]), 608--617.Google ScholarGoogle Scholar
  137. Charalampos E Tsourakakis, U Kang, Gary L Miller, and Christos Faloutsos. 2009. Doulion: counting triangles in massive graphs with a coin. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 837--846.Google ScholarGoogle ScholarDigital LibraryDigital Library
  138. György Turán. 1984. On the succinct representation of graphs. Discrete Applied Mathematics 8, 3 (1984), 289--294.Google ScholarGoogle ScholarCross RefCross Ref
  139. Jason TL Wang, Kaizhong Zhang, and Gung-Wei Chirn. 1995. Algorithms for approximate graph matching. Information Sciences 82, 1-2 (1995), 45--74.Google ScholarGoogle ScholarDigital LibraryDigital Library
  140. Leyuan Wang, Yangzihao Wang, Carl Yang, and John D Owens. 2016. A comparative study on exact triangle counting algorithms on the gpu. In Proceedings of the ACM Workshop on High Performance Graph Processing. ACM, 1--8.Google ScholarGoogle ScholarDigital LibraryDigital Library
  141. Tianyi Wang, Yang Chen, Zengbin Zhang, Tianyin Xu, Long Jin, Pan Hui, Beixing Deng, and Xing Li. 2011. Understanding graph sampling algorithms for social network analysis. In 2011 31st international conference on distributed computing systems workshops. IEEE, 123--128.Google ScholarGoogle ScholarDigital LibraryDigital Library
  142. Yongyu Wang and Zhuo Feng. 2017. Towards scalable spectral clustering via spectrum-preserving sparsification. arXiv preprint arXiv.1710.04584 (2017).Google ScholarGoogle Scholar
  143. Michael M Wolf, Mehmet Deveci, Jonathan W Berry, Simon D Hammond, and Sivasankaran Rajamanickam. 2017. Fast linear algebra-based triangle counting with kokkoskernels. In 2017 IEEE High Performance Extreme Computing Conference (HPEC). IEEE, 1--7.Google ScholarGoogle ScholarCross RefCross Ref
  144. Da Yan, James Cheng, Kai Xing, Yi Lu, Wilfred Ng, and Yingyi Bu. 2014. Pregel algorithms for graph connectivity problems with performance guarantees. Proceedings of the VLDB Endowment 7, 14 (2014), 1821--1832.Google ScholarGoogle ScholarDigital LibraryDigital Library
  145. Abdurrahman Yaşar, Sivasankaran Rajamanickam, Michael Wolf, Jonathan Berry, and Ümit V Çatalyürek. 2018. Fast Triangle Counting Using Cilk. In 2018 IEEE High Performance extreme Computing Conference (HPEC). IEEE, 1--7.Google ScholarGoogle Scholar
  146. Hao Zhang, Gang Chen, Beng Chin Ooi, Kian-Lee Tan, and Meihui Zhang. 2015. In-memory big data management and processing: A survey. IEEE Transactions on Knowledge and Data Engineering 27, 7 (2015), 1920--1948.Google ScholarGoogle ScholarDigital LibraryDigital Library
  147. Yunquan Zhang, Ting Cao, Shigang Li, Xinhui Tian, Liang Yuan, Haipeng Jia, and Athanasios V Vasilakos. 2016. Parallel processing systems for big data: a survey. Proc. IEEE 104, 11 (2016), 2114--2136.Google ScholarGoogle ScholarCross RefCross Ref
  148. Ying Zhang, Zhiqiang Zhao, and Zhuo Feng. 2018. Towards Scalable Spectral Sparsification of Directed Graphs. arXiv preprint arXiv:1812.04165 (2018).Google ScholarGoogle Scholar
  149. Fang Zhou, Qiang Qu, and Hannu Toivonen. 2017. Summarisation of weighted networks. Journal of Experimental and Theoretical Artificial Intelligence 29, 5 (2017), 1023--1052.Google ScholarGoogle ScholarCross RefCross Ref
  1. Slim graph: practical lossy graph compression for approximate graph processing, storage, and analytics

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SC '19: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
      November 2019
      1921 pages
      ISBN:9781450362290
      DOI:10.1145/3295500

      Copyright © 2019 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 17 November 2019

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,516of6,373submissions,24%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader