research-article

Slim graph: practical lossy graph compression for approximate graph processing, storage, and analytics

Authors:

Lukas Gianinazzi,

Robert Gerstenberger,

Yishai Oltchik,

Torsten HoeflerAuthors Info & Claims

SC '19: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis

Article No.: 35, Pages 1 - 25

https://doi.org/10.1145/3295500.3356182

Published: 17 November 2019 Publication History

Abstract

We propose Slim Graph: the first programming model and framework for practical lossy graph compression that facilitates high-performance approximate graph processing, storage, and analytics. Slim Graph enables the developer to express numerous compression schemes using small and programmable compression kernels that can access and modify local parts of input graphs. Such kernels are executed in parallel by the underlying engine, isolating developers from complexities of parallel programming. Our kernels implement novel graph compression schemes that preserve numerous graph properties, for example connected components, minimum spanning trees, or graph spectra. Finally, Slim Graph uses statistical divergences and other metrics to analyze the accuracy of lossy graph compression. We illustrate both theoretically and empirically that Slim Graph accelerates numerous graph algorithms, reduces storage used by graph datasets, and ensures high accuracy of results. Slim Graph may become the common ground for developing, executing, and analyzing emerging lossy graph compression schemes.

References

[1]

[n.d.]. Hyperlink Graph 2012. http://webdatacommons.org/hyperlinkgraph/2012-08/download.html.

[2]

Kook Jin Ahn, Sudipto Guha, and Andrew McGregor. 2012. Analyzing graph structure via linear measurements. In Proceedings of the twenty-third annual ACM-SIAM symposium on Discrete Algorithms. SIAM, 459--467.

[3]

Kook Jin Ahn, Sudipto Guha, and Andrew McGregor. 2012. Graph sketches: sparsification, spanners, and subgraphs. In Proceedings of the 31st ACM SIGMOD-SIGACT-SIGAI symposium on Principles of Database Systems. ACM, 5--14.

Digital Library

[4]

Stephen Alstrup, Søren Dahlgaard, Arnold Filtser, Morten Stöckel, and Christian Wulff-Nilsen. 2017. Constructing light spanners deterministically in near-linear time. arXiv preprint arXiv.1709.01960 (2017).

[5]

Ingo Althöfer, Gautam Das, David Dobkin, Deborah Joseph, and José Soares. 1993. On sparse spanners of weighted graphs. Discrete & Computational Geometry 9, 1 (1993), 81--100.

[6]

David G Anderson, Ming Gu, and Christopher Melgaard. 2014. An efficient algorithm for unweighted spectral graph sparsification. arXiv preprint arXiv:1410A273 (2014).

[7]

David A Bader, Shiva Kintali, Kamesh Madduri, and Milena Mihail. 2007. Approximating betweenness centrality. In International Workshop on Algorithms and Models for the Web-Graph. Springer, 124--137.

Digital Library

[8]

Michèle Basseville. 2010. Divergence measures for statistical data processing. (2010).

[9]

Surender Baswana, Telikepalli Kavitha, Kurt Mehlhorn, and Seth Pettie. 2010. Additive spanners and (α, β)-spanners. ACM Transactions on Algorithms (TALG) 7, 1 (2010), 5.

Digital Library

[10]

Surender Baswana and Sandeep Sen. 2007. A simple and linear time randomized algorithm for computing sparse spanners in weighted graphs. Random Structures & Algorithms 30, 4 (2007), 532--563.

[11]

Omar Batarfi, Radwa El Shawi, Ayman G Fayoumi, Reza Nouri, Ahmed Barnawi, Sherif Sakr, et al. 2015. Large scale graph processing systems: survey and an experimental evaluation. Cluster Computing 18, 3 (2015), 1189--1213.

Digital Library

[12]

Joshua Batson, Daniel A Spielman, Nikhil Srivastava, and Shang-Hua Teng. 2013. Spectral sparsification of graphs: theory and algorithms. Commun. ACM 56, 8 (2013), 87--94.

Digital Library

[13]

Scott Beamer, Krste Asanović, and David Patterson. 2015. The GAP benchmark suite. arXiv preprint arXiv.1508.03619 (2015).

[14]

Maham Anwar Beg, Muhammad Ahmad, Arif Zaman, and Imdadullah Khan. 2018. Scalable Approximation Algorithm for Graph Summarization. pacific-asia conference on knowledge discovery and data mining (2018), 502--514.

[15]

András A Benczúr and David R Karger. 1996. Approximating st minimum cuts in O (n 2) time. In Proceedings of the twenty-eighth annual ACM symposium on Theory of computing. ACM, 47--55.

[16]

Suman K Bera and Amit Chakrabarti. 2017. Towards tighter space bounds for counting triangles and other substructures in graph streams. In 34th Symposium on Theoretical Aspects of Computer Science (STACS 2017). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.

[17]

Maciej Besta and Torsten Hoefler. 2018. Survey and Taxonomy of Lossless Graph Compression and Space-Efficient Graph Representations. arXiv preprint arXiv:1806.01799 (2018).

[18]

Maciej Besta, Michał Podstawski, Linus Groner, Edgar Solomonik, and Torsten Hoefler. 2017. To push or to pull: On reducing communication and synchronization in graph computations. In Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing. ACM, 93--104.

Digital Library

[19]

Maciej Besta, Dimitri Stanojevic, Tijana Zivic, Jagpreet Singh, Maurice Hoerold, and Torsten Hoefler. 2018. Log (graph): a near-optimal high-performance graph representation. In Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques. ACM, 7.

Digital Library

[20]

Paolo Boldi, Marco Rosa, and Sebastiano Vigna. 2011. HyperANF: Approximating the neighbourhood function of very large graphs on a budget. In Proceedings of the 20th international conference on World wide web. ACM, 625--634.

Digital Library

[21]

Paolo Boldi and Sebastiano Vigna. 2004. The webgraph framework I: compression techniques. In Proceedings of the 13th international conference on World Wide Web. ACM, 595--602.

Digital Library

[22]

Michele Borassi and Emanuele Natale. 2016. KADABRA is an adaptive algorithm for betweenness via random approximation. arXiv preprint arXiv:1604.08553 (2016).

[23]

Stéphane Boucheron, Gábor Lugosi, and Pascal Massart. 2013. Concentration inequalities: A nonasymptotic theory of independence. Oxford university press.

[24]

Ulrik Brandes. 2001. A faster algorithm for betweenness centrality. Journal of mathematical sociology 25, 2 (2001), 163--177.

[25]

Nieves R Brisaboa, Susana Ladra, and Gonzalo Navarro. 2009. k2-Trees for Compact Web Graph Representation. In SPIRE, Vol. 9. Springer, 18--30.

[26]

Luciana S Buriol, Gereon Frahling, Stefano Leonardi, Alberto Marchetti-Spaccamela, and Christian Sohler. 2006. Counting triangles in data streams. In Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. ACM, 253--262.

Digital Library

[27]

Daniele Calandriello, Ioannis Koutis, Alessandro Lazaric, and Michal Valko. 2018. Improved large-scale graph learning through ridge spectral sparsification. In International Conference on Machine Learning.

[28]

Stéphane Campinas, Renaud Delbru, and Giovanni Tummarello. 2013. Efficiency and precision trade-offs in graph summary algorithms. In Proceedings of the 17th International Database Engineering and Applications Symposium on. 38--47.

Digital Library

[29]

Keren Censor-Hillel and Michal Dory. 2018. Distributed spanner approximation. In Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing. ACM, 139--148.

Digital Library

[30]

Keren Censor-Hillel, Ami Paz, and Noam Ravid. 2018. The Sparsest Additive Spanner via Multiple Weighted BFS Trees. arXiv preprint arXiv:1811.01997 (2018).

[31]

Sung-Hyuk Cha. 2007. Comprehensive survey on distance/similarity measures between probability density functions. City 1, 2 (2007), 1.

[32]

Bernard Chazelle, Ronitt Rubinfeld, and Luca Trevisan. 2005. Approximating the minimum spanning tree weight in sublinear time. SIAM Journal on computing 34, 6 (2005), 1370--1379.

[33]

Shiri Chechik, Daniel H Larkin, Liam Roditty, Grant Schoenebeck, Robert E Tarjan, and Virginia Vassilevska Williams. 2014. Better approximation algorithms for the graph diameter. In Proceedings of the twenty-fifth annual ACM-SIAM symposium on Discrete algorithms. Society for Industrial and Applied Mathematics, 1041--1052.

[34]

Mostafa Haghir Chehreghani, Albert Bifet, and Talel Abdessalem. 2018. Efficient Exact and Approximate Algorithms for Computing Betweenness Centrality in Directed Graphs. In Pacific-Asia Conference on Knowledge Discovery and Data Mining. Springer, 752--764.

[35]

Chen Chen, Cindy Xide Lin, Matt Fredrikson, Mihai Christodorescu, Xifeng Yan, and Jiawei Han. 2009. Mining graph patterns efficiently via randomized summaries. very large data bases 2, 1 (2009), 742--753.

[36]

Avery Ching, Sergey Edunov, Maja Kabiljo, Dionysios Logothetis, and Sambavi Muthukrishnan. 2015. One trillion edges: Graph processing at facebook-scale. Proceedings of the VLDB Endowment 8, 12 (2015), 1804--1815.

Digital Library

[37]

Nicos Christofides. 1976. Worst-case analysis of a new heuristic for the travelling salesman problem. Technical Report. Carnegie-Mellon Univ Pittsburgh Pa Management Sciences Research Group.

[38]

Timothy Chu, Yu Gao, Richard Peng, Sushant Sachdeva, Saurabh Sawlani, and Junxing Wang. 2018. Graph sparsification, spectral sketches, and faster resistance computation, via short cycle decompositions. In 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS). IEEE, 361--372.

[39]

Thomas H Cormen, Charles E Leiserson, Ronald L Rivest, and Clifford Stein. 2009. Introduction to algorithms. MIT press.

[40]

Graham Cormode and Hossein Jowhari. 2017. A second look at counting triangles in graph streams (corrected). Theoretical Computer Science 683 (2017), 22--30.

[41]

Thomas M Cover and Joy A Thomas. 2012. Elements of information theory. John Wiley & Sons.

[42]

Ketan Date, Keven Feng, Rakesh Nagi, Jinjun Xiong, Nam Sung Kim, and Wen-Mei Hwu. 2017. Collaborative (cpu+ gpu) algorithms for triangle counting and truss decomposition on the minsky architecture: Static graph challenge: Subgraph isomorphism. In 2017 IEEE High Performance Extreme Computing Conference (HPEC). IEEE, 1--7.

[43]

Etienne de Klerk, Dmitrii V Pasechnik, and Joost P Warners. 2004. On approximate graph colouring and max-k-cut algorithms based on the θ-function. Journal of Combinatorial Optimization 8, 3 (2004), 267--294.

[44]

Camil Demetrescu, Andrew V Goldberg, and David S Johnson. 2009. The Shortest Path Problem: Ninth DIMLACS Implementation Challenge. Vol. 74. American Math. Soc.

[45]

Niels Doekemeijer and Ana Lucia Varbanescu. 2014. A survey of parallel graph processing frameworks. Delft University of Technology (2014), 21.

[46]

Michal Dory. 2018. Distributed Approximation of Minimum k-edge-connected Spanning Subgraphs. In Proceedings of the 2018 ACM Symposium on Principles of Distributed Computing. ACM, 149--158.

Digital Library

[47]

Stefania Dumbrava, Angela Bonifati, Amaia Nazabal Ruiz Diaz, and Romain Vuillemot. 2018. Approximate Evaluation of Label-Constrained Reachability Queries. arXiv preprint arXiv.1811.11561 (2018).

[48]

Cody Dunne and Ben Shneiderman. 2013. Motif simplification: improving network visualization readability with fan, connector, and clique glyphs. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 3247--3256.

Digital Library

[49]

Ghizlane ECHBARTHI and Hamamache KHEDDOUCI. 2017. Lasas: an aggregated search based graph matching approach. In The 29th International Conference on Software Engineering and Knowledge Engineering.

[50]

Talya Eden, Amit Levi, Dana Ron, and C Seshadhri. 2017. Approximately counting triangles in sublinear time. SIAM J. Comput. 46, 5 (2017), 1603--1646.

Digital Library

[51]

Michael Elkin and Ofer Neiman. 2018. Efficient algorithms for constructing very sparse spanners and emulators. ACM Transactions on Algorithms (TALG) 15, 1 (2018), 4.

[52]

Guy Even, Joseph Naor, Satish Rao, and Baruch Schieber. 1999. Fast approximate graph partitioning algorithms. SIAM J. Comput. 28, 6 (1999), 2187--2214.

Digital Library

[53]

Wenfei Fan, Jianzhong Li, Xin Wang, and Yinghui Wu. 2012. Query preserving graph compression. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data. 157--168.

Digital Library

[54]

Arash Farzan and J Ian Munro. 2008. Succinct representations of arbitrary graphs. In European Symposium on Algorithms. Springer, 393--404.

Digital Library

[55]

Zhuo Feng. 2016. Spectral graph sparsification in nearly-linear time leveraging efficient spectral perturbation analysis. In Proceedings of the 53rd Annual Design Automation Conference. ACM, 57.

Digital Library

[56]

Jacob Fox, Tim Roughgarden, C Seshadhri, Fan Wei, and Nicole Wein. 2018. Finding cliques in social networks: A new distribution-free model. arXiv preprint arXiv:1804.07431 (2018).

[57]

Haohuan Fu, Junfeng Liao, Jinzhe Yang, Lanning Wang, Zhenya Song, Xiaomeng Huang, Chao Yang, Wei Xue, Fangfang Liu, Fangli Qiao, et al. 2016. The Sunway TaihuLight supercomputer: system and applications. Science China Information Sciences 59, 7 (2016), 072001.

[58]

Robert Geisberger, Peter Sanders, and Dominik Schultes. 2008. Better approximation of betweenness centrality. In Proceedings of the Meeting on Algorithm Engineering & Expermiments. Society for Industrial and Applied Mathematics, 90--100.

Digital Library

[59]

Robert Gerstenberger, Maciej Besta, and Torsten Hoefler. 2014. Enabling highly-scalable remote memory access programming with MPI-3 one sided. Scientific Programming 22, 2 (2014), 75--91.

Digital Library

[60]

Oded Green, Pavan Yalamanchili, and Lluís-Miquel Munguía. 2014. Fast triangle counting on the GPU. In Proceedings of the 4th Workshop on Irregular Applications: Architectures and Algorithms. IEEE Press, 1--8.

Digital Library

[61]

Sudipto Guha and Andrew McGregor. 2012. Graph synopses, sketches, and streams: A survey. Proceedings of the VLDB Endowment 5, 12 (2012), 2030--2031.

Digital Library

[62]

Magnús M Halldórsson. 1993. A still better performance guarantee for approximate graph coloring. Inform. Process. Lett. 45, 1 (1993), 19--23.

Digital Library

[63]

Safiollah Heidari, Yogesh Simmhan, Rodrigo N Calheiros, and Rajkumar Buyya. 2018. Scalable graph processing frameworks: A taxonomy and open challenges. ACM Computing Surveys (CSUR) 51, 3 (2018), 60.

Digital Library

[64]

Wilko Henecka and Matthew Roughan. 2015. Lossy compression of dynamic, weighted graphs. In 2015 3rd International Conference on Future Internet of Things and Cloud. IEEE, 427--434.

Digital Library

[65]

Pili Hu and Wing Cheong Lau. 2013. A survey and taxonomy of graph sampling. arXiv preprint arXiv:1308.5865 (2013).

[66]

Yang Hu, Hang Liu, and H Howie Huang. 2018. High-Performance Triangle Counting on GPUs. In 2018 IEEE High Performance extreme Computing Conference (HPEC). IEEE, 1--5.

[67]

Anand Padmanabha Iyer, Zaoxing Liu, Xin Jin, Shivaram Venkataraman, Vladimir Braverman, and Ion Stoica. 2018. {ASAP}: Fast, Approximate Graph Pattern Mining at Scale. In 13th {USENIX} Symposium on Operating Systems Design and Implementation ({OSDI} 18). 745--761.

[68]

Anand Padmanabha Iyer, Aurojit Panda, Shivaram Venkataraman, Mosharaf Chowdhury, Aditya Akella, Scott Shenker, and Ion Stoica. 2018. Bridging the GAP: towards approximate graph analytics. In Proceedings of the 1st ACM SIGMOD Joint International Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA). ACM, 10.

Digital Library

[69]

Arun Jambulapati and Aaron Sidford. 2018. Efficient O (n/epsilon) Spectral Sketches for the Laplacian and its Pseudoinverse. In Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete Algorithms. SIAM, 2487--2503.

[70]

Madhav Jha, Ali Pinar, and C Seshadhri. 2015. Counting triangles in real-world graph streams: Dealing with repeated edges and time windows. In 2015 49th Asilomar Conference on Signals, Systems and Computers. IEEE, 1507--1514.

[71]

Madhav Jha, C Seshadhri, and Ali Pinar. 2015. A space-efficient streaming algorithm for estimating transitivity and triangle counts using the birthday paradox. ACM Transactions on Knowledge Discovery from Data (TKDD) 9, 3 (2015), 15.

[72]

Hossein Jowhari and Mohammad Ghodsi. 2005. New streaming algorithms for counting triangles in graphs. In International Computing and Combinatorics Conference. Springer, 710--716.

[73]

Vasiliki Kalavri, Tiago Simas, and Dionysios Logothetis. 2016. The shortest path is not always a straight line: leveraging semi-metricity in graph analysis. Proceedings of the VLDB Endowment 9, 9 (2016), 672--683.

Digital Library

[74]

David R. Karger. 2000. Minimum cuts in near-linear time. J. ACM 47, 1 (2000), 46--76.

Digital Library

[75]

Jonathan A Kelner and Alex Levin. 2013. Spectral sparsification in the semi-streaming setting. Theory of Computing Systems 53, 2 (2013), 243--262.

[76]

Subhash Khot and Oded Regev. 2008. Vertex cover might be hard to approximate to within 2- ε. J. Comput. System Sci 74, 3 (2008), 335--349.

Digital Library

[77]

Ioannis Koutis and Shen Chen Xu. 2016. Simple parallel and distributed algorithms for spectral graph sparsification. ACM Transactions on Parallel Computing (TOPC) 3, 2 (2016), 14.

[78]

Solomon Kullback. 1997. Information theory and statistics. Courier Corporation.

[79]

K. Ashwin Kumar and Petros Efstathopoulos. 2018. Utility-driven graph summarization. very large data bases 12, 4 (2018), 335--347.

[80]

Jérôme Kunegis. 2013. Konect: the koblenz network collection. In Proc. of Intl. Conf. on World Wide Web (WWW). ACM, 1343--1350.

Digital Library

[81]

Rasmus Kyng and Zhao Song. 2018. A Matrix Chernoff Bound for Strongly Rayleigh Distributions and Spectral Sparsifiers from a few Random Spanning Trees. In 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS). IEEE, 373--384.

[82]

Kyong-Ha Lee, Yoon-Joon Lee, Hyunsik Choi, Yon Dohn Chung, and Bongki Moon. 2012. Parallel data processing with MapReduce: a survey. AcM sIGMoD Record 40, 4 (2012), 11--20.

[83]

Yin Tat Lee and He Sun. 2018. Constructing linear-sized spectral sparsification in almost-linear time. SIAM J. Comput. 47, 6 (2018), 2315--2336.

[84]

Christoph Lenzen and Reut Levi. 2018. A Centralized Local Algorithm for the Sparse Spanning Graph Problem. In 45th International Colloquium on Automata, Languages, and Programming (ICALP 2018). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.

[85]

Jure Leskovec and Christos Faloutsos. 2006. Sampling from large graphs. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 631--636.

Digital Library

[86]

Jure Leskovec and Andrej Krevl. 2014. SNAP Datasets: Stanford Large Network Dataset Collection. http://snap.stanford.edu/data.

[87]

Heng Lin, Xiaowei Zhu, Bowen Yu, Xiongchao Tang, Wei Xue, Wenguang Chen, Lufei Zhang, Torsten Hoefler, Xiaosong Ma, Xin Liu, et al. 2018. ShenTu: processing multi-trillion edge graphs on millions of cores in seconds. In Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis. IEEE Press, 56.

Digital Library

[88]

Wei Liu, Andrey Kan, Jeffrey Chan, James Bailey, Christopher Leckie, Jian Pei, and Ramamohanarao Kotagiri. 2012. On compressing weighted time-evolving graphs. In Proceedings of the 21st ACM international conference on Information and knowledge management. ACM, 2319--2322.

Digital Library

[89]

Xingjie Liu, Yuanyuan Tian, Qi He, Wang-Chien Lee, and John McPherson. 2014. Distributed Graph Summarization. In Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. 799--808.

Digital Library

[90]

Yike Liu, Tara Safavi, Abhilash Dighe, and Danai Koutra. 2018. Graph Summarization Methods and Applications: A Survey. Comput. Surveys 51, 3 (2018), 62.

Digital Library

[91]

Yi Lu, James Cheng, Da Yan, and Huanhuan Wu. 2014. Large-scale distributed graph computing systems: An experimental evaluation. Proceedings of the VLDB Endowment 8, 3 (2014), 281--292.

Digital Library

[92]

Andrew Lumsdaine, Douglas Gregor, Bruce Hendrickson, and Jonathan W. Berry. 2007. Challenges in Parallel Graph Processing. Par. Proc. Let. 17, 1 (2007), 5--20.

[93]

Grzegorz Malewicz, Matthew H. Austern, Aart J.C Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski. 2010. Pregel: a system for large-scale graph processing. In Proc. of the ACM SIGMOD Intl. Conf. on Manag. of Data (SIGMOD '10). ACM, New York, NY, USA, 135--146.

Digital Library

[94]

Hossein Maserrat and Jian Pei. 2012. Community Preserving Lossy Compression of Social Networks. In 2012 IEEE 12th International Conference on Data Mining. 509--518.

[95]

Robert Ryan McCune, Tim Weninger, and Greg Madey. 2015. Thinking like a vertex: a survey of vertex-centric frameworks for large-scale distributed graph processing. ACM Computing Surveys (CSUR) 48, 2 (2015), 25.

Digital Library

[96]

Andrew McGregor, Sofya Vorotnikova, and Hoa T Vu. 2016. Better algorithms for counting triangles in data streams. In Proceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems. ACM, 401--411.

Digital Library

[97]

Gary L Miller, Richard Peng, Adrian Vladu, and Shen Chen Xu. 2015. Improved parallel algorithms for spanners and hopsets. In Proceedings of the 27th ACM Symposium on Parallelism in Algorithms and Architectures. ACM, 192--201.

Digital Library

[98]

Richard C Murphy, Kyle B Wheeler, Brian W Barrett, and James A Ang. 2010. Introducing the graph 500. Cray User's Group (CUG) (2010).

[99]

Moni Naor. 1990. Succinct representation of general unlabeled graphs. Discrete Applied Mathematics 28, 3 (1990), 303--307.

Digital Library

[100]

Saket Navlakha, Rajeev Rastogi, and Nisheeth Shrivastava. 2008. Graph summarization with bounded error. In Proceedings of the 2008 ACM SIGMOD international conference on Management of data. ACM, 419--432.

Digital Library

[101]

Donald Nguyen, Andrew Lenharth, and Keshav Pingali. 2013. A lightweight infrastructure for graph analytics. In Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles. ACM, 456--471.

Digital Library

[102]

Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. 1999. The PageRank citation ranking: Bringing order to the web. Technical Report. Stanford InfoLab.

[103]

Merav Parter, Ronitt Rubinfeld, Ali Vakilian, and Anak Yodpinyanee. 2018. Local Computation Algorithms for Spanners. In 10th Innovations in Theoretical Computer Science Conference (ITCS 2019). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.

[104]

Aduri Pavan, Srikanta Tirthapura, et al. 2013. Counting and sampling triangles from a graph stream. (2013).

[105]

David Peleg and Alejandro A Schäffer. 1989. Graph spanners. Journal of graph theory 13, 1 (1989), 99--116.

[106]

David Peleg and Jeffrey D Ullman. 1989. An optimal synchronizer for the hypercube. SIAM Journal on computing 18, 4 (1989), 740--747.

[107]

Seth Pettie. 2010. Distributed algorithms for ultrasparse spanners and linear size skeletons. Distributed Computing 22, 3 (2010), 147--166.

Digital Library

[108]

Adam Polak. 2016. Counting triangles in large graphs on GPU. In 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). IEEE, 740--746.

[109]

Rajeev Raman, Venkatesh Raman, and Srinivasa Rao Satti. 2007. Succinct indexable dictionaries with applications to encoding k-ary trees, prefix sums and multisets. ACM Transactions on Algorithms (TALG) 3, 4 (2007), 43.

Digital Library

[110]

Raimundo Real and Juan M Vargas. 1996. The probabilistic basis of Jaccard's index of similarity. Systematic biology 45, 3 (1996), 380--385.

[111]

Matteo Riondato, David García-Soriano, and Francesco Bonchi. 2017. Graph summarization with quality guarantees. Data Mining and Knowledge Discovery 31, 2 (2017), 314--349.

Digital Library

[112]

Matteo Riondato and Evgenios M Kornaropoulos. 2016. Fast approximation of betweenness centrality through sampling. Data Mining and Knowledge Discovery 30, 2 (2016), 438--475.

Digital Library

[113]

Matteo Riondato and Eli Upfal. 2018. ABRA: Approximating betweenness centrality in static and dynamic graphs with rademacher averages. ACM Transactions on Knowledge Discovery from Data (TKDD) 12, 5 (2018), 61.

Digital Library

[114]

Liam Roditty and Virginia Vassilevska Williams. 2013. Fast approximation algorithms for the diameter and radius of sparse graphs. In Proceedings of the forty-fifth annual ACM symposium on Theory of computing. ACM, 515--524.

Digital Library

[115]

Amin Sadri, Flora D. Salim, Yongli Ren, Masoomeh Zameni, Jeffrey Chan, and Timos Sellis. 2017. Shrink: Distance preserving graph compression. Information Systems 69 (2017), 180--193.

Digital Library

[116]

Semih Salihoglu and Jennifer Widom. 2014. Optimizing graph algorithms on Pregel-like systems. Proceedings of the VLDB Endowment 7, 7 (2014), 577--588.

Digital Library

[117]

Ahmet Erdem Sariyüce, Kamer Kaya, Erik Saule, and Ümit V Çatalyürek. 2013. Betweenness centrality on GPUs and heterogeneous architectures. In Proceedings of the 6th Workshop on General Purpose Processor Using Graphics Processing Units. ACM, 76--85.

Digital Library

[118]

Berkant Savas and Inderjit S Dhillon. 2011. Clustered low rank approximation of graphs in information science applications. In Proceedings of the 2011 SIAM International Conference on Data Mining. SIAM, 164--175.

[119]

C Seshadhri. 2015. A simpler sublinear algorithm for approximating the triangle count. arXiv preprint arXiv:1505.01927 (2015).

[120]

Comandur Seshadhri, Ali Pinar, and Tamara G Kolda. 2013. Fast triangle counting through wedge sampling. In Proceedings of the SIAM Conference on Data Mining, Vol. 4. 5.

[121]

C Seshadhri, Ali Pinar, and Tamara G Kolda. 2014. Wedge sampling for computing clustering coefficients and triangle counts on large graphs. Statistical Analysis and Data Mining: The ASA Data Science Journal 7, 4 (2014), 294--307.

Digital Library

[122]

Zechao Shang and Jeffrey Xu Yu. 2014. Auto-approximation of graph computing. Proceedings of the VLDB Endowment 7, 14 (2014), 1833--1844.

Digital Library

[123]

Xuanhua Shi, Zhi gao Zheng, Yongluan Zhou, Hai Jin, Ligang He, Bo Liu, and Qiang-Sheng Hua. 2018. Graph processing on GPUs: A survey. ACM Computing Surveys (CSUR) 50, 6 (2018), 81.

Digital Library

[124]

Kijung Shin, Amol Ghoting, Myunghwan Kim, and Hema Raghavan. 2019. Sweg: Lossless and lossy summarization of web-scale graphs. In Proceedings of the 28th International Conference on World Wide Web. ACM, Vol. 1. 1--2.

Digital Library

[125]

Julian Shun and Kanat Tangwongsan. 2015. Multicore triangle computations without tuning. In 2015 IEEE 31st International Conference on Data Engineering. IEEE, 149--160.

[126]

Somesh Singh and Rupesh Nasre. 2018. Scalable and Performant Graph Processing on GPUs Using Approximate Computing. IEEE Transactions on Multi-Scale Computing Systems 4, 3 (2018), 190--203.

[127]

George M Slota and Kamesh Madduri. 2014. Complex network analysis using parallel approximate motif counting. In Parallel and Distributed Processing Symposium, 2014 IEEE 28th International. IEEE, 405--414.

Digital Library

[128]

Tasuku Soma and Yuichi Yoshida. 2019. Spectral Sparsification of Hypergraphs. In Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms. SIAM, 2570--2581.

[129]

Daniel A Spielman and Nikhil Srivastava. 2011. Graph sparsification by effective resistances. SIAM J. Comput. 40, 6 (2011), 1913--1926.

Digital Library

[130]

Daniel A Spielman and Shang-Hua Teng. 2011. Spectral sparsification of graphs. SIAM J. Comput. 40, 4 (2011), 981--1025.

Digital Library

[131]

Xin Sui, Tsung-Hsien Lee, Joyce Jiyoung Whang, Berkant Savas, Saral Jain, Keshav Pingali, and Inderjit Dhillon. 2012. Parallel clustered low-rank approximation of graphs and its application to link prediction. In International Workshop on Languages and Compilers for Parallel Computing. Springer, 76--95.

[132]

Daniel Yasumasa Takahashi, Joao Ricardo Sato, Carlos Eduardo Ferreira, and André Fujita. 2012. Discriminating different classes of biological networks by analyzing the graphs spectra distribution. PLoS One 7, 12 (2012), e49949.

[133]

Hannu Toivonen, Fang Zhou, Aleksi Hartikainen, and Atte Hinkka. 2011. Compression of weighted graphs. In Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining. 965--973.

Digital Library

[134]

Hannu Toivonen, Fang Zhou, Aleksi Hartikainen, and Atte Hinkka. 2012. Network compression by node and edge mergers. Bisociative Knowledge Discovery (2012), 199--217.

[135]

Ioanna Tsalouchidou, Francesco Bonchi, Gianmarco De Francisci Morales, and Ricardo Baeza-Yates. 2018. Scalable Dynamic Graph Summarization. IEEE Transactions on Knowledge and Data Engineering (2018), 1--1.

[136]

Charalampos E Tsourakakis. [n.d.]. Fast Counting of Triangles in Large Real Networks: Algorithms and Laws. cis. temple. edu ([n.d.]), 608--617.

[137]

Charalampos E Tsourakakis, U Kang, Gary L Miller, and Christos Faloutsos. 2009. Doulion: counting triangles in massive graphs with a coin. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. ACM, 837--846.

Digital Library

[138]

György Turán. 1984. On the succinct representation of graphs. Discrete Applied Mathematics 8, 3 (1984), 289--294.

[139]

Jason TL Wang, Kaizhong Zhang, and Gung-Wei Chirn. 1995. Algorithms for approximate graph matching. Information Sciences 82, 1-2 (1995), 45--74.

Digital Library

[140]

Leyuan Wang, Yangzihao Wang, Carl Yang, and John D Owens. 2016. A comparative study on exact triangle counting algorithms on the gpu. In Proceedings of the ACM Workshop on High Performance Graph Processing. ACM, 1--8.

Digital Library

[141]

Tianyi Wang, Yang Chen, Zengbin Zhang, Tianyin Xu, Long Jin, Pan Hui, Beixing Deng, and Xing Li. 2011. Understanding graph sampling algorithms for social network analysis. In 2011 31st international conference on distributed computing systems workshops. IEEE, 123--128.

Digital Library

[142]

Yongyu Wang and Zhuo Feng. 2017. Towards scalable spectral clustering via spectrum-preserving sparsification. arXiv preprint arXiv.1710.04584 (2017).

[143]

Michael M Wolf, Mehmet Deveci, Jonathan W Berry, Simon D Hammond, and Sivasankaran Rajamanickam. 2017. Fast linear algebra-based triangle counting with kokkoskernels. In 2017 IEEE High Performance Extreme Computing Conference (HPEC). IEEE, 1--7.

[144]

Da Yan, James Cheng, Kai Xing, Yi Lu, Wilfred Ng, and Yingyi Bu. 2014. Pregel algorithms for graph connectivity problems with performance guarantees. Proceedings of the VLDB Endowment 7, 14 (2014), 1821--1832.

Digital Library

[145]

Abdurrahman Yaşar, Sivasankaran Rajamanickam, Michael Wolf, Jonathan Berry, and Ümit V Çatalyürek. 2018. Fast Triangle Counting Using Cilk. In 2018 IEEE High Performance extreme Computing Conference (HPEC). IEEE, 1--7.

[146]

Hao Zhang, Gang Chen, Beng Chin Ooi, Kian-Lee Tan, and Meihui Zhang. 2015. In-memory big data management and processing: A survey. IEEE Transactions on Knowledge and Data Engineering 27, 7 (2015), 1920--1948.

Digital Library

[147]

Yunquan Zhang, Ting Cao, Shigang Li, Xinhui Tian, Liang Yuan, Haipeng Jia, and Athanasios V Vasilakos. 2016. Parallel processing systems for big data: a survey. Proc. IEEE 104, 11 (2016), 2114--2136.

[148]

Ying Zhang, Zhiqiang Zhao, and Zhuo Feng. 2018. Towards Scalable Spectral Sparsification of Directed Graphs. arXiv preprint arXiv:1812.04165 (2018).

[149]

Fang Zhou, Qiang Qu, and Hannu Toivonen. 2017. Summarisation of weighted networks. Journal of Experimental and Theoretical Artificial Intelligence 29, 5 (2017), 1023--1052.

Cited By

Cammarasana SPatané G(2025)Analysis and comparison of high-performance computing solvers for minimisation problems in signal processingMathematics and Computers in Simulation10.1016/j.matcom.2024.10.003229(525-538)Online publication date: Mar-2025
https://doi.org/10.1016/j.matcom.2024.10.003
Wang YKosyfaki CAmer-Yahia SCheng R(2024)A Sampling-Based Framework for Hypothesis Testing on Large Attributed GraphsProceedings of the VLDB Endowment10.14778/3681954.368199317:11(3192-3200)Online publication date: 1-Jul-2024
https://dl.acm.org/doi/10.14778/3681954.3681993
Wanye FGleyzer VKao EFeng W(2024)SamBaS: Sampling-Based Stochastic Block PartitioningIEEE Transactions on Network Science and Engineering10.1109/TNSE.2024.335830111:3(3053-3065)Online publication date: May-2024
https://doi.org/10.1109/TNSE.2024.3358301
Show More Cited By

Slim graph: practical lossy graph compression for approximate graph processing, storage, and analytics
1. Mathematics of computing
  1. Discrete mathematics
    1. Graph theory

Recommendations

Collapsible subgraphs of a 4-edge-connected graph
Abstract
Jaeger in 1979 showed that every 4-edge-connected graph is supereulerian, graphs that have spanning eulerian subgraphs. Catlin in 1988 sharpened Jaeger’s result by showing that every 4-edge-connected graph is collapsible, graphs that are ...
Trivially noncontractible edges in a contraction critically 5-connected graph

An edge of a k-connected graph is said to be k-contractible if the contraction of the edge results in a k-connected graph. A k-connected graph with no k-contractible edge is said to be contraction critically k-connected. An edge of a k-connected graph ...
Traversability and connectivity of the middle graph of a graph

We define a graph M(G) as an intersection graph @W(F) on the point set V(G) of any graph G. Let X(G) be the line set of G and F = V'(G) @__ __ X(G), where V'(G) indicates the family of all one point subsets of the set V(G). Let M(G) = @W(F). M(G) is ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SC '19: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis

November 2019

1921 pages

ISBN:9781450362290

DOI:10.1145/3295500

General Chair:
Michela Taufer,
Program Chairs:
Pavan Balaji,
Antonio J. Peña

Copyright © 2019 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGHPC: ACM Special Interest Group on High Performance Computing, Special Interest Group on High Performance Computing

In-Cooperation

IEEE CS

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 17 November 2019

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Research-article

Funding Sources

European Research Council (ERC)

Conference

SC '19

Sponsor:

SIGHPC

SC '19: The International Conference for High Performance Computing, Networking, Storage, and Analysis

November 17 - 19, 2019

Colorado, Denver

Acceptance Rates

Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

Upcoming Conference

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

21
Total Citations
View Citations
943
Total Downloads

Downloads (Last 12 months)52
Downloads (Last 6 weeks)0

Reflects downloads up to 23 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Cammarasana SPatané G(2025)Analysis and comparison of high-performance computing solvers for minimisation problems in signal processingMathematics and Computers in Simulation10.1016/j.matcom.2024.10.003229(525-538)Online publication date: Mar-2025
https://doi.org/10.1016/j.matcom.2024.10.003
Wang YKosyfaki CAmer-Yahia SCheng R(2024)A Sampling-Based Framework for Hypothesis Testing on Large Attributed GraphsProceedings of the VLDB Endowment10.14778/3681954.368199317:11(3192-3200)Online publication date: 1-Jul-2024
https://dl.acm.org/doi/10.14778/3681954.3681993
Wanye FGleyzer VKao EFeng W(2024)SamBaS: Sampling-Based Stochastic Block PartitioningIEEE Transactions on Network Science and Engineering10.1109/TNSE.2024.335830111:3(3053-3065)Online publication date: May-2024
https://doi.org/10.1109/TNSE.2024.3358301
Anagnostopoulos AArrigoni VGullo FSalvatori GSeverini L(2024)General-purpose query processing on summary graphsSocial Network Analysis and Mining10.1007/s13278-024-01314-w14:1Online publication date: 9-Aug-2024
https://doi.org/10.1007/s13278-024-01314-w
Wafula MVippathalla PCoon JBadiu M(2023)Rate-Distortion Function of the Stochastic Block Model2023 57th Asilomar Conference on Signals, Systems, and Computers10.1109/IEEECONF59524.2023.10476987(699-703)Online publication date: 29-Oct-2023
https://doi.org/10.1109/IEEECONF59524.2023.10476987
Wanye FGleyzer VKao EFeng W(2023)An Integrated Approach for Accelerating Stochastic Block Partitioning2023 IEEE High Performance Extreme Computing Conference (HPEC)10.1109/HPEC58863.2023.10363599(1-7)Online publication date: 25-Sep-2023
https://doi.org/10.1109/HPEC58863.2023.10363599
Besta MMiglioli CLabini PTětek JIff PKanakagiri RAshkboos SJanda KPodstawski MKwaśniewski GGleinig NVella FMutlu OHoefler TWolf FShende SCulhane CAlam SJagode H(2022)ProbGraphProceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis10.5555/3571885.3571942(1-17)Online publication date: 13-Nov-2022
https://dl.acm.org/doi/10.5555/3571885.3571942
Chen XMinutoli MTian JHalappanavar MKalyanaraman ATao DKloeckner AMoreira J(2022)HBMaxProceedings of the International Conference on Parallel Architectures and Compilation Techniques10.1145/3559009.3569647(412-425)Online publication date: 8-Oct-2022
https://dl.acm.org/doi/10.1145/3559009.3569647
Vandierendonck HRauchwerger LCameron KNikolopoulos DPnevmatikatos D(2022)Software-defined floating-point number formats and their application to graph processingProceedings of the 36th ACM International Conference on Supercomputing10.1145/3524059.3532360(1-17)Online publication date: 28-Jun-2022
https://dl.acm.org/doi/10.1145/3524059.3532360
Gou XZou LZhao CYang T(2022)Graph Stream Sketch: Summarizing Graph Streams with High Speed and AccuracyIEEE Transactions on Knowledge and Data Engineering10.1109/TKDE.2022.3174570(1-1)Online publication date: 2022
https://doi.org/10.1109/TKDE.2022.3174570
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten