Abstract
Efficient graph partitioning plays an important role in distributed graph processing systems with the rapid growth of the scale of graph data. The quality of partitioning affects the performance of systems greatly. However, most existing vertex-cut graph partitioning algorithms only focused on degree information and ignored the cluster information of a coming edge when assigning edges. It is beneficial to assign an edge to a partition with more neighbors because keeping a dense subgraph in one partition would reduce the communication cost. In this paper, we propose DETER, an efficient vertex-cut streaming graph partitioning algorithm that takes both degree and cluster information into account when assigning an edge to one partition. Our evaluations suggest that DETER algorithm owns the ability to efficiently partition large graphs and reduce communication cost significantly compared to state-of-the-art graph partitioning algorithms.
Supported in part by National Key Research and Development Program of China under Grant 2017YFB1402400, in part by graduate research and innovation foundation of Chongqing, China under Grant CYB18058, in part by the Fundamental Research Funds for the Central Universities under Grant 2018CDYJSY0055, in part by the Frontier and Application Foundation Research Program, PR China of CQ CSTC under Grant cstc2017jcyjAX0340.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Andreev, K., Racke, H.: Balanced graph partitioning. Theory Comput. Syst. 39(6), 929–939 (2006)
Bali, P., Kalavri, V.: Streaming graph analytics framework design (2015). http://urn.kb.se/resolve
Donnelly, G.: Super-useful Facebook statistics for (75) (2018)
Fineschi, S., et al.: Metis: a novel coronagraph design for the solar orbiter mission. Proc. SPIE - Int. Soc. Opt. Eng. 8443(8), 457–469 (2012)
Gonzalez, J.E., Low, Y., Gu, H., Bickson, D., Guestrin, C.: Powergraph: distributed graph-parallel computation on natural graphs. In: USENIX Conference on Operating Systems Design & Implementation (2012)
Hu, K., Zeng, G., Jiang, H., Wang, W.: Partitioning big graph with respect to arbitrary proportions in a streaming manner. Future Gener. Comput. Syst. 80, 1–11 (2018)
Jain, N., Liao, G., Willke, T.L.: Graphbuilder: scalable graph ETL framework. In: International Workshop on Graph Data Management Experiences & Systems (2013)
Kalnis, P., Awara, K., Jamjoom, H., Khayyat, Z.: Mizan: optimizing graph mining in large parallel systems. Technical report, King Abdullah University of Science and Technology (2012)
Leskovec, J., Kleinberg, J., Faloutsos, C.: Graphs over time: densification laws, shrinking diameters and possible explanations. In: Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, pp. 177–187. ACM (2005)
Low, Y., Bickson, D., Gonzalez, J., Guestrin, C., Kyrola, A., Hellerstein, J.M.: Distributed graphlab: a framework for machine learning and data mining in the cloud. Proc. VLDB Endow. 5(8), 716–727 (2012)
Malewicz, G., et al.: Pregel: a system for large-scale graph processing. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, pp. 135–146. ACM (2010)
Martella, C., Logothetis, D., Loukas, A., Siganos, G.: Spinner: scalable graph partitioning in the cloud. In: IEEE International Conference on Data Engineering (2017)
Mayer, C., Tariq, M.A., Mayer, R., Rothermel, K.: Graph: traffic-aware graph processing. IEEE Trans. Parallel Distrib. Syst. 29(6), 1289–1302 (2018)
Mofrad, M.H., Melhem, R., Hammoud, M.: Revolver: vertex-centric graph partitioning using reinforcement learning. In: 2018 IEEE 11th International Conference on Cloud Computing (CLOUD), pp. 818–821. IEEE (2018)
Nishimura, J., Ugander, J.: Restreaming graph partitioning: simple versatile algorithms for advanced balancing. In: ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (2013)
Page, L., Brin, S., Motwani, R., Winograd, T.: The pagerank citation ranking: bringing order to the web. Technical report, Stanford InfoLab (1999)
Petroni, F., Querzoni, L., Daudjee, K., Kamali, S., Iacoboni, G.: HDRF: stream-based partitioning for power-law graphs. In: Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pp. 243–252. ACM (2015)
Prabhakaran, V., Wu, M., Weng, X., McSherry, F., Zhou, L., Haradasan, M.: Managing large graphs on multi-cores with graph awareness. In: Presented as Part of the 2012 USENIX Annual Technical Conference (USENIX ATC 2012), pp. 41–52 (2012)
Rossi, R.A., Ahmed, N.K.: The network data repository with interactive graph analytics and visualization. In: Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (2015). http://networkrepository.com
Stanton, I., Kliot, G.: Streaming graph partitioning for large distributed graphs. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1222–1230. ACM (2012)
Thorup, M.: Undirected single-source shortest paths with positive integer weights in linear time. J. ACM (JACM) 46(3), 362–394 (1999)
Tsourakakis, C.: Streaming graph partitioning in the planted partition model. In: Proceedings of the 2015 ACM on Conference on Online Social Networks, pp. 27–35. ACM (2015)
Tsourakakis, C., Gkantsidis, C., Radunovic, B., Vojnovic, M.: Fennel: streaming graph partitioning for massive scale graphs. In: Proceedings of the 7th ACM International Conference on Web Search and Data Mining, pp. 333–342. ACM (2014)
Xie, C., Yan, L., Li, W.J., Zhang, Z.: Distributed power-law graph computing: theoretical and empirical analysis. In: International Conference on Neural Information Processing Systems (2014)
Xin, R.S., Gonzalez, J.E., Franklin, M.J., Stoica, I.: GraphX: a resilient distributed graph system on spark. In: First International Workshop on Graph Data Management Experiences and Systems, p. 2. ACM (2013)
Yang, J., Leskovec, J.: Defining and evaluating network communities based on ground-truth. In: IEEE International Conference on Data Mining (2012)
Yin, H., Benson, A.R., Leskovec, J., Gleich, D.F.: Local higher-order graph clustering. In: ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (2017)
Zheng, A., Labrinidis, A., Chrysanthis, P.K., Lange, J.: Argo: architecture-aware graph partitioning. In: IEEE International Conference on Big Data (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Hu, C., Zhong, J., Li, Q., Li, Q. (2020). DETER: Streaming Graph Partitioning via Combined Degree and Cluster Information. In: Wen, S., Zomaya, A., Yang, L. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2019. Lecture Notes in Computer Science(), vol 11944. Springer, Cham. https://doi.org/10.1007/978-3-030-38991-8_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-38991-8_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-38990-1
Online ISBN: 978-3-030-38991-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)