An effective framework for asynchronous incremental graph processing

Lv, Xinqiao; Xiao, Wei; Zhang, Yu; Liao, Xiaofei; Jin, Hai; Hua, Qiangsheng

doi:10.1007/s11704-018-7443-z

An effective framework for asynchronous incremental graph processing

Research Article
Published: 30 August 2018

Volume 13, pages 539–551, (2019)
Cite this article

Frontiers of Computer Science Aims and scope Submit manuscript

Xinqiao Lv¹,
Wei Xiao¹,
Yu Zhang¹,
Xiaofei Liao¹,
Hai Jin¹ &
…
Qiangsheng Hua¹

98 Accesses
15 Citations
Explore all metrics

Abstract

Although many graph processing systems have been proposed, graphs in the real-world are often dynamic. It is important to keep the results of graph computation up-to-date. Incremental computation is demonstrated to be an efficient solution to update calculated results. Recently, many incremental graph processing systems have been proposed to handle dynamic graphs in an asynchronous way and are able to achieve better performance than those processed in a synchronous way. However, these solutions still suffer from suboptimal convergence speed due to their slow propagation of important vertex state (important to convergence speed) and poor locality. In order to solve these problems, we propose a novel graph processing framework. It introduces a dynamic partition method to gather the important vertices for high locality, and then uses a priority-based scheduling algorithm to assign them with a higher priority for an effective processing order. By such means, it is able to reduce the number of updates and increase the locality, thereby reducing the convergence time. Experimental results show that our method reduces the number of updates by 30%, and reduces the total execution time by 35%, compared with state-of-the-art systems.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+

from $39.99 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Asynchronous Distributed Incremental Computation on Evolving Graphs

A survey on dynamic graph processing on GPUs: concepts, terminologies and systems

Article 16 December 2023

Dynamic Graph Repartitioning: From Single Vertex to Vertex Group

References

Baluja S, Seth R, Sivakumar D, Jing Y S, Yagnik J, Kumar S, Ravichandran D, Aly M. Video suggestion and discovery for youtube: taking random walks through the view graph. In: Proceedings of the 17th International Conference on World Wide Web. 2008, 895–904
Chapter Google Scholar
Wang P, Xu B W, Wu Y R, Zhou X Y. Link prediction in social networks: the state-of-the-art. Science China Information Sciences, 2015, 58(1): 1–38
Google Scholar
Shang X Q, Wang Y, Chen B L. Identifying essential proteins based on dynamic protein-protein interaction networks and RNA-seq datasets. Science China Information Sciences, 2016, 59(7): 1–11
Article Google Scholar
Bhatotia P, Wieder A, Rodrigues R, Acar U A, Pasquin R. Incoop: mapreduce for incremental computations. In: Proceedings of the 2nd ACM Symposium on Cloud Computing. 2011, 1–14
Google Scholar
Zhang Y F, Gao Q X, Gao L X, Wang C R. iMapreduce: a distributed computing framework for iterative computation. In: Proceedings of the 25th IEEE International Symposium on Parallel and Distributed Processing. 2011, 1112–1121
Google Scholar
Zhang Y F, Chen S M, Wang Q, Yu G. i2Mapreduce: incremental mapreduce for mining evolving big data. In: Proceedings of the 32nd IEEE International Conference on Data Engineering. 2016, 1482–1483
Google Scholar
Yin J T, Gao L X. Asynchronous distributed incremental computation on evolving graphs. In: Proceedings of the 2016 Machine Learning and Knowledge Discovery in Databases. 2016, 722–738.
Chapter Google Scholar
Zhang Y F, Gao Q X, Gao L X, Wang C R. Maiter: an asynchronous graph processing framework for delta-based accumulative iterative computation. IEEE Transactions on Parallel & Distributed Systems, 2014, 25(8): 2091–2100
Article Google Scholar
Mihaylov S R, Ives Z G, Guha S. Rex: recursive, delta-based datacentric computation. Proceedings of the VLDB Endowment, 2012, 5(11): 1280–1291
Article Google Scholar
Popa L, Budiu M, Yu Y, Isard M. Dryadinc: reusing work in large-scale computations. In: Proceedings of the 2009 Conference on Hot Topics in Cloud Computing. 2009, 1–5
Google Scholar
Cheng R, Hong J, Kyrola A, Miao Y S, Weng X T, Wu M, Yang F, Zhou L D, Zhao F, Chen E H. Kineograph: taking the pulse of a fastchanging and connected world. In: Proceedings of the 7th ACM European Conference on Computer Systems. 2012, 85–98
Google Scholar
Murray D G, Mcsherry F, Isaacs R, Isard M, Barham P. Naiad: a timely dataflow system. In: Proceedings of the 24th ACM SIGOPS Symposium on Operating Systems Principles. 2013, 439–455
Google Scholar
Gonzalez J E, Low Y C, Gu H J, Bickson D, Guestrin C. Powergraph: distributed graph-parallel computation on natural graphs. In: Proceedings of the 10th USENIX Symposium on Operating Systems Design and Implementation. 2012, 17–30
Google Scholar
Verma S, Leslie L M, Shin Y, Gupta I. An experimental comparison of partitioning strategies in distributed graph processing. Proceedings of the VLDB Endowment, 2017, 10(5): 493–504
Article Google Scholar
Karypis G, Kumar V. Multilevel graph partitioning schemes. In: Proceedings of the 1995 International Conference on Parallel Processing. 1995, 113–122
Google Scholar
Kwak H, Lee C, Park H, Moon S. What is twitter, a social network or a news media? In: Proceedings of the 19th International Conference on World Wide Web. 2010, 591–600
Chapter Google Scholar
Zaharia M, Chowdhury M, Franklin M J, Shenker S, Stoica I. Spark: cluster computing with working sets. In: Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing. 2010, 1–10
Google Scholar
Low Y C, Gonzalez J E, Kyrola A, Bickson D, Guestrin C E, Hellerstein J. Graphlab: a new framework for parallel machine learning. In: Proceedings of the 26th Conference on Uncertainty in Artificial Intelligence. 2010, 1–10
Google Scholar
Power R, Li J Y. Piccolo: building fast, distributed programs with partitioned tables. In: Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation. 2010, 1–14
Google Scholar
Bu Y Y, Howe B, Balazinska M, Ernst M D. Haloop: efficient iterative data processing on large clusters. Proceedings of the VLDB Endowment, 2010, 3(1): 285–296
Article Google Scholar
Ekanayake J, Li H, Zhang B J, Gunarathne T, Bae S H, Qiu J, Fox G. Twister: a runtime for iterative mapreduce. In: Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing. 2010, 810–818
Chapter Google Scholar
Malewicz G, Austern M H, Bik A J, Dehnert J C, Horn I, Leiser N, Czajkowski G. Pregel: a system for large-scale graph processing. In: Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data. 2010, 135–146
Chapter Google Scholar
Roy A, Mihailovic I, Zwaenepoel W. X-stream: edge-centric graph processing using streaming partitions. In: Proceedings of the 24th ACM Symposium on Operating Systems Principles. 2013, 472–488
Google Scholar
Tian Y Y, Balmin A, Corsten S A, Tatikonda S, McPherson J. From “think like a vertex” to “think like a graph”. Proceedings of the VLDB Endowment, 2013, 7(3): 193–204
Article Google Scholar
Yan D, Cheng J, Lu Y, Ng W. Effective techniques for message reduction and load balancing in distributed graph computation. In: Proceedings of the 24th International Conference on World Wide Web. 2015, 1307–1317
Chapter Google Scholar
Salihoglu S, Widom J. GPS: a graph processing system. In: Proceedings of the 2013 Conference on Scientific and Statistical Database Management. 2013, 1–12
Google Scholar
Kyrola A, Blelloch G, Guestrin C. Graphchi: large-scale graph computation on just a pc. In: Proceedings of the 10th USENIX Symposium on Operating Systems Design and Implementation. 2012, 31–46
Google Scholar
Xie W L, Wang G Z, Bindel D, Demers A, Gehrke J. Fast iterative graph computation with block updates. Proceedings of the VLDB Endowment, 2013, 6(14): 2014–2025
Article Google Scholar
Yuan P P, Zhang W Y, Xie C F, Jin H, Liu L, Lee K. Fast iterative graph computation: a path centric approach. In: Proceedings of the 2014 International Conference for High Performance Computing, Networking, Storage and Analysis. 2015, 401–412
Google Scholar
Xie C N, Chen R, Guan H B, Zang B Y, Chen H B. SYNC or ASYNC: time to fuse for distributed graph-parallel computation. In: Proceedings of the 20th ACM Sigplan Symposium on Principles and Practice of Parallel Programming. 2015, 194–204
Google Scholar
Tsourakakis C, Gkantsidis C, Radunovic B, Vojnovic M. Fennel: streaming graph partitioning for massive scale graphs. In: Proceedings of the 7th ACM International Conference onWeb Search and Data Mining. 2014, 333–342
Google Scholar
Nishimura J, Ugander J. Restreaming graph partitioning: simple versatile algorithms for advanced balancing. In: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2013, 1106–1114
Chapter Google Scholar
Abdolrashidi A, Ramaswamy L. Continual and cost-effective partitioning of dynamic graphs for optimizing big graph processing systems. In: Proceedings of the 2016 IEEE International Congress on Big Data. 2016, 18–25
Chapter Google Scholar

Download references

Acknowledgements

This paper is supported by the National Natural Science Foundation of China (Grant No. 61702202), China Postdoctoral Science Foundation Funded Project (2017M610477 and 2017T100555).

Author information

Authors and Affiliations

Service Computing Technology and System Lab, Cluster and Grid Computing Lab School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, 430074, China
Xinqiao Lv, Wei Xiao, Yu Zhang, Xiaofei Liao, Hai Jin & Qiangsheng Hua

Authors

Xinqiao Lv
View author publications
Search author on:PubMed Google Scholar
Wei Xiao
View author publications
Search author on:PubMed Google Scholar
Yu Zhang
View author publications
Search author on:PubMed Google Scholar
Xiaofei Liao
View author publications
Search author on:PubMed Google Scholar
Hai Jin
View author publications
Search author on:PubMed Google Scholar
Qiangsheng Hua
View author publications
Search author on:PubMed Google Scholar

Corresponding author

Correspondence to Yu Zhang.

Additional information

Xinqiao Lv received his PhD degree in computer science and engineering from Huazhong University of Science and Technology (HUST), China. He is now an associate professor in school of Computer Science and Engineering at HUST. His main research interests include big data processing, cloud computing and distributed systems.

Wei Xiao received a BE degree in computer science from Huazhong University of Science and Technology (HUST), China in 2015. He is currently a master in school of computer science at HUST. His research interests include graph processing and cloud computing.

Yu Zhang received a PhD degree in computer science from Huazhong University of Science and Technology (HUST), China in 2016. He is now a postdoctor in school of computer science at HUST. His research interests include big data processing, system software and architecture. His current topic mainly focuses on application-driven big data processing and optimizations.

Xiaofei Liao received a PhD degree in computer science and engineering from Huazhong University of Science and Technology (HUST), China in 2005. He is now a professor in school of Computer Science and Engineering at HUST. His research interests are in the areas of system virtualization, system software, and Cloud computing.

Hai Jin is a Cheung Kung Scholars Chair Professor of computer science and engineering at Huazhong University of Science and Technology (HUST) in China. Jin received his PhD in computer engineering from HUST in 1994. In 1996, he was awarded a German Academic Exchange Service fellowship to visit the Technical University of Chemnitz in Germany. Jin worked at The University of Hong Kong between 1998 and 2000, and as a visiting scholar at the University of Southern California between 1999 and 2000. He was awarded Excellent Youth Award from the National Science Foundation of China in 2001. Jin is the chief scientist of ChinaGrid, the largest grid computing project in China, and the chief scientists of National 973 Basic Research Program Project of Virtualization Technology of Computing System, and Cloud Security. Jin is a fellow of CCF, senior member of the IEEE and a member of the ACM. He has co-authored 22 books and published over 800 research papers. His research interests include computer architecture, virtualization technology, cluster computing and cloud computing, peer-to-peer computing, network storage, and network security.

Qiangsheng Hua received the BE and ME degrees in 2001 and 2004, respectively, from the School of Information Science and Engineering, Central South University, China, and the PhD degree in 2009 from the Department of Computer Science, The University of Hong Kong, China. He is currently an associate professor in Huazhong University of Science and Technology, China. He is interested in parallel and distributed computing, including algorithms and implementations in real systems.

Electronic supplementary material

Supplementary material, approximately 376 KB.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lv, X., Xiao, W., Zhang, Y. et al. An effective framework for asynchronous incremental graph processing. Front. Comput. Sci. 13, 539–551 (2019). https://doi.org/10.1007/s11704-018-7443-z

Download citation

Received: 21 December 2017
Accepted: 13 April 2018
Published: 30 August 2018
Issue Date: June 2019
DOI: https://doi.org/10.1007/s11704-018-7443-z

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+

from $39.99 /Month

Starting from 10 chapters or articles per month
Access and download chapters and articles from more than 300k books and 2,500 journals
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

An effective framework for asynchronous incremental graph processing

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Asynchronous Distributed Incremental Computation on Evolving Graphs

A survey on dynamic graph processing on GPUs: concepts, terminologies and systems

Dynamic Graph Repartitioning: From Single Vertex to Vertex Group

Explore related subjects

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Supplementary material, approximately 376 KB.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now