skip to main content
10.1145/2764947.2764949acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

Large-Scale BSP Graph Processing in Distributed Non-Volatile Memory

Published: 31 May 2015 Publication History

Abstract

Processing large graphs is becoming increasingly important for many domains. Large-scale graph processing requires a large-scale cluster system, which is very expensive. Thus, for high-performance large-scale graph processing in small clusters, we have developed bulk synchronous parallel graph processing in distributed non-volatile memory that has lower bit cost, lower power consumption, and larger capacity than DRAM. When non-volatile memory is used, accessing non-volatile memory is a performance bottleneck because accesses to non-volatile memory are fine-grained random accesses and non-volatile memory has much larger latency than DRAM. Thus, we propose non-volatile memory group access method and the implementation for using non-volatile memory efficiently. Proposed method and implementation improve the access performance to non-volatile memory by changing fine-grained random accesses to random accesses the same size as a non-volatile memory page and hiding non-volatile memory latency with pipelining. An evaluation indicated that the proposed graph processing can hide the latency of non-volatile memory and has the proportional performance to non-volatile memory bandwidth. When non-volatile memory read/write mixture bandwidth is 4.2 GB/sec, the performance of proposed graph processing and the performance storing all data in main memory have the same order of magnitude (46%). In addition, the proposed graph processing had scalable performance for any number of nodes. The proposed method and implementation can process 125 times bigger graph than a DRAM-only system.

References

[1]
Apache Hadoop, hadoop.apache.org.
[2]
Apache Spark, spark.apache.org
[3]
Leiser and G. Czajkowski, "Pregel: a system for large-scale graph processing," in Proc. ACM/SIGMOD International Conference on Management of Data (SIGMOD), Indianapolis, USA, June 6--11, 2004.
[4]
Apache Giraph, giraph.apache.org.
[5]
Y. Low, J. Gonzalez, A. Kyrola, D. Bickson, and C. Guestrin, "Graphlab: A distributed framework for machine learning in the cloud," Arxiv preprint arXiv:1107.0922, 2011.
[6]
GraphX, spark.apache.org/graphx
[7]
L. Page, S. Brin, R. Motwani, and T. Winograd. The pagerank citation ranking: Bringing order to the web. Technical Report 1999--66, Stanford InfoLab, November 1999.
[8]
D. Chakrabarti, Y. Zhan, and C. Faloutsos, "R-MAT: A recursive model for graph mining," in Fourth SIAM International Conference on Data Mining, April 2004.
[9]
A. Kyrola, G. Glelloch, and C. Guestrin, "GraphChi: Large-Scale Graph Computation on Just aPC," in Proc. 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI), Hollywood, USA, October 8--10, 2012.
[10]
A.Roy, I.Mihailovic, and W.Zwaenepoel, "X-Stream: edge-centric graph processing using streaming partitions," in Proc. 24th ACM Symposium on Operating Systems Principles (SOSP), Pennsylvania, USA, Nov 3--6, pp. 472--488, 2013.
[11]
R. Pearce, M. Gokhale, and N. M. Amato, "Multithreaded Asynchronous Graph Traversal for In-Memory and Semi-External Memory," in Proc. the International Conference for High Performance Computing, Networking, Storage and Analysis (SuperComputing), New Orleans, USA, Nov. 13--19, 2010.
[12]
R. Pearce, M. Gokhale, and N. M. Amato, "Scaling Techniques for Massive Scale-Free Graphs in Distributed (External) Memory," in Proc. 27th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Boston, USA, May 20--24, 2013.

Cited By

View all
  • (2018)Efficient Data-Allocation Scheme for Eliminating Garbage Collection During Analysis of Big Graphs Stored in NAND Flash MemoryIEEE Transactions on Computers10.1109/TC.2017.277562467:5(646-657)Online publication date: 1-May-2018
  • (2017)A fast non-volatile memory aware algorithm for generating random scale-free networks2017 IEEE International Conference on Big Data (Big Data)10.1109/BigData.2017.8257994(787-796)Online publication date: Dec-2017

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
GRADES'15: Proceedings of the GRADES'15
May 2015
54 pages
ISBN:9781450336116
DOI:10.1145/2764947
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 May 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. big data
  2. distributed computing
  3. non-volatile memory
  4. parallel algorithms

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

SIGMOD/PODS'15
Sponsor:
SIGMOD/PODS'15: International Conference on Management of Data
May 31 - June 4, 2015
VIC, Melbourne, Australia

Acceptance Rates

Overall Acceptance Rate 29 of 61 submissions, 48%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 08 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2018)Efficient Data-Allocation Scheme for Eliminating Garbage Collection During Analysis of Big Graphs Stored in NAND Flash MemoryIEEE Transactions on Computers10.1109/TC.2017.277562467:5(646-657)Online publication date: 1-May-2018
  • (2017)A fast non-volatile memory aware algorithm for generating random scale-free networks2017 IEEE International Conference on Big Data (Big Data)10.1109/BigData.2017.8257994(787-796)Online publication date: Dec-2017

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media