
A stream partitioning approach to processing large scale distributed graph datasets


Abstract:

RDF datasets are an important source of big data, but many of them are too large to fit on a single machine. One approach is to partition the RDF graph across multiple machines, with each component residing on a single machine. A poor partition can incur significant communication costs, however, if many queries end up involving multiple machines. A number of existing partitioning schemes seek to reduce these costs by finding partitions that avoid cutting edges in the RDF graph. While these schemes can find good partitions, the partitioning process itself is often not very scalable and cannot handle incrementally generated RDF data. In this paper, we develop a more scalable, effective, and low-complexity approach, online graph dataset partitioning, which produces high-quality dataset partitions with fewer links between partitions. We show experimentally that it works well in reducing the communication cost of query processing while also improving the scalability of the partitioning itself.
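
To make the idea of partitioning a graph as it streams in more concrete, here is a minimal Python sketch of a generic greedy streaming heuristic. It is not the algorithm from the paper, whose details the abstract does not give. The function name `stream_partition`, the `capacity` parameter, and the placement rule (co-locate a new vertex with its already-placed neighbour if there is room, otherwise use the least-loaded partition) are all assumptions made for illustration.

```python
from typing import Dict, Iterable, List, Optional, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def stream_partition(
    triples: Iterable[Triple], k: int, capacity: int
) -> Tuple[Dict[str, int], int]:
    """Assign vertices to k partitions in one pass over a triple stream.

    Hypothetical illustration only. Returns (vertex -> partition index,
    number of edges whose endpoints ended up in different partitions).
    """
    assignment: Dict[str, int] = {}
    load: List[int] = [0] * k
    cut = 0

    def place(vertex: str, preferred: Optional[int]) -> None:
        # Prefer the neighbour's partition while it still has room;
        # otherwise fall back to the currently least-loaded partition.
        if preferred is not None and load[preferred] < capacity:
            target = preferred
        else:
            target = min(range(k), key=lambda i: load[i])
        if load[target] >= capacity:
            raise ValueError("all partitions are full")
        assignment[vertex] = target
        load[target] += 1

    for subject, _predicate, obj in triples:
        if subject not in assignment:
            place(subject, assignment.get(obj))
        if obj not in assignment:
            place(obj, assignment.get(subject))
        if assignment[subject] != assignment[obj]:
            cut += 1  # this edge links two partitions

    return assignment, cut
```

Each triple is inspected once and each placement scans only the `k` partition loads, so the sketch handles incrementally arriving data. The trade-off is that early placements are never revisited, which an offline partitioner could avoid at the cost of seeing the whole graph.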

Published in: 2013 IEEE International Conference on Big Data

Date of Conference: 06-09 October 2013

Date Added to IEEE Xplore: 23 December 2013

Electronic ISBN: 978-1-4799-1293-3

DOI: 10.1109/BigData.2013.6691619

Conference Location: Silicon Valley, CA, USA
