Abstract
RDF is increasingly being used to encode data for the semantic web and data exchange. There have been a large number of studies that address RDF data management over different distributed platforms. In this paper we provide an overview of these studies. This paper divide the studies of existing distributed RDF systems into two categories: partitioning-based approaches and cloud-based approaches. We also introduce a partition-tolerant distributed RDF system, gStore\(^D\).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abdelaziz, I., Harbi, R., Khayyat, Z., Kalnis, P.: A survey and experimental comparison of distributed SPARQL engines for very large RDF data. PVLDB 10(13), 2049–2060 (2017)
Google: Freebase data dumps (2017)
He, L., et al.: Stylus: a strongly-typed store for serving massive RDF data. PVLDB 11(2), 203–216 (2017)
Huang, J., Abadi, D.J., Ren, K.: Scalable SPARQL querying of large RDF graphs. PVLDB 4(11), 1123–1134 (2011)
Kaoudi, Z., Manolescu, I.: RDF in the clouds: a survey. VLDB J. 24(1), 67–91 (2015)
Karypis, G., Kumar, V.: Multilevel graph partitioning schemes. In: ICPP, pp. 113–122 (1995)
Lehmann, J., et al.: DBpedia - a large-scale, multilingual knowledge base extracted from Wikipedia. Semant. Web 6(2), 167–195 (2015)
Madkour, A., Aly, A.M., Aref, W.G.: WORQ: workload-driven RDF query processing. In: Vrandečić, D., et al. (eds.) ISWC 2018. LNCS, vol. 11136, pp. 583–599. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00671-6_34
Mahdisoltani, F., Biega, J., Suchanek, F.M.: YAGO3: a knowledge base from multilingual Wikipedias (2015)
Peng, P., Zou, L., Chen, L., Zhao, D.: Query workload-based RDF graph fragmentation and allocation. In: EDBT, pp. 377–388 (2016)
Peng, P., Zou, L., Chen, L., Zhao, D.: Adaptive distributed RDF graph fragmentation and allocation based on query workload. IEEE Trans. Knowl. Data Eng. 31(4), 670–685 (2019)
Peng, P., Zou, L., Guan, R.: Accelerating partial evaluation in distributed SPARQL query evaluation. In: ICDE, pp. 112–123 (2019)
Peng, P., Zou, L., Özsu, M.T., Chen, L., Zhao, D.: Processing SPARQL queries over distributed RDF graphs. VLDB J. 25(2), 243–268 (2016)
Schätzle, A., Przyjaciel-Zablocki, M., Skilevic, S., Lausen, G.: S2RDF: RDF querying with SPARQL on spark. PVLDB 9(10), 804–815 (2016)
Shao, B., Wang, H., Li, Y.: Trinity: a distributed graph engine on a memory cloud. In: SIGMOD, pp. 505–516 (2013)
Wu, B., Zhou, Y., Yuan, P., Liu, L., Jin, H.: Scalable SPARQL querying using path partitioning. In: ICDE, pp. 795–806 (2015)
Wylot, M., Mauroux, P.: DiploCloud: efficient and scalable management of RDF data in the cloud. TKDE, PP(99) (2015)
Acknowledgment
This work was supported by NSFC under grant 61702171, Hunan Provincial Natural Science Foundation of China under grant 2018JJ3065, and the Fundamental Research Funds for the Central Universities.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Peng, P. (2019). Distributed Query Evaluation over Large RDF Graphs. In: Song, J., Zhu, X. (eds) Web and Big Data. APWeb-WAIM 2019. Lecture Notes in Computer Science(), vol 11809. Springer, Cham. https://doi.org/10.1007/978-3-030-33982-1_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-33982-1_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33981-4
Online ISBN: 978-3-030-33982-1
eBook Packages: Computer ScienceComputer Science (R0)