Map-Side Join Processing of SPARQL Queries Based on Abstract RDF Data Filtering

Minjae Song, Hyunsuk Oh, Seungmin Seo, Kyong-Ho Lee
Copyright: © 2019 | Volume: 30 | Issue: 1 | Pages: 19
ISSN: 1063-8016 | EISSN: 1533-8010 | EISBN13: 9781522563785 | DOI: 10.4018/JDM.2019010102
Cite Article

MLA

Song, Minjae, et al. "Map-Side Join Processing of SPARQL Queries Based on Abstract RDF Data Filtering." JDM vol.30, no.1 2019: pp.22-40. http://doi.org/10.4018/JDM.2019010102

APA

Song, M., Oh, H., Seo, S., & Lee, K. (2019). Map-Side Join Processing of SPARQL Queries Based on Abstract RDF Data Filtering. Journal of Database Management (JDM), 30(1), 22-40. http://doi.org/10.4018/JDM.2019010102

Chicago

Song, Minjae, et al. "Map-Side Join Processing of SPARQL Queries Based on Abstract RDF Data Filtering," Journal of Database Management (JDM) 30, no.1: 22-40. http://doi.org/10.4018/JDM.2019010102

Abstract

The amount of RDF data published on the Web is increasing rapidly. MapReduce-based distributed frameworks have become the prevailing approach to processing SPARQL queries against RDF data. However, existing MapReduce-based query processing systems have not kept up with the growth of semantically annotated data, so SPARQL query processing remains non-interactive. The principal reason is that the intermediate results of join operations in a MapReduce framework are so large that they consume all available network bandwidth. In this article, the authors present an efficient SPARQL processing system that uses MapReduce and HBase. The system runs a job-optimized query plan that uses the proposed abstract RDF data to reduce both the number of jobs and the amount of input data. The authors also present an efficient algorithm that performs map-side joins while using the abstract RDF data to filter out unneeded RDF data. Experimental results show that the proposed approach outperforms previous work when processing queries with a large amount of input data.
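
The general idea of a filtered map-side join can be illustrated with a small, single-process sketch. This is not the authors' algorithm: the abstract does not define the structure of the abstract RDF data, so the sketch assumes a hypothetical summary that maps each predicate to the set of subjects using it. The triples, the function names `build_summary` and `map_side_join`, and the two-pattern query are all invented for illustration, and no MapReduce or HBase API is used.

```python
from collections import defaultdict

# Toy RDF triples (subject, predicate, object); invented for illustration.
TRIPLES = [
    ("alice", "worksAt", "acme"),
    ("bob", "worksAt", "globex"),
    ("carol", "worksAt", "acme"),
    ("alice", "livesIn", "seoul"),
    ("carol", "livesIn", "busan"),
    ("dave", "livesIn", "seoul"),
]


def build_summary(triples):
    """Hypothetical stand-in for 'abstract RDF data': predicate -> set of subjects."""
    summary = defaultdict(set)
    for s, p, _ in triples:
        summary[p].add(s)
    return summary


def map_side_join(triples, left_pred, right_pred, summary):
    """Join '?s left_pred ?x . ?s right_pred ?y' without a shuffle or reduce step."""
    # Assumption: only subjects present under both predicates can join,
    # so everything else is filtered out before any join work happens.
    candidates = summary[left_pred] & summary[right_pred]

    # Build an in-memory hash table from the filtered right-hand side.
    table = defaultdict(list)
    for s, p, o in triples:
        if p == right_pred and s in candidates:
            table[s].append(o)

    # Stream the left-hand side and probe the table, as a mapper would.
    results = []
    for s, p, o in triples:
        if p == left_pred and s in candidates:
            for other in table[s]:
                results.append((s, o, other))
    return results


summary = build_summary(TRIPLES)
rows = map_side_join(TRIPLES, "worksAt", "livesIn", summary)
print(rows)
```

In this toy run the triples for `bob` and `dave` are dropped by the summary check before the join, which mirrors the abstract's point that filtering reduces the input a join has to handle. Because the smaller side is held in memory and probed directly, no intermediate results need to cross the network.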
