Abstract
Gupta et al. [11] studied the problem of handling multi-way spatial join queries on map-reduce platform and proposed the Controlled-Replicate algorithm for the same. In this paper we present ε-Controlled-Replicate - an improved Controlled-Replicate procedure for processing multi-way spatial join queries on map-reduce. We show that ε-Controlled-Replicate algorithm presented in this paper involves a significantly smaller communication cost vis-a-vis Controlled-Replicate. We discuss the details of ε-Controlled-Replicate algorithm and through an experimental study over synthetic as well as real-life California road datasets, we show the efficacy of the ε-Controlled-Replicate algorithm vis-a-vis Controlled-Replicate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Census 2000 Tiger/Line Data, http://www.esri.com/data/download/census2000-tigerline
Afrati, F.N., Ullman, J.D.: Optimizing joins in a map-reduce environment. In: EDBT (2011)
Bhatolia, P., et al.: HaLoop: Efficient Iterative Data Processing on Large Clusters. In: VLDB (2010)
Blanas, S., Patel, J.M., Ercegovac, V., Rao, J., Shekita, E.J., Tian, Y.: A comparison of join algorithms for log processing in map-reduce. In: SIGMOD (2010)
Brinkhoff, T., Kriegal, H., Seeger, B.: Parallel processing of spatial joins using R-trees. In: ICDE (1996)
Brinkhoff, T., Kriegal, H.P., Schneider, R., Seeger, B.: Multi-step processing of spatial joins. In: SIGMOD (1994)
Brinkhoff, T., Kriegal, H.P., Seeger, B.: Efficient processing of spatial joins using R-trees. In: SIGMOD (1993)
Dean, J., Ghemawat, S.: MapReduce: Simplified data processing on large clusters. Comm. of ACM 51(1) (2008)
Eltabakh, M.: et al. CoHadoop: Flexible Data Placement and its exploitation in Hadoop. In: VLDB (2011)
Gunther, O.: Efficient computation of spatial joins. In: ICDE (1993)
Gupta, H., Chawda, B., Negi, S., Faruquie, T., Subramaniam, L.V., Mohania, M.: Proceesing multi-way spatial joins on map-reduce. In: EDBT (2013)
Lo, M., Ravishankar, C.V.: Spatial hash joins. In: SIGMOD (1996)
Lo, M.L., Ravishankar, C.V.: Spatial joins using seeded trees. In: SIGMOD (1994)
Mamoulis, N., Papadias, D.: Multiway spatial joins. In: ACM Transaction on Database Systems (2001)
Okcan, A., Riedewald, M.: Processing theta-joins using mapreduce. In: SIGMOD (2011)
Papadias, D., Arkoumanis, D.: Approx processing of multiway spatial joins in very large databases. In: Jensen, C.S., Jeffery, K., Pokorný, J., Šaltenis, S., Bertino, E., Böhm, K., Jarke, M. (eds.) EDBT 2002. LNCS, vol. 2287, p. 179. Springer, Heidelberg (2002)
Patel, J., DeWitt, D.J.: Clone join and shadow join: Two parallel spatial join algorithms. In: ACM-GIS (2000)
Patel, J.M., DeWitt, D.J.: Partition based spatial-merge join. In: SIGMOD (1996)
Wang, K., Han, J., Tu, B., Dai, J., Zhu, W., Song, X.: Accelerating spatial data processing with map-reduce. In: ICPADS (2010)
Zhang, S., Han, J., Liu, Z., Wang, K., Feng, S.: Spatial queries evaluation with mapreduce. In: GCC (2009)
Zhang, S., Han, J., Liu, Z., Wang, K., Xu, Z.: Sjmr: Parallelizing spatial join with mapreduce on clusters. In: CLUSTER (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Gupta, H., Chawda, B. (2014). ε-Controlled-Replicate: An ImprovedControlled-Replicate Algorithm for Multi-way Spatial Join Processing on Map-Reduce. In: Benatallah, B., Bestavros, A., Manolopoulos, Y., Vakali, A., Zhang, Y. (eds) Web Information Systems Engineering – WISE 2014. WISE 2014. Lecture Notes in Computer Science, vol 8787. Springer, Cham. https://doi.org/10.1007/978-3-319-11746-1_20
Download citation
DOI: https://doi.org/10.1007/978-3-319-11746-1_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11745-4
Online ISBN: 978-3-319-11746-1
eBook Packages: Computer ScienceComputer Science (R0)