Abstract
Analyzing large data sets is gaining more importance because of its wide variety of applications in parallel and distributed environment. Hadoop environment gives more flexibility to programmers in parallel computing. One of the advantages of Hadoop is query evaluation over large datasets. Join operations in query evaluation plays a major role over the large data. This paper Ferret outs the earlier solutions, prolongs them and recommends a new approach for the implementation of joins in Hadoop.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
J. Dean, S. Ghemawat, Mapreduce: simplified data processing on large clusters, in Design and Implementation 6th Symposium on Operating Systems, ACM, pp. 137–150 2004
Y. Mao, R. Morris, F. Kaashoek, Optimizing MapReduce for Multicore Architectures (Massachusetts Institute of Technology, Cambridge)
Thesis on Performance Analysis and Optimization of Left Outer Join on Map Side, Ming Hao, Stavanger, 15th June 2012
S. Blanas, J.M. Patel, V. Ercegovac, J. Rao,E.J. Shekita, Y. Tian, A comparison of joinalgorithms for log processing in MaPreduce, in Proceedings of the 2010 International Conference on Management of Data (2010) pp. 975–986
A. Abouzeid, K. Bajda-Pawlikowski, D. Abadi, A. Silberschatz, A. Rasin, Hadoopdb, An architectural hybrid of MapReduce and dbms technologies for analytical workloads, in VLDB, 2009
K.H. Lee, Y.J. Lee, H. Choi, Y.D. Chung, parallel Data Processing with MapReduce: a Survey, Department of Computer Science, Department of Computer Science and Engineering (Korea University in KAIST)
V. Jadhav1, J. Aghav, S. Dorwani2, Join algorithms using mapreduce a surveyn, in International Conference on Electrical Engineering and Computer Science, 21 Apr 2013
Binary Theta-Joins using MapReduce: Efficiency Analysis and Improvements, Ioannis K. Koumarelas, Athanasios Naskos, Anastasios Gounaris, Dept. of Informatics, Aristotle University
J. Dean, S. Ghemawat, Mapreduce: simplified data processing on large clusters. Commun. ACM 51(1), 107–113 (2008)
Thesis in Implementation and Analysis of Join Algorithms to handle skew for the Hadoop MapReduce Framework, Fariha Atta, University of Eidenburgh 2010
Minimal MapReduce Algorithms, Yufei Tao, 1Chinese University of Hong Kong, Hong Kong, Wenqing Lin, Korea Advanced Institute of Science and Technology, Korea, Xiaokui Xiao, Nanyang Technological University, Singapore
K. Palla, A comparative analysis of join algorithms using the hadoop MapReduce framework. Master’s thesis, MSc Informatics, School of Informatics, University of Edinburgh (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Pagadala, P.K., Vikram, M., Eswarawaka, R., Reddy, P.S. (2017). Join Operations to Enhance Performance in Hadoop MapReduce Environment. In: Satapathy, S., Bhateja, V., Udgata, S., Pattnaik, P. (eds) Proceedings of the 5th International Conference on Frontiers in Intelligent Computing: Theory and Applications . Advances in Intelligent Systems and Computing, vol 516. Springer, Singapore. https://doi.org/10.1007/978-981-10-3156-4_51
Download citation
DOI: https://doi.org/10.1007/978-981-10-3156-4_51
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3155-7
Online ISBN: 978-981-10-3156-4
eBook Packages: EngineeringEngineering (R0)