Abstract
MapReduce is a parallel computing programming model designed to process large-scale data. Therefore, the accuracy and efficiency for computing are needed to be assured and speculative execution is an efficient method for calculation of fault tolerance. It reaches the goals of shortening the execution time and increasing the cluster throughput through selecting slow tasks and speculative copy these tasks on a fast machine to be executed. Hadoop naïve speculative execution strategy assumes that the cluster is homogeneous, and this assumption leads to the poor performance in heterogeneous environment. Several speculative execution strategies which aim to improve the MapReduce Performance in the heterogeneous environments are reviewed in this paper like LATE, MCP, ex-MCP and ERUL, then the comparison between these methods are listed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Armbrust, M., Fox, A., Griffith, R., Joseph, A., Katz, R., Konwinski, A., Zaharia, M.: A view of cloud computing. Commun. ACM 53(4), 50–58 (2010)
Ghemawat, S., Gobioff, H., Leung, S.T.: The Google file system. ACM SIGOPS Oper. Syst. Rev. 37(5), 29–43 (2003)
Dean, J., Ghemawa, S.: MapReduce: simplified data processing on large clusters. Proc. Oper. Syst. Des. Implement. 51(1), 107–113 (2004)
Chang, F., Dean, J., Ghemawa, S.: A distributed storage system for structured data. ACM Trans. Comput. Syst. 26(2), 1–26 (2008)
Apache Hadoop (2013). http://Hadoop.Apache.Org/
Vijayalakshmi, B., Ravi, P.R.: The down of big Data-Hbase. In: IEEE 2014 Conference on IT in Business, Industry and Government (2014)
Apache Pig (2014). http://pig.apache.org/
Apache Hive (2014). https://hive.apache.org/
Xia, Z.H., Wang, X.H., Sun, X.H., Wang, Q.: A secure and dynamic multi-keyword ranked search scheme over encrypted cloud data. IEEE Trans. Parallel Distrib. Syst. 27(2), 340–352 (2015)
Fu, Z.J., Ren, K., Shu, J.G., Sun, X.M.: Enabling personalized search over encrypted outsourced data with efficiency improvement. IEEE Trans. Parallel Distrib. Syst. (in press)
Fu, Z.J., Sun, X.M., Li, Q., Zhou, L., Shu, J.G.: Achieving efficient cloud search services: multi-keyword ranked search over encrypted cloud data supporting parallel computing. IEICE Trans. Commun. E98-B(1), 190–200 (2015)
Yoo, D.G., Sim, K.M.: A comparative review of job scheduling for MapReduce. In: IEEE International Conference on Cloud Computing and Intelligence Systems (CCIS), pp. 353–358. IEEE (2011)
Isard, M., Budiu, M., Yu, Y., Birrel, A., Fetterly, D.: Dryad: distributed data-parallel programs from sequential building blocks. Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems, pp. 59–72. ACM (2007)
Nenavath, S.N., Atul, N.: A review of adaptive approaches to MapReduce scheduling in heterogeneous environments. In: International Conference on Advances in Computing, Communications and Informatics, pp. 677–683. IEEE (2014)
Zaharia, M., Konwinski, A., Joseph, A., Katz, R., Stoica, I.: Improving MapReduce performance in heterogeneous environments. Proceedings of the 8th USENIX Conference on Operating Systems Design and Implementation (OSDI), pp. 29–42 (2008)
Chen, Q., Liu, C., Xiao, Z.: Improving MapReduce performance using smart speculative execution strategy. IEEE Trans. Comput. 63(4), 954–967 (2014)
Huang, X., Zhang, L.X., Li, R.F., Wan, L.J., Li, K.Q.: Novel heuristic speculative execution strategies in heterogeneous distributed environments. Comput. Electr. Eng. 50, 166–179 (2015)
Wu, H.C., Li, K., Tang, Z., Zhang, L.: A heuristic speculative execution strategy in heterogeneous distributed environments. In: 2014 Sixth International Symposium on Parallel Architectures, Algorithms and Programming (PAAP), pp. 268–273 (2014)
Acknowledgements
This work is supported by the NSFC (61300238, 61300237, 61232016, 1405254, 61373133), Marie Curie Fellowship (701697-CAR-MSCA-IFEF-ST), Basic Research Programs (Natural Science Foundation) of Jiangsu Province (BK20131004) and the PAPD fund.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing AG
About this paper
Cite this paper
Liu, Q., Jin, D., Liu, X., Linge, N. (2016). A Survey of Speculative Execution Strategy in MapReduce. In: Sun, X., Liu, A., Chao, HC., Bertino, E. (eds) Cloud Computing and Security. ICCCS 2016. Lecture Notes in Computer Science(), vol 10039. Springer, Cham. https://doi.org/10.1007/978-3-319-48671-0_27
Download citation
DOI: https://doi.org/10.1007/978-3-319-48671-0_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48670-3
Online ISBN: 978-3-319-48671-0
eBook Packages: Computer ScienceComputer Science (R0)