Adaptive Scheduling Algorithm for Hadoop Node Capability in Heterogeneous Resource Environment

Zheng, Ming; Zhuo, Mugui

doi:10.1007/978-3-030-15235-2_182

Ming Zheng¹⁹ &
Mugui Zhuo¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 928))

Included in the following conference series:

The International Conference on Cyber Security Intelligence and Analytics

79 Accesses

Abstract

In order to overcome the shortcomings of the existing scheduling allocation methods for Hadoop clusters in heterogeneous resource environments, an adaptive scheduling algorithm NCAS (node capacity adaptive scheduling) based on node capability is proposed. Firstly, NCAS algorithm calculates the scheduling factor according to the node performance and task characteristics; then, the scheduling factor determines the amount of data and task slots that each node should share; finally, the data and tasks are distributed more to fast nodes and less to slow nodes. The experimental results show that compared with the traditional scheduling algorithm, NCAS algorithm greatly reduces the number of backup tasks to start, significantly reduces job completion time, and improves the efficiency of task execution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Glushkova D, Jovanovic P, Abello A (2019) Mapreduce performance model for Hadoop 2.x. Inf Syst 79:32–43
Google Scholar
Qin P, Dai B, Huang BX et al (2017) Bandwidth-aware scheduling with SDN in Hadoop: a new trend for big data. IEEE Syst J 11(4):2337–2344
Article Google Scholar
Pastorelli M, Carra D, Dell’Amico M et al (2017) HFSP: bringing size-based scheduling to Hadoop. IEEE Trans Cloud Comput 5(1):43–56
Article Google Scholar
Quresh NMF, Shin DR, Siddiqui IF et al (2017) Storage-tag-aware scheduler for Hadoop cluster. IEEE Access 5:13742–13755
Article Google Scholar
Guo YF, Rao J, Jiang CJ et al (2017) Moving Hadoop into the cloud with flexible slot management and speculative execution. IEEE Trans Parallel Distrib Syst 28(3):798–812
Article Google Scholar
Ferrucci F, Salza P, Sarro F (2018) Using Hadoop MapReduce for parallel genetic algorithms: a comparison of the global, grid and island models. Evol Comput 26(4):535–567
Article Google Scholar
Lin CY, Lin YC (2017) An overall approach to achieve load balancing for Hadoop distributed file system. Int J Web Grid Serv 13(4):448–466
Article MathSciNet Google Scholar
Alarabi L, Mokbel MF, Musleh M (2018) ST-Hadoop: a MapReduce framework for spatio-temporal data. Geoinformatica 22(4):785–813
Article Google Scholar
Lu XZ, Phang K (2018) An enhanced Hadoop heartbeat mechanism for MapReduce task scheduler using dynamic calibration. China Commun 15(11):93–110
Article Google Scholar
Oo MN, Parvin S, Thein T (2018) Forensic investigation through data remnants on Hadoop big data storage system. Comput Syst Sci Eng 33(3):203–217
Google Scholar

Download references

Acknowledgements

This work was supported by grants from The National Natural Science Foundation of China (No. 61862056), the Guangxi Natural Science Foundation (No. 2017GXNSFAA198148) foundation of Wuzhou University (No. 2017B001) and Guangxi Colleges and Universities Key Laboratory of Professional Software Technology, Wuzhou University.

Author information

Authors and Affiliations

Guangxi Colleges and Universities Key Laboratory of Professional Software Technology, Wuzhou University, Wuzhou, China
Ming Zheng & Mugui Zhuo

Authors

Ming Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Mugui Zhuo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mugui Zhuo .

Editor information

Editors and Affiliations

Shanghai University, Shanghai, China
Zheng Xu
University of Texas at San Antonio, San Antonio, TX, USA
Kim-Kwang Raymond Choo
University of Guelph, Guelph, ON, Canada
Ali Dehghantanha
Kennesaw State University, Marietta, GA, USA
Reza Parizi
Manchester Metropolitan University, Stockport, UK
Mohammad Hammoudeh

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zheng, M., Zhuo, M. (2020). Adaptive Scheduling Algorithm for Hadoop Node Capability in Heterogeneous Resource Environment. In: Xu, Z., Choo, KK., Dehghantanha, A., Parizi, R., Hammoudeh, M. (eds) Cyber Security Intelligence and Analytics. CSIA 2019. Advances in Intelligent Systems and Computing, vol 928. Springer, Cham. https://doi.org/10.1007/978-3-030-15235-2_182

Download citation

DOI: https://doi.org/10.1007/978-3-030-15235-2_182
Published: 25 April 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-15234-5
Online ISBN: 978-3-030-15235-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics