Skip to main content

Adaptive Scheduling Algorithm for Hadoop Node Capability in Heterogeneous Resource Environment

  • Conference paper
  • First Online:
Cyber Security Intelligence and Analytics (CSIA 2019)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 928))

  • 79 Accesses

Abstract

In order to overcome the shortcomings of the existing scheduling allocation methods for Hadoop clusters in heterogeneous resource environments, an adaptive scheduling algorithm NCAS (node capacity adaptive scheduling) based on node capability is proposed. Firstly, NCAS algorithm calculates the scheduling factor according to the node performance and task characteristics; then, the scheduling factor determines the amount of data and task slots that each node should share; finally, the data and tasks are distributed more to fast nodes and less to slow nodes. The experimental results show that compared with the traditional scheduling algorithm, NCAS algorithm greatly reduces the number of backup tasks to start, significantly reduces job completion time, and improves the efficiency of task execution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Glushkova D, Jovanovic P, Abello A (2019) Mapreduce performance model for Hadoop 2.x. Inf Syst 79:32–43

    Google Scholar 

  2. Qin P, Dai B, Huang BX et al (2017) Bandwidth-aware scheduling with SDN in Hadoop: a new trend for big data. IEEE Syst J 11(4):2337–2344

    Article  Google Scholar 

  3. Pastorelli M, Carra D, Dell’Amico M et al (2017) HFSP: bringing size-based scheduling to Hadoop. IEEE Trans Cloud Comput 5(1):43–56

    Article  Google Scholar 

  4. Quresh NMF, Shin DR, Siddiqui IF et al (2017) Storage-tag-aware scheduler for Hadoop cluster. IEEE Access 5:13742–13755

    Article  Google Scholar 

  5. Guo YF, Rao J, Jiang CJ et al (2017) Moving Hadoop into the cloud with flexible slot management and speculative execution. IEEE Trans Parallel Distrib Syst 28(3):798–812

    Article  Google Scholar 

  6. Ferrucci F, Salza P, Sarro F (2018) Using Hadoop MapReduce for parallel genetic algorithms: a comparison of the global, grid and island models. Evol Comput 26(4):535–567

    Article  Google Scholar 

  7. Lin CY, Lin YC (2017) An overall approach to achieve load balancing for Hadoop distributed file system. Int J Web Grid Serv 13(4):448–466

    Article  MathSciNet  Google Scholar 

  8. Alarabi L, Mokbel MF, Musleh M (2018) ST-Hadoop: a MapReduce framework for spatio-temporal data. Geoinformatica 22(4):785–813

    Article  Google Scholar 

  9. Lu XZ, Phang K (2018) An enhanced Hadoop heartbeat mechanism for MapReduce task scheduler using dynamic calibration. China Commun 15(11):93–110

    Article  Google Scholar 

  10. Oo MN, Parvin S, Thein T (2018) Forensic investigation through data remnants on Hadoop big data storage system. Comput Syst Sci Eng 33(3):203–217

    Google Scholar 

Download references

Acknowledgements

This work was supported by grants from The National Natural Science Foundation of China (No. 61862056), the Guangxi Natural Science Foundation (No. 2017GXNSFAA198148) foundation of Wuzhou University (No. 2017B001) and Guangxi Colleges and Universities Key Laboratory of Professional Software Technology, Wuzhou University.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mugui Zhuo .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zheng, M., Zhuo, M. (2020). Adaptive Scheduling Algorithm for Hadoop Node Capability in Heterogeneous Resource Environment. In: Xu, Z., Choo, KK., Dehghantanha, A., Parizi, R., Hammoudeh, M. (eds) Cyber Security Intelligence and Analytics. CSIA 2019. Advances in Intelligent Systems and Computing, vol 928. Springer, Cham. https://doi.org/10.1007/978-3-030-15235-2_182

Download citation

Publish with us

Policies and ethics