Skip to main content

Advertisement

Log in

Energy-Aware Fault-Tolerant Dynamic Task Scheduling Scheme for Virtualized Cloud Data Centers

  • Published:
Mobile Networks and Applications Aims and scope Submit manuscript

Abstract

As clouds have been implemented and widely used in various fields, both the size and the number of cloud data centers (CDCs) are growing rapidly. Serious problems have been raised, such as the inefficient use of resources, high energy consumption, and failure of heterogeneous task execution. The existing studies have aimed to solve these challenging problems separately, but it is difficult to optimize resources and energy efficiency while simultaneously providing fault-tolerance. In this study, a dynamic task assignment and scheduling scheme, namely, the energy-aware fault-tolerant dynamic scheduling scheme (EFDTS), is developed to coordinately optimize resource utilization and energy consumption with a fault tolerant mechanism. In the task assignment scheme, a task classification method is developed to partition the coming tasks into different classes and then allocate them to the most suitable virtual machines based on their classes to reduce the mean response time while considering energy consumption. Replication is used for the fault tolerance to minimize the task rejection ratio caused by machine failure and delay. An elastic resource provisioning mechanism is designed in the context of fault-tolerance to improve resource utilization and energy efficiency. Furthermore, a migration policy is developed that can simultaneously improve resource utilization and energy efficiency. The experimental results show that compared with existing techniques, EFDTS significantly improves the overall scheduling performance, achieves a higher degree of fault tolerance with high CDC resource utilization, minimizes the mean response time and task rejection ratio, and reduces energy consumption.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Panda SK, Jana PK (2015) Efficient task scheduling algorithms for heterogeneous multi-cloud environment. J Supercomput 71:1505–1533. https://doi.org/10.1007/s11227-014-1376-6

    Article  Google Scholar 

  2. Verma A, Kaushal S, Sangaiah AK (2017) Computational intelligence based heuristic approach for maximizing energy efficiency in internet of things. Int Dec Supp Syst Sustain Comput 705:53–76

    Google Scholar 

  3. Qin X, Jiang H (2006) A novel fault-tolerant scheduling algorithm for precedence constrained tasks in real-time heterogeneous systems. Parallel Comput 32(5):331–356. https://doi.org/10.1016/j.parco.2006.06.006

    Article  MathSciNet  Google Scholar 

  4. He J, Mianxiong D, Ota K, Fan M, Wang G (2016) NetSecCC: a scalable and fault-tolerant architecture for cloud computing security. Peer-to-Peer Netw Appl 9(1):67–81. https://doi.org/10.1007/s12083-014-0314-y

    Article  Google Scholar 

  5. Gital A Y, Ismail A S, Chen M, Chiroma H (2014) A Framework for the Design of Cloud Based Collaborative Virtual Environment Architecture. Proc Int Multi Conf Eng Comput Sci

  6. Lu K, Yahyapour R, Wieder P, Yaqub E, Abdullah M, Schloer B, Kotsokalis C (2016) Fault-tolerant service level agreement lifecycle management in clouds using actor system. Futur Gener Comput Syst 54:247–259. https://doi.org/10.1016/j.future.2015.03.016

    Article  Google Scholar 

  7. Moon YH, Youn CH (2015) Multihybrid job scheduling for fault-tolerant distributed computing in policy-constrained resource networks. Comput Netw 82:81–95. https://doi.org/10.1016/j.comnet.2015.02.030

    Article  Google Scholar 

  8. Nawi NM, Khan A, Rehman MZ, Chiroma H, Herawan T (2015) Weight optimization in recurrent neural networks with hybrid metaheuristic cuckoo search techniques for data classification. Math Probl Eng. https://doi.org/10.1155/2015/868375

  9. Mills B, Taieb Z, Melhem R (2014) Shadow computing: An energy-aware fault tolerant computing model. 2014 Int Conf Comput, Netw Commun (ICNC) 73–77. doi:https://doi.org/10.1109/ICCNC.2014.6785308

  10. Jasma B, Nedunchezhian R (2016) Performance-driven load balancing with a primary-backup approach for computational grids with low communication cost and replication cost. IEEE Trans Comput 62(5):990–1003. https://doi.org/10.1109/TC.2012.44

    MathSciNet  MATH  Google Scholar 

  11. Shafii Muhammad A, Shafie Abd Latiff M, Bakri BM (2014) On-demand grid provisioning using cloud infrastructures and related virtualization tools : a survey and taxonomy. Int J Adv Stud Comput Sci Eng IJASCSE 3(1):49–59

    Google Scholar 

  12. Singh V, Itm K (2014) A survey on various fault tolerant approaches for cloud environment during load balancing. Int J Comput Networking, Wirel Mob Commun 4(6):25–34

    Google Scholar 

  13. Plankensteiner K, Prodan R, Fahringer T (2007) Westminster Research Fault-tolerant behavior in state-of-the-art GridWorkflow Management Systems. Institute for Computer Science University of Innsbruck Attila Kert CoreGRID Technical Report Number TR-0091

  14. Plankensteiner K, Prodan R (2012) Meeting soft deadlines in scientific workflows using resubmission impact. IEEE Trans Parall Distribut Syst 23(5):890–901. https://doi.org/10.1109/TPDS.2011.221

    Article  Google Scholar 

  15. C-c H, Shin KG (2003) A fault-tolerant scheduling algorithm for real-time periodic tasks with possible software faults. IEEE Trans Comput 52(3):362–372. https://doi.org/10.1109/TC.2003.1183950

    Article  Google Scholar 

  16. Stavrinides GL, Karatza HD (2010) The journal of systems and software scheduling multiple task graphs with end-to-end deadlines in distributed real-time systems utilizing imprecise computations. J Syst Softw 83(6):1004–1014. https://doi.org/10.1016/j.jss.2009.12.025

    Article  Google Scholar 

  17. Cui X, Mills B, Znati T, Melhem R (2014) Shadow replication: an energy-aware, fault-tolerant computational model for green cloud computing. Energies 7(8):5151–5176. https://doi.org/10.3390/en7085151

    Article  Google Scholar 

  18. Jing W, Liu Y (2014) Multiple DAGs reliability model and fault-tolerant scheduling algorithm in cloud computing system. Comput Model NEW Technol 18(8):22–30

    MathSciNet  Google Scholar 

  19. Wang J, Bao W, Zhu X, Yang LT, Xiang Y (2015) FESTAL: fault-tolerant elastic scheduling algorithm for real-time tasks in virtualized clouds. IEEE Trans Comput 64(9):2545–2558. https://doi.org/10.3969/j.issn.1000-436x.2014.10.020

    Article  MathSciNet  MATH  Google Scholar 

  20. Manimaran G, Murthy CSR (1998) A fault-tolerant dynamic scheduling algorithm for multiprocessor real-time systems and its analysis. IEEE Trans Parall Distribut Syst 9(11):1137–1152. https://doi.org/10.1109/71.735960

    Article  Google Scholar 

  21. Al-Omari R, Somani Arun K, Manimaran G (2004) Efficient overloading techniques for primary-backup scheduling in real-time systems. J Paral Distribut Comput 64:629–648. https://doi.org/10.1016/j.jpdc.2004.03.015

    Article  MATH  Google Scholar 

  22. Ghosh S, Melhem R, Moss’e D (1997) Fault-tolerance through scheduling of aperiodic tasks in hard real-time multiprocessor systems. IEEE Trans Parall Distribut Syst 8(3):272–284. https://doi.org/10.1109/71.584093

    Article  Google Scholar 

  23. Zheng Q, Veeravalli B, Tham CK (2009) On the design of fault-tolerant scheduling strategies using primary-backup approach for computational grids with low replication costs. IEEE Trans Comput 58(3):380–393. https://doi.org/10.1109/TC.2008.172

    Article  MathSciNet  MATH  Google Scholar 

  24. Zhu X, Qin X, Meikang Q (2011) QoS-aware fault-tolerant scheduling for real-time tasks on heterogeneous clusters. IEEE Trans Comput 60(6):800–812. https://doi.org/10.1109/TC.2011.68

    Article  MathSciNet  MATH  Google Scholar 

  25. Manimaran G (2005) An adaptive scheme for fault-tolerant scheduling of soft real-time tasks in multiprocessor systems. J Parallel Distrib Comput 65(5):595–608. https://doi.org/10.1016/j.jpdc.2004.09.021

    Article  MATH  Google Scholar 

  26. Antony S, Antony S, Ajeena BA, Rajasree MS (2012) Task scheduling algorithm with fault tolerance for cloud. International conference on computing sciences pp 6-8. https://doi.org/10.1109/ICCS.2012.71

  27. Warneke D, Kao O (2011) Exploiting dynamic resource allocation for efficient parallel data processing in the cloud. IEEE Trans Parall Distribut Syst 22(6):985–997. https://doi.org/10.1109/TPDS.2011.65

    Article  Google Scholar 

  28. C-h H, Slagter KD, S-c C, Chung Y-c (2014) Optimizing energy consumption with task consolidation in clouds. Inf Sci 258:452–462. https://doi.org/10.1016/j.ins.2012.10.041

    Article  Google Scholar 

  29. Zhang P, Zhou M (2017) Dynamic Cloud Task Scheduling Based on a Two-Stage Strategy. IEEE Transactions on Automation Science and Engineering pp 1–12. doi:https://doi.org/10.1109/TASE.2017.2693688

  30. Gao Y, Lei Y (2017) Energy-aware Load Balancing in Heterogeneous Cloud Data Centers. ICMSS ‘17 Proceedings of the 2017 International Conference on Management Engineering, Software Engineering and Service Sciences pp 80–84. doi:https://doi.org/10.1145/3034950.3035000

  31. Kaile Z, Shanlin Y, Zhen S (2016) Energy internet: the business perspective. Appl Energy 178:212–222. https://doi.org/10.1016/j.apenergy.2016.06.052

    Article  Google Scholar 

  32. Buyya R, Beloglazov A, Abawajy J (2010) Energy-efficient management of data center resources for cloud computing: a vision, architectural elements, and open challenges. International Conference on Parallel and Distributed Processing Techniques and Applications, pp 6–17. http://hdl.handle.net/10536/DRO/DU:30033287

  33. Wu W, Lin W, Peng Z (2016) An intelligent power consumption model for virtual machines under CPU-intensive workload in cloud environment. Soft Computing pp 1–10. doi:https://doi.org/10.1007/s00500-016-2154-6

  34. Jin X, Zhang F, Wang L, Hu S, Zhou B, Liu Z (2015) Joint optimization of operational cost and performance interference in cloud data centers. IEEE Trans Cloud Comput 5(4):697–711. https://doi.org/10.1109/TCC.2015.2449839

    Article  Google Scholar 

  35. Jin X, Zhang F, Vasilakos V V, Liu Z (2016) Green Data Centers: A Survey, Perspectives, and Future Directions. Distributed, Parallel, and Cluster Computing. https://arxiv.org/pdf/1608.00687.pdf

  36. Gandhi A, Lefurgy C, Kephart J O (2009) Power Capping Via Forced Idleness. Workshop on Energy- Efficient Design(WEED)

  37. Graubner P, Schmidt M, Freisleben B (2011) Energy-efficient Management of Virtual Machines in eucalyptus. IEEE Int Conf Cloud Comput. https://doi.org/10.1109/CLOUD.2011.26

  38. Bayes Classifier. Wikipedia Available: https://en.wikipedia.org/wiki/Bayes classifier

  39. Xia Y, Zhou M, Luo X, Zhu Q, Li J, Huang Y (2015) Stochastic modeling and quality evaluation of infrastructure-as-a-service clouds. IEEE Trans Autom 12(1):162–170. https://doi.org/10.1109/TASE.2013.2276477

    Article  Google Scholar 

  40. Calheiros RN, Ranjan R, Beloglazov A, Cesar AFDR, Buyya R (2011) CloudSim:a toolkit for modeling and simulation of cloud computing environments and evaluation of resource. Software: Pract Exp 41(1):23–50. https://doi.org/10.1002/spe.995

    Google Scholar 

  41. Google Cluster Data. GitHub. https://github.com/google/cluster-data

Download references

Acknowledgements

This work was partially supported by the National Natural Science Foundation of China (Grants No. 61520106005, 61521092) and the National Key Research and Development Program of China (No. 2016YFB0800400). The first author gratefully acknowledges the “CAS-TWAS” Presidents Fellowship for funding his Ph.D. at the Chinese Academy of Sciences in Beijing, China.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhiyong Liu.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Marahatta, A., Wang, Y., Zhang, F. et al. Energy-Aware Fault-Tolerant Dynamic Task Scheduling Scheme for Virtualized Cloud Data Centers. Mobile Netw Appl 24, 1063–1077 (2019). https://doi.org/10.1007/s11036-018-1062-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11036-018-1062-7

Keywords

Navigation