Abstract
As clouds have been implemented and widely used in various fields, both the size and the number of cloud data centers (CDCs) are growing rapidly. Serious problems have been raised, such as the inefficient use of resources, high energy consumption, and failure of heterogeneous task execution. The existing studies have aimed to solve these challenging problems separately, but it is difficult to optimize resources and energy efficiency while simultaneously providing fault-tolerance. In this study, a dynamic task assignment and scheduling scheme, namely, the energy-aware fault-tolerant dynamic scheduling scheme (EFDTS), is developed to coordinately optimize resource utilization and energy consumption with a fault tolerant mechanism. In the task assignment scheme, a task classification method is developed to partition the coming tasks into different classes and then allocate them to the most suitable virtual machines based on their classes to reduce the mean response time while considering energy consumption. Replication is used for the fault tolerance to minimize the task rejection ratio caused by machine failure and delay. An elastic resource provisioning mechanism is designed in the context of fault-tolerance to improve resource utilization and energy efficiency. Furthermore, a migration policy is developed that can simultaneously improve resource utilization and energy efficiency. The experimental results show that compared with existing techniques, EFDTS significantly improves the overall scheduling performance, achieves a higher degree of fault tolerance with high CDC resource utilization, minimizes the mean response time and task rejection ratio, and reduces energy consumption.
Similar content being viewed by others
References
Panda SK, Jana PK (2015) Efficient task scheduling algorithms for heterogeneous multi-cloud environment. J Supercomput 71:1505–1533. https://doi.org/10.1007/s11227-014-1376-6
Verma A, Kaushal S, Sangaiah AK (2017) Computational intelligence based heuristic approach for maximizing energy efficiency in internet of things. Int Dec Supp Syst Sustain Comput 705:53–76
Qin X, Jiang H (2006) A novel fault-tolerant scheduling algorithm for precedence constrained tasks in real-time heterogeneous systems. Parallel Comput 32(5):331–356. https://doi.org/10.1016/j.parco.2006.06.006
He J, Mianxiong D, Ota K, Fan M, Wang G (2016) NetSecCC: a scalable and fault-tolerant architecture for cloud computing security. Peer-to-Peer Netw Appl 9(1):67–81. https://doi.org/10.1007/s12083-014-0314-y
Gital A Y, Ismail A S, Chen M, Chiroma H (2014) A Framework for the Design of Cloud Based Collaborative Virtual Environment Architecture. Proc Int Multi Conf Eng Comput Sci
Lu K, Yahyapour R, Wieder P, Yaqub E, Abdullah M, Schloer B, Kotsokalis C (2016) Fault-tolerant service level agreement lifecycle management in clouds using actor system. Futur Gener Comput Syst 54:247–259. https://doi.org/10.1016/j.future.2015.03.016
Moon YH, Youn CH (2015) Multihybrid job scheduling for fault-tolerant distributed computing in policy-constrained resource networks. Comput Netw 82:81–95. https://doi.org/10.1016/j.comnet.2015.02.030
Nawi NM, Khan A, Rehman MZ, Chiroma H, Herawan T (2015) Weight optimization in recurrent neural networks with hybrid metaheuristic cuckoo search techniques for data classification. Math Probl Eng. https://doi.org/10.1155/2015/868375
Mills B, Taieb Z, Melhem R (2014) Shadow computing: An energy-aware fault tolerant computing model. 2014 Int Conf Comput, Netw Commun (ICNC) 73–77. doi:https://doi.org/10.1109/ICCNC.2014.6785308
Jasma B, Nedunchezhian R (2016) Performance-driven load balancing with a primary-backup approach for computational grids with low communication cost and replication cost. IEEE Trans Comput 62(5):990–1003. https://doi.org/10.1109/TC.2012.44
Shafii Muhammad A, Shafie Abd Latiff M, Bakri BM (2014) On-demand grid provisioning using cloud infrastructures and related virtualization tools : a survey and taxonomy. Int J Adv Stud Comput Sci Eng IJASCSE 3(1):49–59
Singh V, Itm K (2014) A survey on various fault tolerant approaches for cloud environment during load balancing. Int J Comput Networking, Wirel Mob Commun 4(6):25–34
Plankensteiner K, Prodan R, Fahringer T (2007) Westminster Research Fault-tolerant behavior in state-of-the-art GridWorkflow Management Systems. Institute for Computer Science University of Innsbruck Attila Kert CoreGRID Technical Report Number TR-0091
Plankensteiner K, Prodan R (2012) Meeting soft deadlines in scientific workflows using resubmission impact. IEEE Trans Parall Distribut Syst 23(5):890–901. https://doi.org/10.1109/TPDS.2011.221
C-c H, Shin KG (2003) A fault-tolerant scheduling algorithm for real-time periodic tasks with possible software faults. IEEE Trans Comput 52(3):362–372. https://doi.org/10.1109/TC.2003.1183950
Stavrinides GL, Karatza HD (2010) The journal of systems and software scheduling multiple task graphs with end-to-end deadlines in distributed real-time systems utilizing imprecise computations. J Syst Softw 83(6):1004–1014. https://doi.org/10.1016/j.jss.2009.12.025
Cui X, Mills B, Znati T, Melhem R (2014) Shadow replication: an energy-aware, fault-tolerant computational model for green cloud computing. Energies 7(8):5151–5176. https://doi.org/10.3390/en7085151
Jing W, Liu Y (2014) Multiple DAGs reliability model and fault-tolerant scheduling algorithm in cloud computing system. Comput Model NEW Technol 18(8):22–30
Wang J, Bao W, Zhu X, Yang LT, Xiang Y (2015) FESTAL: fault-tolerant elastic scheduling algorithm for real-time tasks in virtualized clouds. IEEE Trans Comput 64(9):2545–2558. https://doi.org/10.3969/j.issn.1000-436x.2014.10.020
Manimaran G, Murthy CSR (1998) A fault-tolerant dynamic scheduling algorithm for multiprocessor real-time systems and its analysis. IEEE Trans Parall Distribut Syst 9(11):1137–1152. https://doi.org/10.1109/71.735960
Al-Omari R, Somani Arun K, Manimaran G (2004) Efficient overloading techniques for primary-backup scheduling in real-time systems. J Paral Distribut Comput 64:629–648. https://doi.org/10.1016/j.jpdc.2004.03.015
Ghosh S, Melhem R, Moss’e D (1997) Fault-tolerance through scheduling of aperiodic tasks in hard real-time multiprocessor systems. IEEE Trans Parall Distribut Syst 8(3):272–284. https://doi.org/10.1109/71.584093
Zheng Q, Veeravalli B, Tham CK (2009) On the design of fault-tolerant scheduling strategies using primary-backup approach for computational grids with low replication costs. IEEE Trans Comput 58(3):380–393. https://doi.org/10.1109/TC.2008.172
Zhu X, Qin X, Meikang Q (2011) QoS-aware fault-tolerant scheduling for real-time tasks on heterogeneous clusters. IEEE Trans Comput 60(6):800–812. https://doi.org/10.1109/TC.2011.68
Manimaran G (2005) An adaptive scheme for fault-tolerant scheduling of soft real-time tasks in multiprocessor systems. J Parallel Distrib Comput 65(5):595–608. https://doi.org/10.1016/j.jpdc.2004.09.021
Antony S, Antony S, Ajeena BA, Rajasree MS (2012) Task scheduling algorithm with fault tolerance for cloud. International conference on computing sciences pp 6-8. https://doi.org/10.1109/ICCS.2012.71
Warneke D, Kao O (2011) Exploiting dynamic resource allocation for efficient parallel data processing in the cloud. IEEE Trans Parall Distribut Syst 22(6):985–997. https://doi.org/10.1109/TPDS.2011.65
C-h H, Slagter KD, S-c C, Chung Y-c (2014) Optimizing energy consumption with task consolidation in clouds. Inf Sci 258:452–462. https://doi.org/10.1016/j.ins.2012.10.041
Zhang P, Zhou M (2017) Dynamic Cloud Task Scheduling Based on a Two-Stage Strategy. IEEE Transactions on Automation Science and Engineering pp 1–12. doi:https://doi.org/10.1109/TASE.2017.2693688
Gao Y, Lei Y (2017) Energy-aware Load Balancing in Heterogeneous Cloud Data Centers. ICMSS ‘17 Proceedings of the 2017 International Conference on Management Engineering, Software Engineering and Service Sciences pp 80–84. doi:https://doi.org/10.1145/3034950.3035000
Kaile Z, Shanlin Y, Zhen S (2016) Energy internet: the business perspective. Appl Energy 178:212–222. https://doi.org/10.1016/j.apenergy.2016.06.052
Buyya R, Beloglazov A, Abawajy J (2010) Energy-efficient management of data center resources for cloud computing: a vision, architectural elements, and open challenges. International Conference on Parallel and Distributed Processing Techniques and Applications, pp 6–17. http://hdl.handle.net/10536/DRO/DU:30033287
Wu W, Lin W, Peng Z (2016) An intelligent power consumption model for virtual machines under CPU-intensive workload in cloud environment. Soft Computing pp 1–10. doi:https://doi.org/10.1007/s00500-016-2154-6
Jin X, Zhang F, Wang L, Hu S, Zhou B, Liu Z (2015) Joint optimization of operational cost and performance interference in cloud data centers. IEEE Trans Cloud Comput 5(4):697–711. https://doi.org/10.1109/TCC.2015.2449839
Jin X, Zhang F, Vasilakos V V, Liu Z (2016) Green Data Centers: A Survey, Perspectives, and Future Directions. Distributed, Parallel, and Cluster Computing. https://arxiv.org/pdf/1608.00687.pdf
Gandhi A, Lefurgy C, Kephart J O (2009) Power Capping Via Forced Idleness. Workshop on Energy- Efficient Design(WEED)
Graubner P, Schmidt M, Freisleben B (2011) Energy-efficient Management of Virtual Machines in eucalyptus. IEEE Int Conf Cloud Comput. https://doi.org/10.1109/CLOUD.2011.26
Bayes Classifier. Wikipedia Available: https://en.wikipedia.org/wiki/Bayes classifier
Xia Y, Zhou M, Luo X, Zhu Q, Li J, Huang Y (2015) Stochastic modeling and quality evaluation of infrastructure-as-a-service clouds. IEEE Trans Autom 12(1):162–170. https://doi.org/10.1109/TASE.2013.2276477
Calheiros RN, Ranjan R, Beloglazov A, Cesar AFDR, Buyya R (2011) CloudSim:a toolkit for modeling and simulation of cloud computing environments and evaluation of resource. Software: Pract Exp 41(1):23–50. https://doi.org/10.1002/spe.995
Google Cluster Data. GitHub. https://github.com/google/cluster-data
Acknowledgements
This work was partially supported by the National Natural Science Foundation of China (Grants No. 61520106005, 61521092) and the National Key Research and Development Program of China (No. 2016YFB0800400). The first author gratefully acknowledges the “CAS-TWAS” Presidents Fellowship for funding his Ph.D. at the Chinese Academy of Sciences in Beijing, China.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Marahatta, A., Wang, Y., Zhang, F. et al. Energy-Aware Fault-Tolerant Dynamic Task Scheduling Scheme for Virtualized Cloud Data Centers. Mobile Netw Appl 24, 1063–1077 (2019). https://doi.org/10.1007/s11036-018-1062-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11036-018-1062-7