Energy-Aware Fault-Tolerant Dynamic Task Scheduling Scheme for Virtualized Cloud Data Centers

Marahatta, Avinab; Wang, Youshi; Zhang, Fa; Sangaiah, Arun Kumar; Tyagi, Sumarga Kumar Sah; Liu, Zhiyong

doi:10.1007/s11036-018-1062-7

Energy-Aware Fault-Tolerant Dynamic Task Scheduling Scheme for Virtualized Cloud Data Centers

Published: 16 June 2018

Volume 24, pages 1063–1077, (2019)
Cite this article

Mobile Networks and Applications Aims and scope Submit manuscript

Avinab Marahatta^1,2,
Youshi Wang^1,2,
Fa Zhang^2,3,
Arun Kumar Sangaiah⁴,
Sumarga Kumar Sah Tyagi^1,2 &
…
Zhiyong Liu^2,5

738 Accesses
39 Citations
Explore all metrics

Abstract

As clouds have been implemented and widely used in various fields, both the size and the number of cloud data centers (CDCs) are growing rapidly. Serious problems have been raised, such as the inefficient use of resources, high energy consumption, and failure of heterogeneous task execution. The existing studies have aimed to solve these challenging problems separately, but it is difficult to optimize resources and energy efficiency while simultaneously providing fault-tolerance. In this study, a dynamic task assignment and scheduling scheme, namely, the energy-aware fault-tolerant dynamic scheduling scheme (EFDTS), is developed to coordinately optimize resource utilization and energy consumption with a fault tolerant mechanism. In the task assignment scheme, a task classification method is developed to partition the coming tasks into different classes and then allocate them to the most suitable virtual machines based on their classes to reduce the mean response time while considering energy consumption. Replication is used for the fault tolerance to minimize the task rejection ratio caused by machine failure and delay. An elastic resource provisioning mechanism is designed in the context of fault-tolerance to improve resource utilization and energy efficiency. Furthermore, a migration policy is developed that can simultaneously improve resource utilization and energy efficiency. The experimental results show that compared with existing techniques, EFDTS significantly improves the overall scheduling performance, achieves a higher degree of fault tolerance with high CDC resource utilization, minimizes the mean response time and task rejection ratio, and reduces energy consumption.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Enhanced fault identification and optimal task prediction (EFIOTP) algorithm during multi-resource utilization in cloud-based knowledge and personal computing

Article 25 July 2019

J. M. Nandhini & T. Gnanasekaran

Enhancing the Performance of Cloud Environment by a Novel Three-Stage Task Scheduling Policy

HFTO: Hybrid Firebug Tunicate Optimizer for Fault Tolerance and Dynamic Task Scheduling in Cloud Computing

Article 02 December 2022

Manikandan Nanjappan, Gobalakrishnan Natesan & Pradeep Krishnadoss

References

Panda SK, Jana PK (2015) Efficient task scheduling algorithms for heterogeneous multi-cloud environment. J Supercomput 71:1505–1533. https://doi.org/10.1007/s11227-014-1376-6
Article Google Scholar
Verma A, Kaushal S, Sangaiah AK (2017) Computational intelligence based heuristic approach for maximizing energy efficiency in internet of things. Int Dec Supp Syst Sustain Comput 705:53–76
Google Scholar
Qin X, Jiang H (2006) A novel fault-tolerant scheduling algorithm for precedence constrained tasks in real-time heterogeneous systems. Parallel Comput 32(5):331–356. https://doi.org/10.1016/j.parco.2006.06.006
Article MathSciNet Google Scholar
He J, Mianxiong D, Ota K, Fan M, Wang G (2016) NetSecCC: a scalable and fault-tolerant architecture for cloud computing security. Peer-to-Peer Netw Appl 9(1):67–81. https://doi.org/10.1007/s12083-014-0314-y
Article Google Scholar
Gital A Y, Ismail A S, Chen M, Chiroma H (2014) A Framework for the Design of Cloud Based Collaborative Virtual Environment Architecture. Proc Int Multi Conf Eng Comput Sci
Lu K, Yahyapour R, Wieder P, Yaqub E, Abdullah M, Schloer B, Kotsokalis C (2016) Fault-tolerant service level agreement lifecycle management in clouds using actor system. Futur Gener Comput Syst 54:247–259. https://doi.org/10.1016/j.future.2015.03.016
Article Google Scholar
Moon YH, Youn CH (2015) Multihybrid job scheduling for fault-tolerant distributed computing in policy-constrained resource networks. Comput Netw 82:81–95. https://doi.org/10.1016/j.comnet.2015.02.030
Article Google Scholar
Nawi NM, Khan A, Rehman MZ, Chiroma H, Herawan T (2015) Weight optimization in recurrent neural networks with hybrid metaheuristic cuckoo search techniques for data classification. Math Probl Eng. https://doi.org/10.1155/2015/868375
Mills B, Taieb Z, Melhem R (2014) Shadow computing: An energy-aware fault tolerant computing model. 2014 Int Conf Comput, Netw Commun (ICNC) 73–77. doi:https://doi.org/10.1109/ICCNC.2014.6785308
Jasma B, Nedunchezhian R (2016) Performance-driven load balancing with a primary-backup approach for computational grids with low communication cost and replication cost. IEEE Trans Comput 62(5):990–1003. https://doi.org/10.1109/TC.2012.44
MathSciNet MATH Google Scholar
Shafii Muhammad A, Shafie Abd Latiff M, Bakri BM (2014) On-demand grid provisioning using cloud infrastructures and related virtualization tools : a survey and taxonomy. Int J Adv Stud Comput Sci Eng IJASCSE 3(1):49–59
Google Scholar
Singh V, Itm K (2014) A survey on various fault tolerant approaches for cloud environment during load balancing. Int J Comput Networking, Wirel Mob Commun 4(6):25–34
Google Scholar
Plankensteiner K, Prodan R, Fahringer T (2007) Westminster Research Fault-tolerant behavior in state-of-the-art GridWorkflow Management Systems. Institute for Computer Science University of Innsbruck Attila Kert CoreGRID Technical Report Number TR-0091
Plankensteiner K, Prodan R (2012) Meeting soft deadlines in scientific workflows using resubmission impact. IEEE Trans Parall Distribut Syst 23(5):890–901. https://doi.org/10.1109/TPDS.2011.221
Article Google Scholar
C-c H, Shin KG (2003) A fault-tolerant scheduling algorithm for real-time periodic tasks with possible software faults. IEEE Trans Comput 52(3):362–372. https://doi.org/10.1109/TC.2003.1183950
Article Google Scholar
Stavrinides GL, Karatza HD (2010) The journal of systems and software scheduling multiple task graphs with end-to-end deadlines in distributed real-time systems utilizing imprecise computations. J Syst Softw 83(6):1004–1014. https://doi.org/10.1016/j.jss.2009.12.025
Article Google Scholar
Cui X, Mills B, Znati T, Melhem R (2014) Shadow replication: an energy-aware, fault-tolerant computational model for green cloud computing. Energies 7(8):5151–5176. https://doi.org/10.3390/en7085151
Article Google Scholar
Jing W, Liu Y (2014) Multiple DAGs reliability model and fault-tolerant scheduling algorithm in cloud computing system. Comput Model NEW Technol 18(8):22–30
MathSciNet Google Scholar
Wang J, Bao W, Zhu X, Yang LT, Xiang Y (2015) FESTAL: fault-tolerant elastic scheduling algorithm for real-time tasks in virtualized clouds. IEEE Trans Comput 64(9):2545–2558. https://doi.org/10.3969/j.issn.1000-436x.2014.10.020
Article MathSciNet MATH Google Scholar
Manimaran G, Murthy CSR (1998) A fault-tolerant dynamic scheduling algorithm for multiprocessor real-time systems and its analysis. IEEE Trans Parall Distribut Syst 9(11):1137–1152. https://doi.org/10.1109/71.735960
Article Google Scholar
Al-Omari R, Somani Arun K, Manimaran G (2004) Efficient overloading techniques for primary-backup scheduling in real-time systems. J Paral Distribut Comput 64:629–648. https://doi.org/10.1016/j.jpdc.2004.03.015
Article MATH Google Scholar
Ghosh S, Melhem R, Moss’e D (1997) Fault-tolerance through scheduling of aperiodic tasks in hard real-time multiprocessor systems. IEEE Trans Parall Distribut Syst 8(3):272–284. https://doi.org/10.1109/71.584093
Article Google Scholar
Zheng Q, Veeravalli B, Tham CK (2009) On the design of fault-tolerant scheduling strategies using primary-backup approach for computational grids with low replication costs. IEEE Trans Comput 58(3):380–393. https://doi.org/10.1109/TC.2008.172
Article MathSciNet MATH Google Scholar
Zhu X, Qin X, Meikang Q (2011) QoS-aware fault-tolerant scheduling for real-time tasks on heterogeneous clusters. IEEE Trans Comput 60(6):800–812. https://doi.org/10.1109/TC.2011.68
Article MathSciNet MATH Google Scholar
Manimaran G (2005) An adaptive scheme for fault-tolerant scheduling of soft real-time tasks in multiprocessor systems. J Parallel Distrib Comput 65(5):595–608. https://doi.org/10.1016/j.jpdc.2004.09.021
Article MATH Google Scholar
Antony S, Antony S, Ajeena BA, Rajasree MS (2012) Task scheduling algorithm with fault tolerance for cloud. International conference on computing sciences pp 6-8. https://doi.org/10.1109/ICCS.2012.71
Warneke D, Kao O (2011) Exploiting dynamic resource allocation for efficient parallel data processing in the cloud. IEEE Trans Parall Distribut Syst 22(6):985–997. https://doi.org/10.1109/TPDS.2011.65
Article Google Scholar
C-h H, Slagter KD, S-c C, Chung Y-c (2014) Optimizing energy consumption with task consolidation in clouds. Inf Sci 258:452–462. https://doi.org/10.1016/j.ins.2012.10.041
Article Google Scholar
Zhang P, Zhou M (2017) Dynamic Cloud Task Scheduling Based on a Two-Stage Strategy. IEEE Transactions on Automation Science and Engineering pp 1–12. doi:https://doi.org/10.1109/TASE.2017.2693688
Gao Y, Lei Y (2017) Energy-aware Load Balancing in Heterogeneous Cloud Data Centers. ICMSS ‘17 Proceedings of the 2017 International Conference on Management Engineering, Software Engineering and Service Sciences pp 80–84. doi:https://doi.org/10.1145/3034950.3035000
Kaile Z, Shanlin Y, Zhen S (2016) Energy internet: the business perspective. Appl Energy 178:212–222. https://doi.org/10.1016/j.apenergy.2016.06.052
Article Google Scholar
Buyya R, Beloglazov A, Abawajy J (2010) Energy-efficient management of data center resources for cloud computing: a vision, architectural elements, and open challenges. International Conference on Parallel and Distributed Processing Techniques and Applications, pp 6–17. http://hdl.handle.net/10536/DRO/DU:30033287
Wu W, Lin W, Peng Z (2016) An intelligent power consumption model for virtual machines under CPU-intensive workload in cloud environment. Soft Computing pp 1–10. doi:https://doi.org/10.1007/s00500-016-2154-6
Jin X, Zhang F, Wang L, Hu S, Zhou B, Liu Z (2015) Joint optimization of operational cost and performance interference in cloud data centers. IEEE Trans Cloud Comput 5(4):697–711. https://doi.org/10.1109/TCC.2015.2449839
Article Google Scholar
Jin X, Zhang F, Vasilakos V V, Liu Z (2016) Green Data Centers: A Survey, Perspectives, and Future Directions. Distributed, Parallel, and Cluster Computing. https://arxiv.org/pdf/1608.00687.pdf
Gandhi A, Lefurgy C, Kephart J O (2009) Power Capping Via Forced Idleness. Workshop on Energy- Efficient Design(WEED)
Graubner P, Schmidt M, Freisleben B (2011) Energy-efficient Management of Virtual Machines in eucalyptus. IEEE Int Conf Cloud Comput. https://doi.org/10.1109/CLOUD.2011.26
Bayes Classifier. Wikipedia Available: https://en.wikipedia.org/wiki/Bayes classifier
Xia Y, Zhou M, Luo X, Zhu Q, Li J, Huang Y (2015) Stochastic modeling and quality evaluation of infrastructure-as-a-service clouds. IEEE Trans Autom 12(1):162–170. https://doi.org/10.1109/TASE.2013.2276477
Article Google Scholar
Calheiros RN, Ranjan R, Beloglazov A, Cesar AFDR, Buyya R (2011) CloudSim:a toolkit for modeling and simulation of cloud computing environments and evaluation of resource. Software: Pract Exp 41(1):23–50. https://doi.org/10.1002/spe.995
Google Scholar
Google Cluster Data. GitHub. https://github.com/google/cluster-data

Download references

Acknowledgements

This work was partially supported by the National Natural Science Foundation of China (Grants No. 61520106005, 61521092) and the National Key Research and Development Program of China (No. 2016YFB0800400). The first author gratefully acknowledges the “CAS-TWAS” Presidents Fellowship for funding his Ph.D. at the Chinese Academy of Sciences in Beijing, China.

Author information

Authors and Affiliations

University of Chinese Academy of Sciences, Beijing, China
Avinab Marahatta, Youshi Wang & Sumarga Kumar Sah Tyagi
Beijing Key laboratory of Mobile Computing and Pervasive Device, Institute of Computing Technology (ICT), Chinese Academy of Sciences (CAS) Beijing, Beijing, China
Avinab Marahatta, Youshi Wang, Fa Zhang, Sumarga Kumar Sah Tyagi & Zhiyong Liu
Key Laboratory of Intelligent Information Processing, ICT, CAS, Beijing, China
Fa Zhang
School of Computing Science, VIT University, Vellore, India
Arun Kumar Sangaiah
State Key Laboratory of Computer Architecture, ICT, CAS, Beijing, China
Zhiyong Liu

Authors

Avinab Marahatta
View author publications
You can also search for this author in PubMed Google Scholar
Youshi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Fa Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Arun Kumar Sangaiah
View author publications
You can also search for this author in PubMed Google Scholar
Sumarga Kumar Sah Tyagi
View author publications
You can also search for this author in PubMed Google Scholar
Zhiyong Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhiyong Liu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Marahatta, A., Wang, Y., Zhang, F. et al. Energy-Aware Fault-Tolerant Dynamic Task Scheduling Scheme for Virtualized Cloud Data Centers. Mobile Netw Appl 24, 1063–1077 (2019). https://doi.org/10.1007/s11036-018-1062-7

Download citation

Published: 16 June 2018
Issue Date: 15 June 2019
DOI: https://doi.org/10.1007/s11036-018-1062-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Energy-Aware Fault-Tolerant Dynamic Task Scheduling Scheme for Virtualized Cloud Data Centers

Abstract

Access this article

Similar content being viewed by others

Enhanced fault identification and optimal task prediction (EFIOTP) algorithm during multi-resource utilization in cloud-based knowledge and personal computing

Enhancing the Performance of Cloud Environment by a Novel Three-Stage Task Scheduling Policy

HFTO: Hybrid Firebug Tunicate Optimizer for Fault Tolerance and Dynamic Task Scheduling in Cloud Computing

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Energy-Aware Fault-Tolerant Dynamic Task Scheduling Scheme for Virtualized Cloud Data Centers

Abstract

Access this article

Similar content being viewed by others

Enhanced fault identification and optimal task prediction (EFIOTP) algorithm during multi-resource utilization in cloud-based knowledge and personal computing

Enhancing the Performance of Cloud Environment by a Novel Three-Stage Task Scheduling Policy

HFTO: Hybrid Firebug Tunicate Optimizer for Fault Tolerance and Dynamic Task Scheduling in Cloud Computing

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation