Skip to main content

Advertisement

Log in

Fault tolerance aware scheduling technique for cloud computing environment using dynamic clustering algorithm

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

In cloud computing, resources are dynamically provisioned and delivered to users in a transparent manner automatically on-demand. Task execution failure is no longer accidental but a common characteristic of cloud computing environment. In recent times, a number of intelligent scheduling techniques have been used to address task scheduling issues in cloud without much attention to fault tolerance. In this research article, we proposed a dynamic clustering league championship algorithm (DCLCA) scheduling technique for fault tolerance awareness to address cloud task execution which would reflect on the current available resources and reduce the untimely failure of autonomous tasks. Experimental results show that our proposed technique produces remarkable fault reduction in task failure as measured in terms of failure rate. It also shows that the DCLCA outperformed the MTCT, MAXMIN, ant colony optimization and genetic algorithm-based NSGA-II by producing lower makespan with improvement of 57.8, 53.6, 24.3 and 13.4 % in the first scenario and 60.0, 38.9, 31.5 and 31.2 % in the second scenario, respectively. Considering the experimental results, DCLCA provides better quality fault tolerance aware scheduling that will help to improve the overall performance of the cloud environment.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Gital AY, Ismail AS, Chen M, Chiroma H (2014) A framework for the design of cloud based collaborative virtual environment architecture. In: Proceedings of the international multi conference of engineers and computer scientists

  2. Lu K, Yahyapour R, Wieder P, Yaqub E, Abdullah M, Schloer B, Kotsokalis C (2016) Fault-tolerant service level agreement lifecycle management in clouds using actor system. Future Gener Comput Syst 54:247–259

    Article  Google Scholar 

  3. Moon Y-H, Youn C-H (2015) Multihybrid job scheduling for fault-tolerant distributed computing in policy-constrained resource networks. Comput Netw 82:81–95

    Article  Google Scholar 

  4. He J, Dong M, Ota K, Fan M, Wang G (2014) NetSecCC: a scalable and fault-tolerant architecture for cloud computing security. Peer-to-Peer Netw Appl 9(1):67–81

    Article  Google Scholar 

  5. Nawi NM, Khan A, Rehman MZ, Chiroma H, Herawan T (2015) Weight optimization in recurrent neural networks with hybrid metaheuristic Cuckoo search techniques for data classification. Math Probl Eng. doi:10.1155/2015/868375

    Google Scholar 

  6. Mills B, Znati T, Melhem R (2014) Shadow computing: an energy-aware fault tolerant computing model. In: 2014 International conference on computing, networking and communications (ICNC), IEEE, pp 73–77

  7. Kashan HA (2009) League championship algorithm: a new algorithm for numerical function optimization. In: International conference of soft computing and pattern recognition, 2009. SOCPAR’09, IEEE, pp 43–48

  8. Kashan HA, Karimi B (2012) A new algorithm for constrained optimization inspired by the sport league championships. In: 2010 IEEE congress on evolutionary computation (CEC), IEEE, pp 1–8

  9. Abdulhamid SM, Latiff MSA, Ismaila I (2014) Tasks scheduling technique using league championship algorithm for makespan minimization in IAAS cloud. ARPN J Eng Appl Sci 9(12):2528–2533

    Google Scholar 

  10. Abdulhamid SM, Latiff MSA, Madni SHH, Oluwafemi O (2015) A survey of league championship algorithm: prospects and challenges. Indian Jo Sci Technol 8(S3):101–110

    Article  Google Scholar 

  11. Yang Y-G, Tian J, Lei H, Zhou Y-H, Shi W-M (2016) Novel quantum image encryption using one-dimensional quantum cellular automata. Inf Sci 345:257–270

    Article  Google Scholar 

  12. Dondi R, El-Mabrouk N, Swenson KM (2014) Gene tree correction for reconciliation and species tree inference: complexity and algorithms. J Discrete Algorithms 25:51–65

    Article  MathSciNet  MATH  Google Scholar 

  13. Abdulhamid SM, Latiff MSA, Bashir MB (2014) On-demand grid provisioning using cloud infrastructures and related virtualization tools: a survey and taxonomy. Int J Adv Stud Comput Sci Eng IJASCSE 3(1):49–59

    Google Scholar 

  14. Kushwah VS, Goyal SK, Narwariya P (2014) A survey on various fault tolerant approaches for cloud environment during load balancing. Int J Comput Netw Wirel Mobile Commun 4(6):25–34

    Google Scholar 

  15. Yang W, Zhang C, Shao Y, Shi Y, Li H, Khan M, Hussain F, Khan I, Cui L-J, He H (2014) A hybrid particle swarm optimization algorithm for service selection problem in the cloud. Int J Grid Distrib Comput 7(4):1–10

    Article  Google Scholar 

  16. Hussin M, Lee YC, Zomaya AY (2010) Dynamic job-clustering with different computing priorities for computational resource allocation. In: Proceedings of the 2010 10th IEEE/ACM international conference on cluster, cloud and grid computing, IEEE Computer Society, pp 589–590

  17. Vidhate D, Patil A, Guleria D (2010) Dynamic cluster resource allocations for jobs with known memory demands. In: Proceedings of the international conference and workshop on emerging trends in technology, ACM, pp 64–69

  18. SiM Abdulhamid, Latiff SMA, Bashir MB (2014) Scheduling techniques in on-demand grid as a service cloud: a review. J Theor Appl Inf Technol 63(1):10–19

    Google Scholar 

  19. Abdullahi M, Ngadi MA (2016) Symbiotic organism search optimization based task scheduling in cloud computing environment. Future Gener Comput Syst 56:640–650

    Article  Google Scholar 

  20. Madni SHH, Latiff MSA, Coulibaly Y (2016) An appraisal of meta-heuristic resource allocation techniques for IaaS cloud. Indian J Sci Technol 9(4):1–14. doi:10.17485/ijst/2016/v9i4/80561

    Article  Google Scholar 

  21. Chiroma H, Shuib NLM, Muaz SA, Abubakar AI, Ila LB, Maitama JZ (2015) A review of the applications of bio-inspired flower pollination algorithm. Procedia Comput Sci 62:435–441

    Article  Google Scholar 

  22. Boru D, Kliazovich D, Granelli F, Bouvry P, Zomaya AY (2015) Energy-efficient data replication in cloud computing datacenters. Clust Comput 18(1):385–402. doi:10.1007/s10586-014-0404-x

    Article  Google Scholar 

  23. Gangeshwari R, Subbiah J, Malathy K, Miriam D (2012) HPCLOUD: a novel fault tolerant architectural model for hierarchical MapReduce. In: 2012 international conference on recent trends in information technology (ICRTIT), IEEE, pp 179–184

  24. G\({\c a}\)sior J, Seredyński F (2013) Multi-objective parallel machines scheduling for fault-tolerant cloud systems. In: Joanna K, Di Martino B, Talia D, Xiong K (eds) Algorithms and architectures for parallel processing. Springer, Switzerland, pp 247–256. doi:10.1007/978-3-319-03859-9_21

  25. Ling Y, Ouyang Y, Luo Z (2012) A novel fault-tolerant scheduling algorithm with high reliability in cloud computing systems. J Converg Inf Technol 7(15):107–115. doi:10.4156/jcit.vol7.issue15.13

    Google Scholar 

  26. Tawfeek M, El-Sisi A, Keshk A, Torkey F (2015) Cloud task scheduling based on ant colony optimization. Int Arab J Inf Technol (IAJIT) 12(2):129–137

    Google Scholar 

  27. Ganga K, Karthik S (2013) A fault tolerent approach in scientific workflow systems based on cloud computing. In: 2013 international conference on pattern recognition, informatics and mobile engineering (PRIME), IEEE, pp 387–390

  28. Bala A, Chana I (2015) Autonomic fault tolerant scheduling approach for scientific workflows in Cloud computing. Concurr Eng 23(1):27–39. doi:10.1177/1063293X14567783

    Article  Google Scholar 

  29. Bonvin N, Papaioannou TG, Aberer K (2010) A self-organized, fault-tolerant and scalable replication scheme for cloud storage. In: Proceedings of the 1st ACM symposium on cloud computing, ACM, pp 205–216

  30. Sampaio AM, Barbosa JG (2015) A performance enforcing mechanism for energy-and failure-aware cloud systems. In: 2014 international green computing conference, IGCC 2014. doi:10.1109/IGCC.2014.7039151

  31. Patra PK, Singh H, Singh G (2013) Fault tolerance techniques and comparative implementation in cloud computing. Int J Comput Appl 64(14):37–41

    Google Scholar 

  32. Nawi NM, Khan A, Rehman M, Chiroma H, Herawan T (2015) Weight optimization in recurrent neural networks with hybrid metaheuristic Cuckoo search techniques for data classification. Math Probl Eng 501:868375

    Google Scholar 

  33. Xu H, Yang B, Qi W, Ahene E (2016) A multi-objective optimization approach to workflow scheduling in clouds considering fault recovery. KSII Trans Internet Inf Syst 10(3):976–995. doi:10.3837/tiis.2016.03.002

    Google Scholar 

  34. Kumar VS, Aramudhan M (2014) Hybrid optimized list scheduling and trust based resource selection in cloud computing. J Theor Appl Inf Technol 69(3):434–442

    Google Scholar 

  35. Choi S, Chung K, Yu H (2014) Fault tolerance and QoS scheduling using CAN in mobile social cloud computing. Clust Comput 17(3):911–926

    Article  Google Scholar 

  36. Kaveh A (2014) Particle swarm optimization. In: Advances in metaheuristic algorithms for optimal design of structures. Springer, Switzerland, pp 9–40. doi:10.1007/978-3-319-05549-7

  37. Yuan H, Li C, Du M (2014) Optimal virtual machine resources scheduling based on improved particle swarm optimization in cloud computing. J Softw 9(3):705–708

    Google Scholar 

  38. Verma A, Kaushal S (2014) Bi-criteria priority based particle swarm optimization workflow scheduling algorithm for cloud. In: 2014 recent advances in engineering and computational sciences (RAECS), IEEE, pp 1–6

  39. Ramezani F, Lu J, Hussain FK (2014) Task-based system load balancing in cloud computing using particle swarm optimization. Int J Parallel Program 42(5):739–754

    Article  Google Scholar 

  40. Yang W, Zhang C, Shao Y, Shi Y, Li H, Khan M, Hussain F, Khan I, Cui L-J, He H (2014) A hybrid particle swarm optimization algorithm for service selection problem in the cloud. Int J Grid Distrib Comput 7(4):1–10. doi:10.14257/ijgdc.2014.7.4.01

    Article  Google Scholar 

  41. Wu K (2014) A tunable workflow scheduling algorithm based on particle swarm optimization for cloud computing. Master’s Projects, Paper 358. San José State University, USA

  42. Zhang W, Xie H, Cao B, Cheng AM (2014) Energy-aware real-time task scheduling for heterogeneous multiprocessors with particle swarm optimization algorithm. Math Probl Eng 2014: 1–9. doi:10.1155/2014/287475

    MathSciNet  Google Scholar 

  43. Gao Y, Gupta SK, Wang Y, Pedram M (2014) An energy-aware fault tolerant scheduling framework for soft error resilient cloud computing systems. In: Design, automation and test in Europe conference and exhibition (DATE), 2014, IEEE, pp 1–6

  44. Hu Y, Gong B, Wang F (2010) Cloud model-based security-aware and fault-tolerant job scheduling for computing grid. In: ChinaGrid conference (ChinaGrid), 2010 fifth annual, IEEE, pp 25–30

  45. Qiang W, Jiang C, Ran L, Zou D, Jin H (2015) CDMCR: multi-level fault-tolerant system for distributed applications in cloud. Secur Commun Netw 2015:SCN-SI-077. doi:10.1002/sec.1187

    Google Scholar 

  46. Urgaonkar R, Wang SQ, He T, Zafer M, Chan K, Leung KK (2015) Dynamic service migration and workload scheduling in edge-clouds. Perform Eval 91:205–228. doi:10.1016/j.peva.2015.06.013

    Article  Google Scholar 

  47. Vobugari S, Somayajulu D, Subaraya BM (2015) Dynamic replication algorithm for data replication to improve system availability: a performance engineering approach. IETE J Res 61(2):132–141. doi:10.1080/03772063.2014.988757

    Article  Google Scholar 

  48. Rathore N, Chana I (2014) Load balancing and job migration techniques in grid: a survey of recent trends. Wirel Pers Commun 79(3):2089–2125

    Article  Google Scholar 

  49. Yadav S, Nanda SJ (2015) League championship algorithm for clustering. In: 2015 IEEE power, communication and information technology conference (PCITC), IEEE, pp 321–326

  50. Xu W, Wang R, Yang J (2015) An improved league championship algorithm with free search and its application on production scheduling. J Intell Manuf 1–10. doi:10.1007/s10845-015-1099-4

  51. Buyya R, Ranjan R, Calheiros RN (2009) Modeling and simulation of scalable cloud computing environments and the CloudSim toolkit: challenges and opportunities. In: International conference on high performance computing & simulation, 2009. HPCS’09, IEEE, pp 1–11

  52. Parallel Workload Archive - SDSC-SP2-1998-4.swf (2015). http://www.cs.huji.ac.il/labs/parallel/workload/l_sdsc_sp2/index.html. Accessed 30 Jan 2015

  53. Ramakrishnan L, Reed DA (2008) Performability modeling for scheduling and fault tolerance strategies for scientific workflows. In: Proceedings of the 17th international symposium on High performance distributed computing, ACM, pp 23–34

  54. Garg R, Singh AK (2014) Fault tolerant task scheduling on computational grid using checkpointing under transient faults. Arabian J Sci Eng 39(12):8775–8791

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgments

The authors would like to acknowledge and appreciate the support of Universiti Teknologi Malaysia (UTM), Research University Grant Q. J130000.2528.05H87 and the Nigerian Tertiary Education Trust Fund (TetFund) for their support.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shafi’i Muhammad Abdulhamid.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Abdulhamid, S.M., Abd Latiff, M., Madni, S.H.H. et al. Fault tolerance aware scheduling technique for cloud computing environment using dynamic clustering algorithm. Neural Comput & Applic 29, 279–293 (2018). https://doi.org/10.1007/s00521-016-2448-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-016-2448-8

Keywords

Navigation