Skip to main content

Advertisement

Log in

A hybrid genetic algorithm for scientific workflow scheduling in cloud environment

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Nowadays, we live an unprecedented evolution in cloud computing technology that coincides with the development of the vast amount of complex interdependent data which make up the scientific workflows. All these circumstances developments have made the issue of workflow scheduling very important and of absolute priority to all overlapping parties as the provider and customer. For that, work must be focused on finding the best strategy for allocating workflow tasks to available computing resources. In this paper, we consider the scientific workflow scheduling in cloud computing. The main role of our model is to optimize the time needed to run a set of interdependent tasks in cloud and in turn reduces the computational cost while meeting deadline and budget. To this end, we offer a hybrid approach based on genetic algorithm for modelling and optimizing a workflow-scheduling problem in cloud computing. The heterogeneous earliest finish time (HEFT), an heuristic model, intervenes in the generation of the initial population. Based on results obtained from our simulations using real-world scientific workflow datasets, we demonstrate that the proposed approach outperforms existing HEFT and other strategies examined in this paper. In other words, experiments show high efficiency of our proposed approach, which makes it potentially applicable for cloud workflow scheduling. For this, we develop a GA-based module that was integrated to the WorkflowSim framework based on CloudSim.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Explore related subjects

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

Notes

  1. Description of an abstract workflow in eXtended Markup Language (XML) format that is used as the primary input into Pegasus.

  2. General text-oriented document format.

References

  1. Poola D, Ramamohanarao K, Buyya R (2014) Fault-tolerant workflow scheduling using spot instances on clouds. Procedia Comput Sci 29(12):523–533. https://doi.org/10.1016/j.procs.2014.05.047

    Article  Google Scholar 

  2. Wang J, Abdelbaky M, Diaz-Montes J, Purawat S, Parashar M, Altintas I (2016) Kepler + cometcloud: dynamic scientific workflow execution on federated cloud resources. Procedia Comput Sci 80(12):700–711. https://doi.org/10.1016/j.procs.2016.05.363

    Article  Google Scholar 

  3. Alkhanak EN, Lee SP, Rezaei R, Parizi RM (2016) Cost optimization approaches for scientific workflow scheduling in cloud and grid computing: a review, classifications, and open issues. J Syst Softw 113:1–26. https://doi.org/10.1016/j.jss.2015.11.023

    Article  Google Scholar 

  4. Aziza H, Krichen S (2018) Bi-objective decision support system for task-scheduling based on genetic algorithm in cloud computing. Computing 100(2):65–91. https://doi.org/10.1007/s00607-017-0566-5

    Article  MathSciNet  Google Scholar 

  5. Jiang Q, Lee YC, Arenaz M, Leslie LM, Zomaya AY (2014) Optimizing scientific workflows in the cloud: a montage example. In: 2014 IEEE/ACM 7th international conference on utility and cloud computing, December 2014, pp 517–522. https://doi.org/10.1109/UCC.2014.77

  6. Xiang B, Zhang B, Zhang L (2017) Greedy-ant: ant colony system-inspired workflow scheduling for heterogeneous computing. IEEE Access 5:11404–11412. https://doi.org/10.1109/ACCESS.2017.2715279

    Article  Google Scholar 

  7. Chirkin AM, Belloum ASZ, Kovalchuk SV, Makkes MX, Melnik MA, Visheratin AA, Nasonov DA (2017) Execution time estimation for workflow scheduling. Future Gener Comput Syst 75:376–387. https://doi.org/10.1016/j.future.2017.01.011

    Article  Google Scholar 

  8. Visheratin AA, Melnik M, Nasonov D (2016) Workflow scheduling algorithms for hard-deadline constrained cloud environments. Procedia Comput Sci 80:2098–2106. https://doi.org/10.1016/j.procs.2016.05.529 International conference on computational science 2016, ICCS 2016, 6–8 June 2016, San Diego, California, USA

    Article  Google Scholar 

  9. Visheratin A, Melnik M, Butakov N, Nasonov D (2015) Hard-deadline constrained workflows scheduling using metaheuristic algorithms. Procedia Comput Sci 66:506–514. https://doi.org/10.1016/j.procs.2015.11.057 4th international young scientist conference on computational science

    Article  Google Scholar 

  10. Calheiros RN, Buyya R (2014) Meeting deadlines of scientific workflows in public clouds with tasks replication. IEEE Trans Parallel Distrib Syst 25(7):1787–1796. https://doi.org/10.1109/TPDS.2013.238

    Article  Google Scholar 

  11. Li X, Cai Z (2015) Elastic resource provisioning for cloud workflow applications. IEEE Trans Autom Sci Eng 14(1–16):12. https://doi.org/10.1109/TASE.2015.2500574

    Article  Google Scholar 

  12. Zhou AC, He B, Liu C (2016) Monetary cost optimizations for hosting workflow-as-a-service in IaaS clouds. IEEE Trans Cloud Comput 4(1):34–48. https://doi.org/10.1109/TCC.2015.2404807

    Article  Google Scholar 

  13. Zhu Z, Zhang G, Li M, Liu X (2016) Evolutionary multi-objective workflow scheduling in cloud. IEEE Trans Parallel Distrib Syst 27(5):1344–1357. https://doi.org/10.1109/TPDS.2015.2446459

    Article  Google Scholar 

  14. Casas I, Taheri J, Ranjan R, Wang L, Zomaya AY (2018) GA-ETI: an enhanced genetic algorithm for the scheduling of scientific workflows in cloud environments. J Comput Sci 26:318–331. https://doi.org/10.1016/j.jocs.2016.08.007

    Article  Google Scholar 

  15. Zhang F, Cao J, Li K, Khan SU, Hwang K (2014) Multi-objective scheduling of many tasks in cloud platforms. Future Gener Comput Syst 37:309–320. https://doi.org/10.1016/j.future.2013.09.006 Special section: innovative methods and algorithms for advanced data-intensive computing. Special section: semantics, intelligent processing and services for big data. Special section: advances in data-intensive modelling and simulation. Special section: hybrid intelligence for growing internet and its applications

    Article  Google Scholar 

  16. Meena J, Kumar M, Vardhan M (2016) Cost effective genetic algorithm for workflow scheduling in cloud under deadline constraint. IEEE Access 4:5065–5082. https://doi.org/10.1109/ACCESS.2016.2593903

    Article  Google Scholar 

  17. Wu Q, Ishikawa F, Zhu Q, Xia Y, Wen J (2017) Deadline-constrained cost optimization approaches for workflow scheduling in clouds. IEEE Trans Parallel Distrib Syst 28(12):3401–3412. https://doi.org/10.1109/TPDS.2017.2735400

    Article  Google Scholar 

  18. Rodriguez MA, Buyya R (2014) Deadline based resource provisioning and scheduling algorithm for scientific workflows on clouds. IEEE Trans Cloud Comput 2(2):222–235. https://doi.org/10.1109/TCC.2014.2314655

    Article  Google Scholar 

  19. Haidri RA, Katti CP, Saxen PC (2017) Cost effective deadline aware scheduling strategy for workflow applications on virtual machines in cloud computing. J King Saud Univ Comput Inf Sci. https://doi.org/10.1016/j.jksuci.2017.10.009

    Article  Google Scholar 

  20. Gharooni-fard G, Moein-darbari F, Deldari H, Morvaridi A (2010) Scheduling of scientific workflows using a chaos-genetic algorithm. Procedia Comput Sci 1(1):1445–1454. https://doi.org/10.1016/j.procs.2010.04.160. ICCS 2010

    Article  Google Scholar 

  21. Shishido HY, Estrella JC, Toledo CFM, Arantes MS (2018) Genetic-based algorithms applied to a workflow scheduling algorithm with security and deadline constraints in clouds. Comput Electr Eng 69:378–394. https://doi.org/10.1016/j.compeleceng.2017.12.004

    Article  Google Scholar 

  22. Ghafarian T, Javadi B, Buyya R (2016) Multi-objective scheduling of scientific workflows in multisite clouds. Future Gener Comput Syst 63:76–95. https://doi.org/10.1016/j.future.2016.04.014 Modeling and management for big data analytics and visualization

    Article  Google Scholar 

  23. Lee YC, Han H, Zomaya AY, Yousif M (2015) Resource-efficient workflow scheduling in clouds. Knowl Based Syst 80:153–162. https://doi.org/10.1016/j.knosys.2015.02.012 25th anniversary of knowledge-based systems

    Article  Google Scholar 

  24. Sahni J, Vidyarthi DP (2016) Workflow-and-platform aware task clustering for scientific workflow execution in cloud environment. Future Gener Comput Syst 64:61–74. https://doi.org/10.1016/j.future.2016.05.008

    Article  Google Scholar 

  25. Zhang F, Cao J, Hwang K, Li K, Khan SU (2015) Adaptive workflow scheduling on cloud computing platforms with iterative ordinal optimization. IEEE Trans Cloud Comput 3(2):156–168. https://doi.org/10.1109/TCC.2014.2350490

    Article  Google Scholar 

  26. Fard HM, Prodan R, Fahringer T (2014) Multi-objective list scheduling of workflow applications in distributed computing infrastructures. J Parallel Distrib Comput 74(3):2152–2165. https://doi.org/10.1016/j.jpdc.2013.12.004

    Article  MATH  Google Scholar 

  27. Wang X, Yeo CS, Buyya R, Jinshu S (2011) Optimizing the makespan and reliability for workflow applications with reputation and a look-ahead genetic algorithm. Future Gener Comput Syst 27(8):1124–1134. https://doi.org/10.1016/j.future.2011.03.008

    Article  Google Scholar 

  28. Vinay K, Dilip Kumar SM (2016) Auto-scaling for deadline constrained scientific workflows in cloud environment. In: 2016 IEEE annual India conference (INDICON), December 2016, pp 1–6. https://doi.org/10.1109/INDICON.2016.7838908

  29. Malawski M, Juve G, Deelman E, Nabrzyski J (2015) Algorithms for cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds. Future Gener Comput Syst 48:1–18. https://doi.org/10.1016/j.future.2015.01.004 Special section: business and industry specific cloud

    Article  Google Scholar 

  30. Ghafarian T, Javadi B, Buyya R (2015) Decentralised workflow scheduling in volunteer computing systems. Int J Parallel Emerg Distrib Syst 30(5):343–365. https://doi.org/10.1080/17445760.2014.973876

    Article  Google Scholar 

  31. Workflow management system (2018) https://pegasus.isi.edu/

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hatem Aziza.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Aziza, H., Krichen, S. A hybrid genetic algorithm for scientific workflow scheduling in cloud environment. Neural Comput & Applic 32, 15263–15278 (2020). https://doi.org/10.1007/s00521-020-04878-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-020-04878-8

Keywords