Abstract
Nowadays, we live an unprecedented evolution in cloud computing technology that coincides with the development of the vast amount of complex interdependent data which make up the scientific workflows. All these circumstances developments have made the issue of workflow scheduling very important and of absolute priority to all overlapping parties as the provider and customer. For that, work must be focused on finding the best strategy for allocating workflow tasks to available computing resources. In this paper, we consider the scientific workflow scheduling in cloud computing. The main role of our model is to optimize the time needed to run a set of interdependent tasks in cloud and in turn reduces the computational cost while meeting deadline and budget. To this end, we offer a hybrid approach based on genetic algorithm for modelling and optimizing a workflow-scheduling problem in cloud computing. The heterogeneous earliest finish time (HEFT), an heuristic model, intervenes in the generation of the initial population. Based on results obtained from our simulations using real-world scientific workflow datasets, we demonstrate that the proposed approach outperforms existing HEFT and other strategies examined in this paper. In other words, experiments show high efficiency of our proposed approach, which makes it potentially applicable for cloud workflow scheduling. For this, we develop a GA-based module that was integrated to the WorkflowSim framework based on CloudSim.














Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.Notes
Description of an abstract workflow in eXtended Markup Language (XML) format that is used as the primary input into Pegasus.
General text-oriented document format.
References
Poola D, Ramamohanarao K, Buyya R (2014) Fault-tolerant workflow scheduling using spot instances on clouds. Procedia Comput Sci 29(12):523–533. https://doi.org/10.1016/j.procs.2014.05.047
Wang J, Abdelbaky M, Diaz-Montes J, Purawat S, Parashar M, Altintas I (2016) Kepler + cometcloud: dynamic scientific workflow execution on federated cloud resources. Procedia Comput Sci 80(12):700–711. https://doi.org/10.1016/j.procs.2016.05.363
Alkhanak EN, Lee SP, Rezaei R, Parizi RM (2016) Cost optimization approaches for scientific workflow scheduling in cloud and grid computing: a review, classifications, and open issues. J Syst Softw 113:1–26. https://doi.org/10.1016/j.jss.2015.11.023
Aziza H, Krichen S (2018) Bi-objective decision support system for task-scheduling based on genetic algorithm in cloud computing. Computing 100(2):65–91. https://doi.org/10.1007/s00607-017-0566-5
Jiang Q, Lee YC, Arenaz M, Leslie LM, Zomaya AY (2014) Optimizing scientific workflows in the cloud: a montage example. In: 2014 IEEE/ACM 7th international conference on utility and cloud computing, December 2014, pp 517–522. https://doi.org/10.1109/UCC.2014.77
Xiang B, Zhang B, Zhang L (2017) Greedy-ant: ant colony system-inspired workflow scheduling for heterogeneous computing. IEEE Access 5:11404–11412. https://doi.org/10.1109/ACCESS.2017.2715279
Chirkin AM, Belloum ASZ, Kovalchuk SV, Makkes MX, Melnik MA, Visheratin AA, Nasonov DA (2017) Execution time estimation for workflow scheduling. Future Gener Comput Syst 75:376–387. https://doi.org/10.1016/j.future.2017.01.011
Visheratin AA, Melnik M, Nasonov D (2016) Workflow scheduling algorithms for hard-deadline constrained cloud environments. Procedia Comput Sci 80:2098–2106. https://doi.org/10.1016/j.procs.2016.05.529 International conference on computational science 2016, ICCS 2016, 6–8 June 2016, San Diego, California, USA
Visheratin A, Melnik M, Butakov N, Nasonov D (2015) Hard-deadline constrained workflows scheduling using metaheuristic algorithms. Procedia Comput Sci 66:506–514. https://doi.org/10.1016/j.procs.2015.11.057 4th international young scientist conference on computational science
Calheiros RN, Buyya R (2014) Meeting deadlines of scientific workflows in public clouds with tasks replication. IEEE Trans Parallel Distrib Syst 25(7):1787–1796. https://doi.org/10.1109/TPDS.2013.238
Li X, Cai Z (2015) Elastic resource provisioning for cloud workflow applications. IEEE Trans Autom Sci Eng 14(1–16):12. https://doi.org/10.1109/TASE.2015.2500574
Zhou AC, He B, Liu C (2016) Monetary cost optimizations for hosting workflow-as-a-service in IaaS clouds. IEEE Trans Cloud Comput 4(1):34–48. https://doi.org/10.1109/TCC.2015.2404807
Zhu Z, Zhang G, Li M, Liu X (2016) Evolutionary multi-objective workflow scheduling in cloud. IEEE Trans Parallel Distrib Syst 27(5):1344–1357. https://doi.org/10.1109/TPDS.2015.2446459
Casas I, Taheri J, Ranjan R, Wang L, Zomaya AY (2018) GA-ETI: an enhanced genetic algorithm for the scheduling of scientific workflows in cloud environments. J Comput Sci 26:318–331. https://doi.org/10.1016/j.jocs.2016.08.007
Zhang F, Cao J, Li K, Khan SU, Hwang K (2014) Multi-objective scheduling of many tasks in cloud platforms. Future Gener Comput Syst 37:309–320. https://doi.org/10.1016/j.future.2013.09.006 Special section: innovative methods and algorithms for advanced data-intensive computing. Special section: semantics, intelligent processing and services for big data. Special section: advances in data-intensive modelling and simulation. Special section: hybrid intelligence for growing internet and its applications
Meena J, Kumar M, Vardhan M (2016) Cost effective genetic algorithm for workflow scheduling in cloud under deadline constraint. IEEE Access 4:5065–5082. https://doi.org/10.1109/ACCESS.2016.2593903
Wu Q, Ishikawa F, Zhu Q, Xia Y, Wen J (2017) Deadline-constrained cost optimization approaches for workflow scheduling in clouds. IEEE Trans Parallel Distrib Syst 28(12):3401–3412. https://doi.org/10.1109/TPDS.2017.2735400
Rodriguez MA, Buyya R (2014) Deadline based resource provisioning and scheduling algorithm for scientific workflows on clouds. IEEE Trans Cloud Comput 2(2):222–235. https://doi.org/10.1109/TCC.2014.2314655
Haidri RA, Katti CP, Saxen PC (2017) Cost effective deadline aware scheduling strategy for workflow applications on virtual machines in cloud computing. J King Saud Univ Comput Inf Sci. https://doi.org/10.1016/j.jksuci.2017.10.009
Gharooni-fard G, Moein-darbari F, Deldari H, Morvaridi A (2010) Scheduling of scientific workflows using a chaos-genetic algorithm. Procedia Comput Sci 1(1):1445–1454. https://doi.org/10.1016/j.procs.2010.04.160. ICCS 2010
Shishido HY, Estrella JC, Toledo CFM, Arantes MS (2018) Genetic-based algorithms applied to a workflow scheduling algorithm with security and deadline constraints in clouds. Comput Electr Eng 69:378–394. https://doi.org/10.1016/j.compeleceng.2017.12.004
Ghafarian T, Javadi B, Buyya R (2016) Multi-objective scheduling of scientific workflows in multisite clouds. Future Gener Comput Syst 63:76–95. https://doi.org/10.1016/j.future.2016.04.014 Modeling and management for big data analytics and visualization
Lee YC, Han H, Zomaya AY, Yousif M (2015) Resource-efficient workflow scheduling in clouds. Knowl Based Syst 80:153–162. https://doi.org/10.1016/j.knosys.2015.02.012 25th anniversary of knowledge-based systems
Sahni J, Vidyarthi DP (2016) Workflow-and-platform aware task clustering for scientific workflow execution in cloud environment. Future Gener Comput Syst 64:61–74. https://doi.org/10.1016/j.future.2016.05.008
Zhang F, Cao J, Hwang K, Li K, Khan SU (2015) Adaptive workflow scheduling on cloud computing platforms with iterative ordinal optimization. IEEE Trans Cloud Comput 3(2):156–168. https://doi.org/10.1109/TCC.2014.2350490
Fard HM, Prodan R, Fahringer T (2014) Multi-objective list scheduling of workflow applications in distributed computing infrastructures. J Parallel Distrib Comput 74(3):2152–2165. https://doi.org/10.1016/j.jpdc.2013.12.004
Wang X, Yeo CS, Buyya R, Jinshu S (2011) Optimizing the makespan and reliability for workflow applications with reputation and a look-ahead genetic algorithm. Future Gener Comput Syst 27(8):1124–1134. https://doi.org/10.1016/j.future.2011.03.008
Vinay K, Dilip Kumar SM (2016) Auto-scaling for deadline constrained scientific workflows in cloud environment. In: 2016 IEEE annual India conference (INDICON), December 2016, pp 1–6. https://doi.org/10.1109/INDICON.2016.7838908
Malawski M, Juve G, Deelman E, Nabrzyski J (2015) Algorithms for cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS clouds. Future Gener Comput Syst 48:1–18. https://doi.org/10.1016/j.future.2015.01.004 Special section: business and industry specific cloud
Ghafarian T, Javadi B, Buyya R (2015) Decentralised workflow scheduling in volunteer computing systems. Int J Parallel Emerg Distrib Syst 30(5):343–365. https://doi.org/10.1080/17445760.2014.973876
Workflow management system (2018) https://pegasus.isi.edu/
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Aziza, H., Krichen, S. A hybrid genetic algorithm for scientific workflow scheduling in cloud environment. Neural Comput & Applic 32, 15263–15278 (2020). https://doi.org/10.1007/s00521-020-04878-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-020-04878-8