Abstract
Scientific workflows are used to process large amounts of data and perform complex analyses; thus, they require powerful computing resources to produce the desired results in an acceptable time and at reasonable costs. For this purpose, distributed resources such as cloud computing, with access to virtualized, infinite, and elastic resources are used to execute the workflows. For mapping tasks to computational resources, the problem must be modeled as a scheduling problem. The algorithm presented in this research is a hybrid algorithm based on a mathematical model called MHPSLP that performs the scheduling problem by breaking the problem into smaller subsets including scheduling bags of tasks, providing resources using an mixed integer linear mathematical (MILP) model. The benefit of this method against compared scheduling algorithms is reduction of executed task’s cost in a deadline constraint.











Similar content being viewed by others
Data availability
In this paper, we used scientific workflows for our experimentations as presented in Sect. 5 in our paper, including Montage, Epigenomics, Sipht, and Ligo. These workflows with a different number of tasks collect as a zip file and put as a public and downloadable link in Git at the address given in https://github.com/Baran7292/paper. Although, we used the workflows with approximately 1000 tasks in our experiment. Also, I put other workflows like Inspiral and Cybershake in different sizes for further experimentation that could help other authors to access a wide range of workflows. If anybody needs some explanation about these datasets, contact me with the Email: malihe.hariri@alumni.um.ac.ir
References
Rodriguez MA, Buyya R (2015) A responsive knapsack-based algorithm for resource provisioning and scheduling of scientific workflows in clouds. In: 2015 44th international conference on parallel processing (ICPP). IEEE, pp 839–848
Thai L, Varghese B, Barker A (2018) A survey and taxonomy of resource optimisation for executing bag-of-task applications on public clouds. Futur Gener Comput Syst 82:1–11
Rodriguez MA, Buyya R (2017) A taxonomy and survey on scheduling algorithms for scientific workflows in IaaS cloud computing environments. Concurr Comput Pract Exp 29(8):e4041
Smanchat S, Viriyapant K (2015) Taxonomies of workflow scheduling problem and techniques in the cloud. Futur Gener Comput Syst 52:1–12
Alkhanak EN, Lee SP, Khan SUR (2015) Cost-aware challenges for workflow scheduling approaches in cloud computing environments: taxonomy and opportunities. Futur Gener Comput Syst 50:3–21
Wu F, Wu Q, Tan Y (2015) Workflow scheduling in cloud: a survey. J Supercomput 71(9):3373–3418
Abrishami S, Naghibzadeh M, Epema DH (2013) Deadline-constrained workflow scheduling algorithms for infrastructure as a service clouds. Futur Gener Comput Syst 29(1):158–169
Calheiros RN, Buyya R (2014) Meeting deadlines of scientific workflows in public clouds with tasks replication. IEEE Trans Parallel Distrib Syst 25(7):1787–1796
Alejandra RM, Buyya R (2014) Deadline based resource provisioning and scheduling algorithm for scientific workflows on clouds. IEEE Trans Cloud Comput 2(2):222–235
Malawski M, Figiela K, Bubak M, Deelman E, Nabrzyski J (2015) Scheduling multilevel deadline-constrained scientific workflows on clouds based on cost optimization. Sci Program 2015:14
Mao M, Humphrey M (2011) Auto-scaling to minimize cost and meet application deadlines in cloud workflows. In: International conference on high performance computing, networking, storage and analysis (SC). IEEE, pp 1–12
Van den Bossche R, Vanmechelen K, Broeckhove J (2013) Online cost-efficient scheduling of deadline-constrained workloads on hybrid clouds. Futur Gener Comput Syst 29(4):973–985
Kumar M, Sharma S (2017) Dynamic load balancing algorithm for balancing the workload among virtual machine in cloud computing. Procedia Comput Sci 115:322–329
Byun E-K, Kee Y-S, Kim J-S, Maeng S (2011) Cost optimized provisioning of elastic resources for application workflows. Futur Gener Comput Syst 27(8):1011–1026
Rodriguez MA, Buyya R (2017) Budget-driven scheduling of scientific workflows in IaaS clouds with fine-grained billing periods. ACM Trans Auton Adapt Syst 12(2):22
Juve G, Chervenak A, Deelman E, Bharathi S, Mehta G, Vahi K (2013) Characterizing and profiling scientific workflows. Futur Gener Comput Syst 29(3):682–692
Bharathi S et al (2008) Characterization of scientific workflows. In: 2008 third workshop on workflows in support of large-scale science. IEEE
Berriman G, Laity A, Good J, Jacob J, Katz D, Deelman E, Singh G, Su M, Prince T (2006) Montage: the architecture and scientific applications of a national virtual observatory service for computing astronomical image mosaics. In: Proceedings of Earth Sciences Technology Conference
Livny J, Teonadi H, Livny M, Waldor MK (2008) High-throughput, kingdom-wide prediction and annotation of bacterial non-coding RNAs. PLoS ONE 3(9):e3197
USC Epigenome Center. http://epigenome.usc.edu. Accessed Oct 2015
Abramovici A, Althouse WE, Drever RW, Gürsel Y, Kawamura S, Raab FJ, Shoemaker D, Sievers L, Spero RE, Thorne KS et al (1992) Ligo: the laser interferometer gravitational-wave observatory. Science 256(5055):325–333
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Hariri, M., Nouri-Baygi, M. & Abrishami, S. A hybrid algorithm for scheduling scientific workflows in IaaS cloud with deadline constraint. J Supercomput 78, 16975–16996 (2022). https://doi.org/10.1007/s11227-022-04563-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-022-04563-8