Skip to main content

Advertisement

Log in

Performance-aware energy-efficient parallel job scheduling in HPC grid using nature-inspired hybrid meta-heuristics

  • Original Research
  • Published:
Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

Abstract

High-Performance Computing (HPC) systems offer massive computation strength to execute large-scale applications. However, the availability of thousands of CPU cores in the HPC Systems has also triggered a significant increase in the associated energy consumption translating to higher energy expenses of system providers and carbon emissions in the environment. Therefore efficient job schedulers, which can trade-off between user-desired performance and conflicting energy-efficiency objectives simultaneously, are the need of the hour and must nowadays. Job scheduling in HPC systems is a known NP-Hard problem for which meta-heuristics may provide a near-to-optimal solution. Cuckoo search (CS) is a well-known robust swarm-intelligence based meta-heuristic, which has been applied extensively in many optimization problems due to the strong searching efficiency and requirement of very few tuning parameters. However, it suffers from the likelihood of trapping in the local minima and lack of solution diversity towards the end of the algorithm. These drawbacks could result in unacceptable results when the CS algorithm applies to the parallel job scheduling problem. To overcome these limitations and improve the searching efficiency of the traditional CS, we have proposed a multi-objective hybrid scheduling algorithm called MOHCSFA to optimally schedule the batch of parallel jobs in HPC Grid. The proposed MOHCSFA policy combines the solution search mechanisms of both Cuckoo Search (CS) and Firefly algorithm (FA) during each generation. Our proposed policy is further integrated with efficient resource allocation (ERA) heuristic to improve job scheduler performance by effectively using multi-site resource allocation. The experiments are conducted on the GridSim simulator and the benchmarking of the proposed algorithm is done using real data-sets extracted from two supercomputing workload logs. The simulation results showed that the proposed MOHCSFA policy outperforms many heuristics and meta-heuristic scheduling policies for different test cases for both performance and energy-efficiency objectives. Specifically, in the case of Unilu-Gaia workloads, the MOHCSFA obtained 5.87–24.05%, 3.46–28.50%, and 7.06–26.76% performance improvement for the makespan, energy consumption and avg. flowtime, respectively over other tested scheduling policies. The statistical tests validated the stability and robustness of the proposed policy over other scheduling policies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17

Similar content being viewed by others

References

  • Abraham A, Liu H, Zhang W et al (2006) Scheduling jobs on computational grids using fuzzy particle swarm algorithm. In: International conference on knowledge-based and intelligent information and engineering systems. Springer, pp. 500–507

  • Akbari M, Rashidi H, Alizadeh SH (2017) An enhanced genetic algorithm with new operators for task scheduling in heterogeneous computing systems. Eng Appl Artif Intell 61:35–46. https://doi.org/10.1016/j.engappai.2017.02.013

    Article  Google Scholar 

  • Ankita SSK (2020) Evolutionary based hybrid GA for solving multi-objective grid scheduling problem. Microsyst Technol 26:1405–1416. https://doi.org/10.1007/s00542-019-04673-z

    Article  Google Scholar 

  • Aydilek İB (2018) A hybrid firefly and particle swarm optimization algorithm for computationally expensive numerical problems. Appl Soft Comput 66:232–249. https://doi.org/10.1016/j.asoc.2018.02.025

    Article  Google Scholar 

  • Barman S, Goswami R, Das S, Ghosh TK (2017) Job scheduling in computational grid based on an improved cuckoo search method. IJCAT 55:138. https://doi.org/10.1504/IJCAT.2017.10003535

    Article  Google Scholar 

  • Biswas T, Kuila P, Ray AK (2019) A novel scheduling with multi-criteria for high-performance computing systems: an improved genetic algorithm-based approach. Eng Comput 35:1475–1490. https://doi.org/10.1007/s00366-018-0676-5

    Article  Google Scholar 

  • Biswas T, Kuila P, Ray AK (2018) A novel energy efficient scheduling for high performance computing systems. 2018 9th international conference on computing, communication and networking technologies (ICCCNT). IEEE, Bangalore, pp 1–6

    Google Scholar 

  • Blanco H, Llados J, Guirado F et al (2012) Ordering and allocating parallel jobs on multi-cluster systems. In: Proceedings of the 12th international conference on computational and mathematical methods in science and engineering, pp 196–206

  • Blanco H, Lérida JL, Cores F, Guirado F (2011) Multiple job co-allocation strategy for heterogeneous multi-cluster systems based on linear programming. J Supercomput 58:394–402. https://doi.org/10.1007/s11227-011-0596-2

    Article  Google Scholar 

  • Bose A, Biswas T, Kuila P (2019) A novel genetic algorithm based scheduling for multi-core systems. In: Tiwari S et al (eds) Smart innovations in communication and computational sciences, advances in intelligent systems and computing, p 851

  • Braun TD, Siegel HJ, Beck N et al (2001) A comparison of eleven static heuristics for mapping a class of independent tasks onto heterogeneous distributed computing systems. J Parallel Distrib Comput 61:810–837. https://doi.org/10.1006/jpdc.2000.1714

    Article  Google Scholar 

  • Bucur AID, Epema DHJ (2007) Scheduling policies for processor coallocation in multicluster systems. IEEE Trans Parallel Distrib Syst 18:958–972. https://doi.org/10.1109/TPDS.2007.1036

    Article  Google Scholar 

  • Chandio AA, Bilal K, Tziritas N et al (2014) A comparative study on resource allocation and energy efficient job scheduling strategies in large-scale parallel computing systems. Cluster Comput 17:1349–1367. https://doi.org/10.1007/s10586-014-0384-x

    Article  Google Scholar 

  • Deb K, Pratap A, Agarwal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Computat 6:182–197. https://doi.org/10.1109/4235.996017

    Article  Google Scholar 

  • Etinski M, Corbalan J, Labarta J, Valero M (2010) Utilization driven power-aware parallel job scheduling. Comput Sci Res Dev 25:207–216. https://doi.org/10.1007/s00450-010-0129-x

    Article  Google Scholar 

  • Etinski M, Corbalan J, Labarta J, Valero M (2012) Parallel job scheduling for power constrained HPC systems. Parallel Comput 38:615–630. https://doi.org/10.1016/j.parco.2012.08.001

    Article  MathSciNet  Google Scholar 

  • Feitelson D (2005) Parallel workloads archive. https://www.cs.huji.ac.il/labs/parallel/workload. Accessed 05 June 2019

  • Foster I (2013) Kesselman C (2013) The history of the grid computing. IOS Press, Amsterdam, Cloud Computing and Big Data

    Google Scholar 

  • Foster I, Kesselman C, Tuecke S (2001) The anatomy of the grid: enabling scalable virtual organizations. Int J High Perform Comput Appl 15:200–222. https://doi.org/10.1177/109434200101500302

    Article  Google Scholar 

  • Gabaldon E, Guirado F, Lerida J, Planes J (2016) Particle swarm optimization scheduling for energy saving in cluster computing heterogeneous environments. FiCloud Workshops, pp 321–325

  • Gabaldon E, Almenara SV, Guirado F et al (2017) Energy efficient scheduling on heterogeneous federated clusters using a fuzzy multi-objective meta-heuristics. In: IEEE international conference on fuzzy systems (FUZZ-IEEE), pp 1–6

  • Gabaldon E, Lerida JL, Guirado F, Planes J (2017) Blacklist muti-objective genetic algorithm for energy saving in heterogeneous environments. J Supercomput 73:354–369. https://doi.org/10.1007/s11227-016-1866-9

    Article  Google Scholar 

  • Ghasemiyeh R, Moghdani R, Sana SS (2017) A hybrid artificial neural network with metaheuristic algorithms for predicting stock price. Cybern Syst 48:365–392. https://doi.org/10.1080/01969722.2017.1285162

    Article  Google Scholar 

  • Ghosh TK, Das S, Ghoshal N (2020) Job scheduling in computational grid using a hybrid algorithm based on genetic algorithm and particle swarm optimization. In: Castillo O, Jana D, Giri D, Ahmed A (eds) Recent advances in intelligent information systems and applied mathematics. ICITAM 2019. Studies in Computational Intelligence, vol 863. Springer

  • Ghosh TK, Das S (2019) Solving job scheduling problem in computational grid systems using a hybrid algorithm. In: Sarfraz M (ed) Exploring critical approaches of evolutionary computation, pp 310–324. IGI Global

  • Ghosh TK, Das S (2018) A novel hybrid algorithm based on firefly algorithm and differential evolution for job scheduling in computational grid. Int J Distrib Syst Technol 9:1–15. https://doi.org/10.4018/IJDST.2018040101

    Article  Google Scholar 

  • Kamalinia A, Ghaffari A (2017) Hybrid task scheduling method for cloud computing by genetic and DE algorithms. Wirel Pers Commun 97:6301–6323. https://doi.org/10.1007/s11277-017-4839-2

    Article  Google Scholar 

  • Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of IEEE international conference on neural networks. IV. pp 1942–1948

  • Kołodziej J, Khan SU, Wang L et al (2013) Hierarchical genetic-based grid scheduling with energy optimization. Cluster Comput 16:591–609. https://doi.org/10.1007/s10586-012-0226-7

    Article  Google Scholar 

  • Kołodziej J, Khan SU, Wang L et al (2014) Security, energy, and performance-aware resource allocation mechanisms for computational grids. Future Gen Comput Syst 31:77–92. https://doi.org/10.1016/j.future.2012.09.009

    Article  Google Scholar 

  • Krusche P, Tiskin A (2006) Efficient longest common subsequence computation using bulk-synchronous parallelism. In: Gavrilova ML et al (eds) Computational science and its applications: ICCSA 2006. Lecture Notes in Computer Science, vol 3984. Springer, Berlin, Heidelberg, pp 165–174

  • Lerida JL, Solsona F, Hernandez P et al (2013) State-based predictions with self-correction on Enterprise Desktop Grid environments. J Parallel Distrib Comput 73:777–789. https://doi.org/10.1016/j.jpdc.2013.02.007

    Article  Google Scholar 

  • Li Y, Liu Y, Qian D (2009) A heuristic energy-aware scheduling algorithm for heterogeneous clusters. In: 15th international conference on parallel and distributed systems (ICPADS), pp 407–413

  • Liu H, Abraham A, Hassanien AE (2010) Scheduling jobs on computational grids using a fuzzy particle swarm optimization algorithm. Future Gen Comput Syst 26:1336–1343. https://doi.org/10.1016/j.future.2009.05.022

    Article  Google Scholar 

  • Mahato DP (2018) Cuckoo search-ant colony optimization based scheduling in grid computing. In: Proceedings of the 47th international conference on parallel processing companion, ICPP ’18. ACM Press, Eugene, OR, USA, pp 1–10

  • Monk PB, Parrott AK, Wesson PJ (1994) A parallel finite element method for electromagnetic scattering, COMPEL, Supp. A, vol 13, pp 237–242

  • Netto MAS, Calheiros RN, Rodrigues ER et al (2018) HPC cloud for scientific and business applications: taxonomy, vision, and research challenges. ACM Comput Surv 51:1–29. https://doi.org/10.1145/3150224

    Article  Google Scholar 

  • Nibhanupudi M, Norton C, Szymanski B (1995) Plasma simulation on networks of workstations using the bulk synchronous parallel model. In: Proceedings of the international conference on parallel and distributed processing techniques and applications, pp 13–22

  • Patel PS (2014) Multi-objective job scheduler using genetic algorithm in grid computing. IJCA 92:34–43. https://doi.org/10.5120/16079-5312

    Article  Google Scholar 

  • Pooranian Z, Shojafar M, Tavoli R et al (2013) A hybrid metaheuristic algorithm for job scheduling on computational grids. Informatica 37(2):157

    Google Scholar 

  • Pooranian Z, Shojafar M, Abawajy JH, Abraham A (2015) An efficient meta-heuristic algorithm for grid computing. J Comb Optim 30:413–434. https://doi.org/10.1007/s10878-013-9644-6

    Article  MathSciNet  MATH  Google Scholar 

  • Salem FA (2004) A BSP parallel model for the Gottfert Algorithm over F2, parallel processing and applied mathematics, pp 217–224

  • Sana SS, Ospina-Mateus H, Arrieta FG, Chedid JA (2019) Application of genetic algorithm to job scheduling under ergonomic constraints in manufacturing industry. J Ambient Intell Human Comput 10:2063–2090. https://doi.org/10.1007/s12652-018-0814-3

    Article  Google Scholar 

  • Skillicorn DB, Hill JMD, McColl WF (1997) Questions and answers about BSP. Sci Program 6:249–274. https://doi.org/10.1155/1997/532130

    Article  Google Scholar 

  • Sonmez O, Grundeken B, Mohamed H et al (2009) Scheduling strategies for cycle scavenging in multicluster grid systems. In: 9th IEEE/ACM international symposium on cluster computing and the grid, CCGRID, pp 12–19

  • Sonmez O, Mohamed H, Epema D (2010) On the benefit of processor coallocation in multicluster grid systems. IEEE Trans Parallel Distrib Syst 21:778–789. https://doi.org/10.1109/TPDS.2009.121

    Article  Google Scholar 

  • Srichandan S, Ashok Kumar T, Bibhudatta S (2018) Task scheduling for cloud computing using multi-objective hybrid bacteria foraging algorithm. Future Comput Inf J 3:210–230. https://doi.org/10.1016/j.fcij.2018.03.004

    Article  Google Scholar 

  • Stavrinides G, Karatza H (2014) Scheduling real-time jobs in distributed systems: simulation and performance analysis, 1st international workshop on sustainable ultrascale computing systems (NESUS 2014). Porto, Portugal, pp 13–18

    Google Scholar 

  • Stavrinides G, Karatza H (2017) Simulation-Based Performance Evaluation of an Energy-Aware Heuristic for the Scheduling of HPC Applications in Large-Scale Distributed Systems, The 8th ACM/SPEC conference, pp 49–54

  • Switalski P, Seredynski F (2012) A grid scheduling based on generalized external optimization for parallel job model, Parallel processing and applied mathematics. Lect Notes Comput Sci 7204:41–50

    Article  Google Scholar 

  • Switalski P, Seredynski F (2015) Scheduling parallel batch jobs in grids with evolutionary metaheuristics. J Sched 18:345–357. https://doi.org/10.1007/s10951-014-0382-0

    Article  MathSciNet  MATH  Google Scholar 

  • Tang X, Liao X (2018) Application-aware deadline constraint job scheduling mechanism on large-scale computational grid. PLoS ONE 13:e0207596. https://doi.org/10.1371/journal.pone.0207596

    Article  Google Scholar 

  • Tasgetiren FM, Liang Y-C, Sevkli M, Gencyilmaz G (2006) Particle swarm optimization and differential evolution for the single machine total weighted tardiness problem. Int J Prod Res 44:4737–4754. https://doi.org/10.1080/00207540600620849

    Article  MATH  Google Scholar 

  • Wang L, Zhong Y (2015) Cuckoo search algorithm with chaotic maps. Math Probl Eng 2015:1–14. https://doi.org/10.1155/2015/715635

    Article  MathSciNet  MATH  Google Scholar 

  • Wei J, Yu Y (2018) An effective hybrid cuckoo search algorithm for unknown parameters and time delays estimation of chaotic systems. IEEE Access 6:6560–6571. https://doi.org/10.1109/ACCESS.2017.2738006

    Article  Google Scholar 

  • Yan H, Shen XQ, Li X et al (2005) An improved ant algorithm for job scheduling in grid computing. IEEE Int Conf Mach Learn Cybern 5:2957–2961

    Google Scholar 

  • Yang XS, Deb S (2009) Cuckoo search via Lévy flights. In: IEEE World congress on nature and biologically inspired computing, pp 210–214

  • Yang XS, Deb S (2009) Firefly algorithms for multimodal optimization. In: Watanabe O, Zeugmann T (eds) Stochastic algorithms: foundations and applications. SAGA 2009, Lecture Notes in Computer Science, vol. 5792. Springer, Berlin, Heidelberg, pp 169–178

  • Yang XS, Deb S (2010) Engineering optimisation by cuckoo search. Int J Math Model Numer Optim 1(4):330–343

    MATH  Google Scholar 

  • Younis MT, Yang S (2017) Genetic algorithm for independent job scheduling in grid computing. mendel 23:65–72. https://doi.org/10.13164/mendel.2017.1.065

  • Younis MT, Yang S, Passow B (2017) Meta-heuristically seeded genetic algorithm for independent job scheduling in grid computing. In: Squillero G, Sim K (eds) Applications of evolutionary computation. Springer International Publishing, Cham, pp 177–189

    Chapter  Google Scholar 

  • Younis MT, Yang S (2018) Hybrid meta-heuristic algorithms for independent job scheduling in grid computing. Appl Soft Comput 72:498–517. https://doi.org/10.1016/j.asoc.2018.05.032

    Article  Google Scholar 

  • Zhang L, Liu L, Yang X-S, Dai Y (2016) A novel hybrid firefly algorithm for global optimization. PLoS ONE 11:e0163230. https://doi.org/10.1371/journal.pone.0163230

    Article  Google Scholar 

  • Zhang L, Chen Y, Sun R, Yang B (2008) A task scheduling algorithm based on PSO for grid computing. IJCIR. https://doi.org/10.5019/j.ijcir.2008.123

    Article  Google Scholar 

  • Xhafa F, Carretero JS, Abraham A (2007) Genetic algorithm based schedulers for grid computing systems. Int J Innov Comput Inf Control 3(6):1–19

    Google Scholar 

Download references

Funding

This research received no external funding.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Amit Chhabra.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chhabra, A., Singh, G. & Kahlon, K.S. Performance-aware energy-efficient parallel job scheduling in HPC grid using nature-inspired hybrid meta-heuristics. J Ambient Intell Human Comput 12, 1801–1835 (2021). https://doi.org/10.1007/s12652-020-02255-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12652-020-02255-w

Keywords

Navigation