Abstract
This paper proposes two backfill scheduling optimizations: Shortest Width First Backfill (SWFBF) and Shortest Area First Backfill (SAFBF). It also presents Select-Replace, a simple, near-optimal job packing algorithm intended to minimize external fragmentation. A simulation study provides proof of concept. Five workloads derived from a clean version of the Parallel Workloads Archive (CTC, LANL, SDSC, and NASA) are used to evaluate the proposed heuristics and compare them with previous techniques. These simple optimizations achieve a significant performance improvement (56.1%) over the EASY scheduler.
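To make the two orderings concrete, the following is a minimal sketch and not the authors' implementation. It assumes that "width" means the number of processors a job requests and that "area" means width multiplied by the user's runtime estimate; the abstract does not define either term. The names `Job`, `pick_backfill`, `swfbf`, and `safbf` are hypothetical. The fit test is a simplified form of the EASY backfill condition (a job must fit in the idle processors and finish before the head job's reservation), and the Select-Replace packing step is not modeled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Job:
    name: str
    width: int     # processors requested (assumed meaning of "width")
    estimate: int  # user-estimated runtime in seconds

    @property
    def area(self) -> int:
        # Assumed meaning of "area": processors x estimated runtime.
        return self.width * self.estimate


def pick_backfill(queue, free, window, key):
    """Greedily pick waiting jobs that fit into the current hole.

    free   -- processors idle right now
    window -- seconds until the head-of-queue job's reservation starts
    key    -- sort key that decides which candidates are tried first
    """
    chosen = []
    for job in sorted(queue, key=key):
        # Simplified EASY condition: fit in the idle processors and
        # finish (by estimate) before the reservation begins.
        if job.width <= free and job.estimate <= window:
            chosen.append(job)
            free -= job.width
    return chosen


def swfbf(queue, free, window):
    # Narrowest job first; ties broken by shorter estimate.
    return pick_backfill(queue, free, window, key=lambda j: (j.width, j.estimate))


def safbf(queue, free, window):
    # Smallest area first; ties broken by narrower job.
    return pick_backfill(queue, free, window, key=lambda j: (j.area, j.width))
```

Both functions share the same fit test and differ only in the sort key, so on the same queue they can select different jobs for the same hole.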
Cite this article
Hussain Shah, S.M., Qureshi, K. & Rasheed, H. Optimal job packing, a backfill scheduling optimization for a cluster of workstations. J Supercomput 54, 381–399 (2010). https://doi.org/10.1007/s11227-009-0332-3
DOI: https://doi.org/10.1007/s11227-009-0332-3