skip to main content
10.1145/1088149.1088191acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
Article

Another approach to backfilled jobs: applying virtual malleability to expired windows

Published:20 June 2005Publication History

ABSTRACT

An efficient job scheduling must ensure high throughput and good performance. Moreover in highly parallel systems where processors are a critical resource, high machine utilization becomes an essential aspect.Backfilling consists on moving jobs ahead in the queue, given that they do not delay certain previously submitted jobs. When the execution time of a backfilled job was underestimated, some action has to be taken with it: abort, suspend/resume, checkpoint/restart, remain executing.In this paper we propose an alternative choice for that situation which consists on apply Virtual Malleability to the backfilled job. This means that its processors partition will be reduced, and as MPI jobs aren't really malleable, we make the job contend with itself for the use of processors by applying Co-scheduling. In this way resources are freed and the job at the head of the queue have a chance to start executing. In addition to this, as MPI parallel jobs can be Moldable, we add this possibility to the scheme.We obtained better performance than traditional backfilling in about 25 %, especially in high machine utilization. We claim also for the portability of our technique which does not requires special support from the operating system as checkpointing does.

References

  1. W. Cirne. Using Moldability to Improve the Performance of Supercomputer Jobs. Ph.D Thesis. Computer Science and Eng. University of California San Diego, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. Bailey, T. Harris, W. Saphir, R. Wijngaart, A. Woo and M. Yarrow, "The NAS Parallel Benchmarks 2.0", Technical Report NAS-95-020, NASA, December 1995.]]Google ScholarGoogle Scholar
  3. M. V. Devarakonda, R. Iyer. Predictability of Process Resource Usage: A Measurement Based Study on UNIX. IEEE Trans. Soft. Eng. 15(12), pp. 1579--1586, Dec. 1989.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. Downey. A Model for Speedup of Parallel Programs. Technical Report CSD-97-933. University of California at Berkerley, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. D. G. Feitelson. Logs of real parallel workloads from production systems. http://www.cs.hujiac.il/labs/parallel/workload/logs.html.]]Google ScholarGoogle Scholar
  6. D. G. Feitelson and M. A. Jette. Improved Utilization and Responsiveness with Gang Scheduling. Job Scheduling Strategies for Parallel Processing, volume 1291 of Lecture Notes in Computer Science. Springer-Verlag 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. G. Feitelson, B. Nitzberg. Jobs Characteristis of a Production Parallel Scientific Workload on the NASA Ames Ipsc/860, in JSSPP Springer-Verlag, Lectures Notes in Computer Science, vol. 949, pp. 337--360, 1995.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. G. Feitelson, L. Rudolph, U. Schiwiegelshohn, K. Sevcik and P. Wong. Theory and Practice in Parallel Job Scheduling. Lecture Notes in Computer Science, 1291:1--34, 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. D. Jackson, Q. Snell and M. Clement. Core Algorithms of the Maui Scheduler. In Worshop on Job Sched Strategies for Parallel Processing, pp. 87--102, 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. M. Weil and D. Feitelson. Utilization, Predictabiligy, Workloads and User Runtimes Estimates in Scheduling the IBM SP2 with Backfilling, In IEEE Trans. on Parallel and Distributed Syst. 12(6), pp. 529--543, Jun. 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. E. Frachtenberg, D. Feitelson, J. Fernández, F. Petrini. Parallel Job Scheduling Under Dynamic Workloads. JSSPP 2003.]]Google ScholarGoogle Scholar
  12. E. Frachtenberg, D. G. Feitelson, F. Petrini, and J. Fernandez, "Flexible coscheduling: mitigating load imbalance and improving utilization of heterogeneous resources", In 17th Intl. Parallel & Distributed Processing Symp., Apr 2003.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. A. Gupta, A. Tucker, and S. Urushibara. The Impact of Operating System Scheduling Policies and Synchronization Methods on the Performance of Parallel Jobs. In Proceedings of the 1991 ACM SIGMETRICS Conference, pp 120--132, May 1991.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. B. Lawson and E. Smirni. Multiple-queue Backfilling Scheduling with Priorities and Reservations for Parallel Systems. In Job Sched, Strategies for Parallel Processing, D. G. Feitelson and L. Rudolph (eds.), Springer Verlag, Lect. Notes Comp. Sc. Vol. 2537, pp. 72--87, 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. D. Lifka. The ANL/IBM SP scheduling system. In Job Scheduling Strategies for Parallel Processing, pp. 295--303, Springer Verlag, 1995 (LNCS 949).]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. X. Martorell, J. Corbalán, Dimitrios S. Nikolopoulos, Nacho Navarro, Eleftherios D. Polychronopoulos, Theodore S. Papatheodorou, Jesús Labarta: A Tool to Schedule Parallel Applications on Multiprocessors: The NANOS CPU MANAGER. JSSPP 2000: 87--112.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Message Passing Interface Forum. MPI: A Message-Passing Interface standard. Int. Journal of SuperComputer Jobs, 8(3/4):165--414, 1994.]]Google ScholarGoogle Scholar
  18. Q. Snell, Mark J. Clement, David B. Jackson: Preemption Based Backfill. JSSPP 2002: 24--37.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. V. Sarkar. Determining Average Program Execution Times and Their Variance. In Proc. SIGPLAN Conf, Prog. Lang. Dessign and Implementation, pp. 298--312, Jun 1989.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Albert Serra, Nacho Navarro, and Toni Cortes. DITools: Application-level support for dynamic extension and flexible composition. In Proc. USENIX Annual Technical Conf., pp 225--238, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. E. Shmueli, D. Feitelson, Backfilling with Lookahead to Optimize the Performance of Parallel Job Scheduling. Springer Verlag 2003. Lectures Notes Comp. Science.]]Google ScholarGoogle Scholar
  22. Silicon Graphics, Inc. IRIX Admin: Resource Administration, Document number 007-3700-005, http://techpubs.sgi.com, 2000.]]Google ScholarGoogle Scholar
  23. S. Srinivasan, R. Kettimuthu, V. Subramani, P. Sadayappan. Characterization of Backfilling strategies for Parallel Job Scheduling. In Proc. of 2002 Intl. Workshops on Parallel Proc, Aug, 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. S. Srinivasan, V. Subramani, R. Kettimuthu, P. Holenarsipur, and P. Sadayappan. Effective Selection of Partition Sizes for Moldable Scheduling of Parallel Jobs. In Proceedings of the 9th Intl. Conference on High Performance Computing, Dec. 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Sweep3D Bench http://www.llnl.gov/asci_benchmarks/asci/limited/sweep3d/asci_sweep3d.html]]Google ScholarGoogle Scholar
  26. D. Talb, D. Feitelson. Supporting Priorities and Improving Utilization of the IBM SP Scheduler Using Slack-Based Backfilling. In 13th Intl. Parallel Proc. Symp. (IPPS), pp. 513--517, Apr. 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. G. Utrera, J. Corbalán, J. Labarta. Implementing Malleability on MPI Jobs. In Proceedings of the Parallel Architecture and Compilation Techniques, 13th International Conference on (PACT'04), pp. 215--224, Antibes Juan-les-Pins, France, Sep 29 - Oct 03, 2004.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. G. Utrera, J. Corbalán, J. Labarta. Scheduling of MPI applications: Self Co-Scheduling.Euro-Par 2004, Lecture Notes in Computer Science 3149, pp 238--245.]]Google ScholarGoogle Scholar
  29. W. Ward Jr., C. L. Mahood, J. E. West. Scheduling Jobs on Parallel Systems Using a Relaxed Backfill Strategy. JSSPP 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Y. Zhang, H. Franke, J. Moreira, A. Sivasubramaniam. Improving Parallel Job Scheduling by Combining Gang Scheduling and Backfilling Techniques. IPDPS 2000.]]Google ScholarGoogle ScholarCross RefCross Ref
  31. C. McCann and J. Zahorjan, "Processor allocation policies for message passing parallel computers". In SIGMETRICS Conf. Measurement & Modeling of Comput. Syst., pp. 19--32, May 1994.]] Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    ICS '05: Proceedings of the 19th annual international conference on Supercomputing
    June 2005
    414 pages
    ISBN:1595931678
    DOI:10.1145/1088149

    Copyright © 2005 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 20 June 2005

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • Article

    Acceptance Rates

    Overall Acceptance Rate584of2,055submissions,28%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader