Memory usage in the LANL CM-5 workload

Feitelson, Dror G.

doi:10.1007/3-540-63574-2_17

Dror G. Feitelson¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1291))

Included in the following conference series:

Workshop on Job Scheduling Strategies for Parallel Processing

154 Accesses
25 Citations

Abstract

It is generally agreed that memory requirements should be taken into account in the scheduling of parallel jobs. However, so far the work on combined processor and memory scheduling has not been based on detailed information and measurements. To rectify this problem, we present an analysis of memory usage by a production workload on a large parallel machine, the 1024-node CM-5 installed at Los Alamos National Lab. Our main observations are

- The distribution of memory requests has strong discrete components, i.e. some sizes are much more popular than others.
- Many jobs use a relatively small fraction of the memory available on each node, so there is some room for time slicing among several memory-resident jobs.
- Larger jobs (using more nodes) tend to use more memory, but it is difficult to characterize the scaling of per-processor memory usage.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

G. Alverson, S. Kahan, R. Korry, C. McCann, and B. Smith, “Scheduling on the Tera MTA”. In Job Scheduling Strategies for Parallel Processing, D. G. Feitelson and L. Rudolph (eds.), pp. 19–44, Springer-Verlag, 1995. Lecture Notes in Computer Science Vol. 949.
Google Scholar
G. M. Amdahl, “Validity of the single processor approach to achieving large scale computer capabilities”. In AFIPS Spring Joint Comput. Conf., vol. 30, pp. 483–485, Apr 1967.
Google Scholar
D. C. Burger, R. S. Hyder, B. P. Miller, and D. A. Wood, “Paging tradeoffs in distributed-shared-memory multiprocessors”. J. Supercomput. 10(1), pp. 87–104, 1996.
Article Google Scholar
J. J. Dongarra, H. W. Meuer, and E. Strohmaier, “Top500 supercomputer sites”. http://www.netlib.org/benchmark/top500.html. (updated every 6 months).
Google Scholar
D. G. Feitelson, “Packing schemes for gang scheduling”. In Job Scheduling Strategies for Parallel Processing, D. G. Feitelson and L. Rudolph (eds.), pp. 89–110, Springer-Verlag, 1996. Lecture Notes in Computer Science Vol. 1162.
Google Scholar
D. G. Feitelson, A Survey of Scheduling in Multiprogrammed Parallel Systems. Research Report RC 19790 (87657), IBM T. J. Watson Research Center, Oct 1994.
Google Scholar
D. G. Feitelson and M. A. Jette, “Improved utilization and responsiveness with gang scheduling”. In Job Scheduling Strategies for Parallel Processing, D. G. Feitelson and L. Rudolph (eds.), Springer Verlag, 1997. Lecture Notes in Computer Science (this volume).
Google Scholar
D. G. Feitelson and B. Nitzberg, “Job characteristics of a production parallel scientific workload on the NASA Ames iPSC/860”. In Job Scheduling Strategies for Parallel Processing, D. G. Feitelson and L. Rudolph (eds.), pp. 337–360, Springer-Verlag, 1995. Lecture Notes in Computer Science Vol. 949.
Google Scholar
D. G. Feitelson and L. Rudolph, “Parallel job scheduling: issues and approaches”. In Job Scheduling Strategies for Parallel Processing, D. G. Feitelson and L. Rudolph (eds.), pp. 1–18, Springer-Verlag, 1995. Lecture Notes in Computer Science Vol. 949.
Google Scholar
D. G. Feitelson and L. Rudolph, “Toward convergence in job schedulers for parallel supercomputers”. In Job Scheduling Strategies for Parallel Processing, D. G. Feitelson and L. Rudolph (eds.), pp. 1–26, Springer-Verlag, 1996. Lecture Notes in Computer Science Vol. 1162.
Google Scholar
J. L. Gustafson, “Reevaluating Amdahl's law”. Comm. ACM 31(5), pp. 532–533, May 1988. See also Comm. ACM 32(2), pp. 262–264, Feb 1989, and Comm. ACM 32(8), pp. 1014–1016, Aug 1989.
Article Google Scholar
J. L. Gustafson, G. R. Montry, and R. E. Benner, “Development of parallel methods for a 1024-processor hypercube”. SIAM J. Sci. Statist. Comput. 9(4), pp. 609–638, Jul 1988.
Article MathSciNet Google Scholar
C. McCann and J. Zahorjan, “Scheduling memory constrained jobs on distributed memory parallel computers”. In SIGMETRICS Conf. Measurement éI Modeling of Comput. Syst., pp. 208–219, May 1995.
Google Scholar
Minnesota Supercomputer Center, Inc., The Distributed Job Manager Administration Guide. 1993. ftp://ec.msc.edu/pub/LIGHTNING/djm-1.0.O-Src.tar.Z.
Google Scholar
E. W. Parsons and K. C. Sevcik, “Coordinated allocation of memory and processors in multiprocessors”. In SIGMETRICS Conf. Measurement & Modeling of Comput. Syst., pp. 57–67, May 1996.
Google Scholar
V. G. J. Peris, M. S. Squillante, and V. K. Naik, “Analysis of the impact of memory in distributed parallel processing systems”. In SIGMETRICS Conf. Measurement & Modeling of Comput. Syst., pp. 5–18, May 1994.
Google Scholar
S. K. Setia, “The interaction between memory allocation and adaptive partitioning in message-passing multicomputers”. In Job Scheduling Strategies for Parallel Processing, D. G. Feitelson and L. Rudolph (eds.), pp. 146–165, Springer-Verlag, 1995. Lecture Notes in Computer Science Vol. 949.
Google Scholar
J. P. Singh, J. L. Hennessy, and A. Gupta, “Scaling parallel programs for multiprocessors: methodology and examples”. Computer 26(7), pp. 42–50, Jul 1993.
Article Google Scholar
X-H. Sun and L. M. Ni, “Scalable problems and memory-bounded speedup”. J. Parallel & Distributed Comput. 19(1), pp. 27–37, Sep 1993.
Google Scholar
Thinking Machines Corp., Connection Machine CM-5 Technical Summary. Nov 1992.
Google Scholar
K. Y. Wang and D. C. Marinescu, “Correlation of the paging activity of individual node programs in the SPMD execution model”. In 28th Hawaii Intl. Conf. System Sciences, vol. I, pp. 61–71, Jan 1995.
Google Scholar
P. H. Worley, “The effect of time constraints on scaled speedup”. SIAM J. Sci. Statist. Comput. 11(5), pp. 838–858, Sep 1990.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science, The Hebrew University, 91904, Jerusalem, Israel
Dror G. Feitelson

Authors

Dror G. Feitelson
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Dror G. Feitelson Larry Rudolph

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Feitelson, D.G. (1997). Memory usage in the LANL CM-5 workload. In: Feitelson, D.G., Rudolph, L. (eds) Job Scheduling Strategies for Parallel Processing. JSSPP 1997. Lecture Notes in Computer Science, vol 1291. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63574-2_17

Download citation

DOI: https://doi.org/10.1007/3-540-63574-2_17
Published: 12 July 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63574-1
Online ISBN: 978-3-540-69599-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics