Abstract
Important insights into program operation can be gained by observing dynamic execution behavior. Unfortunately, many high-performance machines provide execution profile summaries as the only tool for performance investigation. We have developed a tracing library for the CRAY X-MP and CRAY-2 supercomputers that supports the low-overhead capture of execution events for sequential and multitasked programs. This library has been extended to use the automatic instrumentation facilities on these machines, allowing trace data from routine entry and exit, and other program segments, to be captured. To assess the utility of the trace-based tools, three of the Perfect Benchmark codes have been tested in scalar and vector modes with the tracing instrumentation. In addition to computing summary execution statistics from the traces, interesting execution dynamics appear when studying the trace histories. It is also possible to model application performance based on properties identified from traces. Our conclusion is that adding tracing support in Cray supercomputers can have significant returns in improved performance characterization and evaluation.
Similar content being viewed by others
References
Berry, M. 1989. The Perfect Club Benchmarks: Effective performance evaluation of supercomputers. The Internat. J. Supercomputer Applications, 3, 3 (fall), 5–40.
Carrington, D. 1983. Profiling under ELSI UNIX. Software Practice and Experience, 16, 9 (Sept.), 865–873.
Cray Research, Inc. 1989. UNICOS Performance Utilities Reference Manual. May.
Fromm, H., Hercksen, U., Herzog, U., John, K., Klar, R., and Kleinoder, W. 1983. Experiences with performance measurement and modeling of a processor array. IEEE Trans. Comps., 32, 1 (Jan.).
Gehringer, E., Siewiorek, D., and Segall, Z. 1987. Parallel Processing: The CM * Experience. Digital Press.
Graham, S., Kessler, P., and McKusick, M. 1982. gprof: A call graph execution profiler. In Proc., SIGPLAN '82 Symp. on Compiler Construction (Boston, June), ACM Press, pp. 120–126.
Graham, S., Kessler, P., and McKusick, M. 1983. An execution profiler for modular programs. Software Practice and Experience, 13: 671–685.
Jameson, A. 1983. Solution of the Euler equations for a two-dimensional transonic flow by a multigrid method. Applied Math. and Comp., 13:327.
Larson, J. 1985. CRAY X-MP hardware performance monitor. Cray Channels.
Larson, J., and Lutz, R. 1985. Perftrace user guide. Tech. rept., Cray Research, Inc. (Aug.).
Malony, A. 1990. Performance observability. Ph.D. thesis, Dept. of Comp. Sci., Univ. of Ill. at Urbana-Champaign, Urbana, Ill.
Malony, A. 1991. Event based performance perturbation: A case study. In Third ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming (to appear).
Malony, A., Reed, D., and Wijshoff, H. 1989. Performance measurement intrusion and perturbation analysis. Tech. rept. CSRD-923, Univ. of Ill., Center for Supercomputing Research and Development, Urbana, Ill.
Mellor-Crummey, J. 1989. Debugging and analysis of large-scale parallel programs. Ph.D. thesis, Dept. of Comp. Sci., Univ. of Rochester, Rochester, N.Y.
Model, M. 1978. Monitoring system behavior in a complex computational environment. Ph.D. thesis, Stanford Univ., Stanford, Calif.
Noor, A., and Peters, J. 1985. Model-size reduction techniques for the analysis of symmetric anisotropic structures. Eng. Computations, 2, 4 (Apr.), 285.
Orszag, S. 1984. Order and disorder in two and three-dimensional Benard convection. J. Fluid Mechanics, 174:1.
Pointer, L. 1990. Perfect: Performance evaluation for cost-effective transformations—Report 2. Tech. rept. CSRD No. 964, Univ. of Ill. at Urbana-Champaign, Center for Supercomputing Research and Development, Urbana, Ill.
Segall, Z., and Rudolph, L. 1985. PIE: A programming and instrumentation environment for parallel processing. IEEE Software, 2, 6 (Nov.), 22–37.
Simmons, M., Koskela, R., and Bucher, L, eds. 1989. Instrumentation for Future Parallel Computing Systems. ACM Press.
Simmons, M., Koskela, R., and Bucher, I., eds. 1990. Parallel Computer Systems: Performance Instrumentation and Visualization. ACM Press.
Author information
Authors and Affiliations
Additional information
Supported in part by the National Science Foundation under Grants No. NSF MIP-88-07775 and No. NSF ASC-84-04556, and the NASA Ames Research Center Grant No. NCC-2-559.
Supported in part by the National Science Foundation under grant NSF ASC-84-04556.
Supported in part by the National Science Foundation under grants NSF CCR-86-57696, NSF CCR-87-06653 and NSF CDA-87-22836 and by the National Aeronautics and Space Administration under NASA Contract Number NAG-1-613.
Rights and permissions
About this article
Cite this article
Malony, A.D., Larson, J.L. & Reed, D.A. Tracing application program execution on the CRAY X-MP and CRAY-2. J Supercomput 5, 137–162 (1991). https://doi.org/10.1007/BF00127841
Issue Date:
DOI: https://doi.org/10.1007/BF00127841