Skip to main content

Integrated Runtime Measurement Summarisation and Selective Event Tracing for Scalable Parallel Execution Performance Diagnosis

  • Conference paper
Applied Parallel Computing. State of the Art in Scientific Computing (PARA 2006)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4699))

Included in the following conference series:

Abstract

Straightforward trace collection and processing becomes increasingly challenging and ultimately impractical for more complex, long-running, highly parallel applications. Accordingly, the scalasca project is extending the kojak measurement system for mpi, openmp and partitioned global address space (pgas) parallel applications to incorporate runtime management and summarisation capabilities. This offers a more scalable and effective profile of parallel execution performance for an initial overview and to direct instrumentation and event tracing to the key functions and callpaths for comprehensive analysis. The design and re-structuring of the revised measurement system are described, highlighting the synergies possible from integrated runtime callpath summarisation and event tracing for scalable parallel execution performance diagnosis. Early results from measurements of 16,384 mpi processes on IBM BlueGene/L already demonstrate considerably improved scalability.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Forschungszentrum Jüelich GmbH: SCALASCA: Scalable performance Analysis of Large-Scale parallel Applications, http://www.scalasca.org/

  2. Forschungszentrum Jüelich GmbH (ZAM) and the University of Tennessee (ICL): KOJAK: Kit for Objective Judgement and Knowledge-based detection of performance bottlenecks, http://www.fz-juelich.de/zam/kojak/

  3. Wolf, F., Mohr, B.: Automatic Performance Analysis of Hybrid MPI/OpenMP Applications. J. Systems Architecture 49(10-11), 421–439 (2003)

    Article  Google Scholar 

  4. Nagel, W., Arnold, A., Weber, M., Hoppe, H.-C., Solchenbach, K.: VAMPIR: Visualization and Analysis of MPI Resources. Supercomputer 63(1), 69–80 (1996)

    Google Scholar 

  5. Labarta, J., Girona, S., Pillet, V., Cortes, T., Gregoris, L.: DiP: A Parallel Program Development Environment. In: Fraigniaud, P., Mignotte, A., Robert, Y., Bougé, L. (eds.) Euro-Par 1996. LNCS, vol. 1124, pp. 665–674. Springer, Heidelberg (1996)

    Chapter  Google Scholar 

  6. Wolf, F., Freitag, F., Mohr, B., Moore, S., Wylie, B.J.N.: Large Event Traces in Parallel Performance Analysis. In: Proc. 19th Int’l Conf. on Architecture of Computing Systems, Frankfurt am Main, Germany. Lecture Notes in Informatics, p. 81. Gesellschaft für Informatik, pp. 264–273 (2006)

    Google Scholar 

  7. Shende, S.S., Malony, A.D.: The TAU Parallel Performance System. Int’l J. High Performance Computing Applications 20(2), 287–331 (2006)

    Article  Google Scholar 

  8. Cain, H.W., Miller, B.P., Wylie, B.J.N.: A Callgraph-based Search Strategy for Automated Performance Diagnosis. Concurrency and Computation: Practice and Experience 14(3), 203–217 (2002)

    Article  MATH  Google Scholar 

  9. Geimer, M., Wolf, F., Knüpfer, A., Mohr, B., Wylie, B.J.N.: A Platform for Scalable Parallel Trace Analysis. In: A Parallel Trace-Data Interface for Scalable performance Analysis, Umeå, Sweden. LNCS, pp. 398–408. Springer, Heidelberg (2006)

    Google Scholar 

  10. Song, F., Wolf, F., Bhatia, N., Dongarra, J., Moore, S.: An Algebra for Cross-Experiment Performance Analysis. In: Proc. 33rd Int’l Conf. on Parallel Processing (ICPP 2004), Montreal, Quebec, Canada, pp. 63–72. IEEE Computer Society Press, Los Alamitos (2004)

    Google Scholar 

  11. Wolf, F., Mohr, B., Bhatia, N., Hermanns, M.-A.: EPILOG binary trace-data format, version 1.3 (2005), http://www.fz-juelich.de/zam/kojak/doc/epilog.pdf

  12. Gailly, J., Adler, M.: zlib general-purpose compression library, version 1.2.3 (2005), http://www.zlib.net/

  13. Vetter, J., Chambreau, C.: mpiP — lightweight, scalable MPI profiling (2005), http://www.llnl.gov/CASC/mpip/

  14. Fürlinger, K., Gerndt, M.: ompP — A Profiling Tool for OpenMP. In: Proc. 1st Int’l Work. on OpenMP (IWOMP) Eugene, OR, USA (2005)

    Google Scholar 

  15. Knüpfer, A., Brendel, R., Brunst, H., Mix, H., Nagel, W.E.: Introducing the Open Trace Format (OTF). In: Alexandrov, V.N., van Albada, G.D., Sloot, P.M.A., Dongarra, J.J. (eds.) ICCS 2006. LNCS, vol. 3992, pp. 526–533. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  16. The BlueGene/L Team at IBM and LLNL: An overview of the BlueGene/L supercomputer. In: Proc. SC2002, Baltimore, MD, USA. IEEE Computer Society (2002)

    Google Scholar 

  17. Advanced Simulation and Computing Program: The ASC SMG 2000 benchmark code (2001), http://www.llnl.gov/asc/purple/benchmarks/limited/smg/

  18. Geimer, M., Wolf, F., Wylie, B.J.N., Mohr, B.: Scalable Parallel Trace-based Performance Analysis. In: Mohr, B., Träff, J.L., Worringen, J., Dongarra, J. (eds.) Recent Advances in Parallel Virtual Machine and Message Passing Interface. LNCS, vol. 4192, pp. 303–312. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Bo Kågström Erik Elmroth Jack Dongarra Jerzy Waśniewski

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wylie, B.J.N., Wolf, F., Mohr, B., Geimer, M. (2007). Integrated Runtime Measurement Summarisation and Selective Event Tracing for Scalable Parallel Execution Performance Diagnosis. In: Kågström, B., Elmroth, E., Dongarra, J., Waśniewski, J. (eds) Applied Parallel Computing. State of the Art in Scientific Computing. PARA 2006. Lecture Notes in Computer Science, vol 4699. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75755-9_55

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-75755-9_55

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-75754-2

  • Online ISBN: 978-3-540-75755-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics