Skip to main content

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 5205))

Abstract

We present the design and implementation of a debugging tool that displays a message-passing graph of the execution of an MPI application. Parts of the graph can be hidden or highlighted based on the stack trace, calling process or communicator of MPI calls. The tool incorporates several features enabling developers to explicitly control the ordering of message-passing events during the execution, and test that reordering these events does not compromise the correctness of the computations. In particular, we describe an automated running mode that detects competing sends matching the same wildcard receive and enables the developer to choose which execution path should be followed by the application.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Allinea, the distributed debugging tool (DDT), http://www.allinea.com

  2. Al-Shabibi, A., Gerlach, S., Hersch, R.D., Schaeli, B.: A Debugger for Flow Graph Based Parallel Applications. In: Proceedings of the ACM International Symposium on Software Testing and Analysis (ISSTA 2007), Parallel and Distributed Systems: Testing and Debugging workshop (PADTAD 2007), London, UK (2007)

    Google Scholar 

  3. Automatic reflexion and serialization of C++ objects, http://home.gna.org/autoserial/

  4. Bouteiller, A., Bosilca, G., Dongarra, J.: Retrospect: Deterministic Replay of MPI Applications for Interactive Distributed Debugging. In: Cappello, F., Herault, T., Dongarra, J. (eds.) PVM/MPI 2007. LNCS, vol. 4757, pp. 297–306. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  5. Gropp, W., Huss-Lederman, S., Lumsdaine, A., Lusk, E., Nitzberg, B., Saphir, W., Snir, M.: MPI: The Complete Reference, vol. 2. MIT Press, Cambridge (1998)

    Google Scholar 

  6. Hong, C.-E., Lee, B.-S., On, G.-W., Chi, D.-H.: Replay for debugging MPI parallel programs. In: Proceedings of the MPI Developer’s Conference, pp. 156–160 (1996)

    Google Scholar 

  7. Jyothi, R., Lawlor, O.S., Kalé, L.V.: Debugging Support for Charm++. In: Procs. of the 18th International Parallel and Distributed Symposium (IPDPS 2004), Parallel and Distributed Systems: Testing and Debugging Workshop (PADTAD), p. 294 (2004)

    Google Scholar 

  8. Petitet, A., Whaley, R.C., Dongarra, J., Cleary, A.: HPL - A Portable Implementation of the High-Performance Linpack Benchmark for Distributed-Memory Computers, http://www.netlib.org/benchmark/hpl/

  9. Sankaran, S., Squyres, J.M., Barrett, B., Lumsdaine, A., Duell, J., Hargrove, P., Roman, E.: The LAM/MPI Checkpoint/Restart Framework: System-Initiated Checkpointing. Int’l Journal of High Performance Computing Applications 19(4), 479–493 (2005)

    Article  Google Scholar 

  10. TotalView Technologies, the TotalView Debugger, http://www.totalviewtech.com

  11. Vakkalanka, S.S., Sharma, S., Gopalakrishnan, G., Kirby, R.M.: ISP: a tool for model checking MPI programs. In: Proc. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2008), pp. 258–256 (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Alexey Lastovetsky Tahar Kechadi Jack Dongarra

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Schaeli, B., Al-Shabibi, A., Hersch, R.D. (2008). Visual Debugging of MPI Applications. In: Lastovetsky, A., Kechadi, T., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2008. Lecture Notes in Computer Science, vol 5205. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87475-1_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-87475-1_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-87474-4

  • Online ISBN: 978-3-540-87475-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics