Abstract
We present the design and implementation of a debugging tool that displays a message-passing graph of the execution of an MPI application. Parts of the graph can be hidden or highlighted based on the stack trace, calling process or communicator of MPI calls. The tool incorporates several features enabling developers to explicitly control the ordering of message-passing events during the execution, and test that reordering these events does not compromise the correctness of the computations. In particular, we describe an automated running mode that detects competing sends matching the same wildcard receive and enables the developer to choose which execution path should be followed by the application.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Allinea, the distributed debugging tool (DDT), http://www.allinea.com
Al-Shabibi, A., Gerlach, S., Hersch, R.D., Schaeli, B.: A Debugger for Flow Graph Based Parallel Applications. In: Proceedings of the ACM International Symposium on Software Testing and Analysis (ISSTA 2007), Parallel and Distributed Systems: Testing and Debugging workshop (PADTAD 2007), London, UK (2007)
Automatic reflexion and serialization of C++ objects, http://home.gna.org/autoserial/
Bouteiller, A., Bosilca, G., Dongarra, J.: Retrospect: Deterministic Replay of MPI Applications for Interactive Distributed Debugging. In: Cappello, F., Herault, T., Dongarra, J. (eds.) PVM/MPI 2007. LNCS, vol. 4757, pp. 297–306. Springer, Heidelberg (2007)
Gropp, W., Huss-Lederman, S., Lumsdaine, A., Lusk, E., Nitzberg, B., Saphir, W., Snir, M.: MPI: The Complete Reference, vol. 2. MIT Press, Cambridge (1998)
Hong, C.-E., Lee, B.-S., On, G.-W., Chi, D.-H.: Replay for debugging MPI parallel programs. In: Proceedings of the MPI Developer’s Conference, pp. 156–160 (1996)
Jyothi, R., Lawlor, O.S., Kalé, L.V.: Debugging Support for Charm++. In: Procs. of the 18th International Parallel and Distributed Symposium (IPDPS 2004), Parallel and Distributed Systems: Testing and Debugging Workshop (PADTAD), p. 294 (2004)
Petitet, A., Whaley, R.C., Dongarra, J., Cleary, A.: HPL - A Portable Implementation of the High-Performance Linpack Benchmark for Distributed-Memory Computers, http://www.netlib.org/benchmark/hpl/
Sankaran, S., Squyres, J.M., Barrett, B., Lumsdaine, A., Duell, J., Hargrove, P., Roman, E.: The LAM/MPI Checkpoint/Restart Framework: System-Initiated Checkpointing. Int’l Journal of High Performance Computing Applications 19(4), 479–493 (2005)
TotalView Technologies, the TotalView Debugger, http://www.totalviewtech.com
Vakkalanka, S.S., Sharma, S., Gopalakrishnan, G., Kirby, R.M.: ISP: a tool for model checking MPI programs. In: Proc. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2008), pp. 258–256 (2008)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Schaeli, B., Al-Shabibi, A., Hersch, R.D. (2008). Visual Debugging of MPI Applications. In: Lastovetsky, A., Kechadi, T., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2008. Lecture Notes in Computer Science, vol 5205. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87475-1_33
Download citation
DOI: https://doi.org/10.1007/978-3-540-87475-1_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87474-4
Online ISBN: 978-3-540-87475-1
eBook Packages: Computer ScienceComputer Science (R0)