Abstract
Reverse debugging is a technique for troubleshooting and analyzing software that allows developers to work directly from a software failure to the source code error that led to that failure. ReplayEngine makes this technique available for High Performance Computing (HPC) environments. This paper presents an exploration of the challenges we face and solutions that we are exploring as we develop ReplayEngine into a mature HPC reverse debugging solution.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bouteiller, A., Bosilca, G., Dongarra, J.: Retrospect: Deterministic replay of mpi applications for interactive distributed debugging. In: Cappello, F., Herault, T., Dongarra, J. (eds.) PVM/MPI 2007. LNCS, vol. 4757, pp. 297–306. Springer, Heidelberg (2007)
Xue, R., Liu, X., Wu, M., Guo, Z., Chen, W., Zheng, W., Zhang, Z., Voelker, G.M.: MPI WIZ: Subgroup Reproducible Replay of MPI Applications. In: Principles and Practices of Parallel Programming, Sheridan Printing (2009)
TotalView Technologies: TotalView Documentation End User Documentation (2009), http://www.totalviewtech.com/support/documentation.html
Gropp, W., Lusk, E.: MPICH2 (2006), http://www.mcs.anl.gov/mpi/mpich2/
Gabriel, E., Fagg, G.E., Bosilca, G., Angskun, T., Dongarra, J., Squyres, J.M., Sahay, V., Kambadur, P., Barrett, B.W., Lumsdaine, A., Castain, R.H., Daniel, D.J., Graham, R.L., Woodall, T.S.: Open MPI: Goals, concept, and design of a next generation MPI implementation. In: Kranzlmüller, D., Kacsuk, P., Dongarra, J. (eds.) EuroPVM/MPI 2004. LNCS, vol. 3241, pp. 97–104. Springer, Heidelberg (2004)
Cownie, J., Gropp, W.: A standard interface for debugger access to message queue information in MPI. In: Margalef, T., Dongarra, J., Luque, E. (eds.) PVM/MPI 1999. LNCS, vol. 1697, pp. 51–58. Springer, Heidelberg (1999)
Gottbrath, C., Barrett, B., Gropp, B., Lusk, E.R., Squyres, J.: An interface to support the identification of dynamic MPI 2 processes for scalable parallel debugging. In: Mohr, B., Träff, J.L., Worringen, J., Dongarra, J. (eds.) PVM/MPI 2006. LNCS, vol. 4192, pp. 115–122. Springer, Heidelberg (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gottbrath, C. (2009). Bringing Reverse Debugging to HPC. In: Ropo, M., Westerholm, J., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2009. Lecture Notes in Computer Science, vol 5759. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03770-2_34
Download citation
DOI: https://doi.org/10.1007/978-3-642-03770-2_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03769-6
Online ISBN: 978-3-642-03770-2
eBook Packages: Computer ScienceComputer Science (R0)