Abstract
In this paper, we describe the implementation of memory-checking functionality based on instrumentation with the Valgrind Memcheck tool. Combining Valgrind-based checking functions with the MPI implementation offers superior debugging functionality, detecting errors that comparable MPI debugging tools cannot. The functionality is integrated into Open MPI as the so-called memchecker framework, which also allows other memory debuggers offering a similar API to be integrated. The tight control of the user's memory passed to Open MPI not only allows application errors to be found, but also helps track down bugs within Open MPI itself. We describe the actual checks, the classes of errors found, how memory buffers are handled internally, errors actually found in user code, and the performance implications of this instrumentation.
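To make the idea of "tight control of the user's memory" more concrete, the following is a minimal sketch of how a communication library might track the state of a user buffer while an operation is in flight. It is not the memchecker framework's code: every `toy_*` name and the region table are hypothetical, and the choice of a non-blocking receive as the example is an assumption, since the abstract does not list the individual checks. A real integration would delegate this bookkeeping to the memory debugger, for instance through Memcheck client requests such as `VALGRIND_MAKE_MEM_NOACCESS` and `VALGRIND_MAKE_MEM_DEFINED`.

```c
/* Toy model of buffer-state tracking around a non-blocking receive.
 * All toy_* names are hypothetical; a real memory checker keeps this
 * information in shadow memory rather than in a small table. */
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

typedef enum { TOY_DEFINED, TOY_NOACCESS } toy_state;

typedef struct {
    uintptr_t base;   /* start address of the tracked buffer */
    size_t    len;    /* length in bytes */
    toy_state state;  /* whether the application may touch it */
} toy_region;

#define TOY_MAX_REGIONS 16
static toy_region toy_regions[TOY_MAX_REGIONS];
static size_t toy_nregions = 0;

/* Record or update the state of a buffer; returns 0 on success, -1 if full. */
int toy_set_state(const void *buf, size_t len, toy_state s)
{
    uintptr_t base = (uintptr_t)buf;
    for (size_t i = 0; i < toy_nregions; i++) {
        if (toy_regions[i].base == base) {
            toy_regions[i].len = len;
            toy_regions[i].state = s;
            return 0;
        }
    }
    if (toy_nregions == TOY_MAX_REGIONS)
        return -1;
    toy_regions[toy_nregions].base = base;
    toy_regions[toy_nregions].len = len;
    toy_regions[toy_nregions].state = s;
    toy_nregions++;
    return 0;
}

/* Return 1 if addr may be accessed, 0 if it lies in a no-access region. */
int toy_may_access(const void *addr)
{
    uintptr_t p = (uintptr_t)addr;
    for (size_t i = 0; i < toy_nregions; i++) {
        const toy_region *r = &toy_regions[i];
        if (r->state == TOY_NOACCESS && p >= r->base && p - r->base < r->len)
            return 0;
    }
    return 1;
}

/* Stand-in for starting a non-blocking receive: the library now owns buf. */
void toy_irecv(void *buf, size_t len)
{
    int rc = toy_set_state(buf, len, TOY_NOACCESS);
    assert(rc == 0);
    (void)rc;
}

/* Stand-in for completing the receive: buf is handed back, fully defined. */
void toy_wait(void *buf, size_t len)
{
    int rc = toy_set_state(buf, len, TOY_DEFINED);
    assert(rc == 0);
    (void)rc;
}

/* Touch the buffer once before and once after completion;
 * returns the number of accesses that the toy checker rejects. */
int toy_demo(void)
{
    int buf[4] = {0};
    int errors = 0;

    toy_irecv(buf, sizeof buf);
    if (!toy_may_access(&buf[1]))   /* premature access: should be flagged */
        errors++;

    toy_wait(buf, sizeof buf);
    if (!toy_may_access(&buf[1]))   /* legitimate access after completion */
        errors++;

    return errors;
}
```

In this sketch only the access made between `toy_irecv` and `toy_wait` is rejected. The table lookup is linear and tracks whole buffers; it is meant to illustrate the state transitions, not the bit-precise tracking that Memcheck performs.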
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Fan, S., Keller, R., Resch, M. (2008). Enhanced Memory debugging of MPI-parallel Applications in Open MPI. In: Resch, M., Keller, R., Himmler, V., Krammer, B., Schulz, A. (eds) Tools for High Performance Computing. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68564-7_4
DOI: https://doi.org/10.1007/978-3-540-68564-7_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68561-6
Online ISBN: 978-3-540-68564-7