skip to main content
10.1145/2966884.2966905acmotherconferencesArticle/Chapter ViewAbstractPublication PageseurompiConference Proceedingsconference-collections
research-article

On the Expected and Observed Communication Performance with MPI Derived Datatypes

Authors Info & Claims
Published:25 September 2016Publication History

ABSTRACT

We examine natural expectations on communication performance using MPI derived datatypes in comparison to the baseline, "raw" performance of communicating simple, noncontiguous data layouts. We show that common MPI libraries sometimes violate these datatype performance expectations, and discuss reasons why this happens, but also show cases where MPI libraries perform well. Our findings are in many ways surprising and disappointing. First, the performance of derived datatypes is sometimes worse than the semantically equivalent packing and unpacking using the corresponding MPI functionality. Second, the communication performance equivalence stated in the MPI standard between a single contiguous datatype and the repetition of its constituent datatype does not hold universally. Third, the heuristics that are typically employed by MPI libraries at type-commit time are insufficient to enforce natural performance guidelines, and better type normalization heuristics may have a significant performance impact. We show cases where all the MPI type constructors are necessary to achieve the expected performance for certain data layouts. We describe our benchmarking approach to verify the datatype performance guidelines, and present extensive verification results for different MPI libraries.

References

  1. S. Byna, W. D. Gropp, X.-H. Sun, and R. Thakur. Improving the performance of MPI derived datatypes by optimizing memory-access cost. In CLUSTER, pages 412--419, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  2. A. Carpen-Amarie, S. Hunold, and J. L. Träff. MPI derived datatypes: Performance expectations and status quo. CoRR, abs/1607.00178, 2016.Google ScholarGoogle Scholar
  3. R. Ganian, M. Kalany, S. Szeider, and J. L. Träff. Polynomial-time construction of optimal MPI derived datatype trees. In IPDPS. IEEE Computer Society, 2016.Google ScholarGoogle ScholarCross RefCross Ref
  4. W. D. Gropp, T. Hoefler, R. Thakur, and J. L. Träff. Performance expectations and guidelines for MPI derived datatypes: a first analysis. In EuroMPI, pages 150--159. Springer, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. T. Hoefler and S. Gottlieb. Parallel zero-copy algorithms for fast fourier transform and conjugate gradient using MPI datatypes. In EuroPVM/MPI, pages 132--141, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Hunold, A. Carpen-Amarie, and J. L. Träff. Reproducible MPI micro-benchmarking isn't as easy as you think. In EuroMPI/ASIA, pages 69--76. ACM, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Kalany and J. L. Träff. Efficient, optimal MPI datatype reconstruction for vector and index types. In EuroMPI. ACM, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. F. Kjolstad, T. Hoefler, and M. Snir. A transformation to convert packing code to compact datatypes for efficient zero-copy data transfer. Technical report, University of Illinois at Urbana-Champain, 2011. Retrieved from http://hdl.handle.net/2142/26452, last visited on 03/01/2016.Google ScholarGoogle Scholar
  9. F. Kjolstad, T. Hoefler, and M. Snir. Automatic datatype generation and optimization. In PPoPP, pages 327--328, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. MPI Forum. MPI: A Message-Passing Interface Standard. Version 3.1, June 4th 2015. www.mpi-forum.org.Google ScholarGoogle Scholar
  11. T. Prabhu and W. Gropp. DAME: A runtime-compiled engine for derived datatypes. In EuroMPI, 2015. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. R. Reussner, J. L. Träff, and G. Hunzelmann. A benchmark for MPI derived datatypes. In EuroPVM/MPI, pages 10--17. Springer, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. Ross, N. Miller, and W. D. Gropp. Implementing fast and reusable datatype processing. In EuroPVM/MPI, pages 404--413. Springer, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  14. R. B. Ross, R. Latham, W. Gropp, E. L. Lusk, and R. Thakur. Processing MPI datatypes outside MPI. In EuroPVM/MPI, pages 42--53, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. T. Schneider, R. Gerstenberger, and T. Hoefler. Application-oriented ping-pong benchmarking: how to assess the real communication overheads. Computing, 96(4):279--292, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. T. Schneider, F. Kjolstad, and T. Hoefler. MPI datatype processing using runtime compilation. In EuroMPI, pages 19--24, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. M. Schulz, G. Bronevetsky, and B. R. de Supinski. On the performance of transparent MPI piggyback messages. In EuroPVM/MPI, pages 194--201, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. L. Träff. Optimal MPI datatype normalization for vector and index-block types. In EuroMPI/ASIA, pages 33--38. ACM, 2014. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. L. Träff, W. D. Gropp, and R. Thakur. Self-consistent MPI performance guidelines. IEEE TPDS, 21(5):698--709, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. L. Träff, R. Hempel, H. Ritzdorf, and F. Zimmermann. Flattening on the fly: efficient handling of MPI derived datatypes. In EuroPVM/MPI, pages 109--116. Springer, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Wu, P. Wyckoff, and D. K. Panda. High performance implementation of MPI derived datatype communication over InfiniBand. In IPDPS, page 14, 2004.Google ScholarGoogle Scholar
  22. Y. Wu, J. Song, K. Ren, and X. Li. MPI derived datatypes and data communication analysis in meteorological applications. In 2015 IEEE International Conference on Smart City/SocialCom/SustainCom (SmartCity), pages 536--541, 2015.Google ScholarGoogle ScholarCross RefCross Ref

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Other conferences
    EuroMPI '16: Proceedings of the 23rd European MPI Users' Group Meeting
    September 2016
    225 pages
    ISBN:9781450342346
    DOI:10.1145/2966884

    Copyright © 2016 ACM

    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 25 September 2016

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • research-article
    • Research
    • Refereed limited

    Acceptance Rates

    Overall Acceptance Rate66of139submissions,47%

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader