Skip to main content

Message Strip-Mining Heuristics for High Speed Networks

  • Conference paper
High Performance Computing for Computational Science - VECPAR 2004 (VECPAR 2004)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3402))

  • 585 Accesses

Abstract

In this work we investigate how the compiler technique of message strip-mining performs in practice on contemporary high performance networks. Message strip-mining attempts to reduce the overall cost of communication in parallel programs by breaking up large message transfers into smaller ones that can be overlapped with computation. In practice, however, network resource constraints may negate the expected performance gains. By deriving a performance model and synthetic benchmarks we determine how network and application characteristics in.uence the applicability of this optimization. We use these .ndings to determine heuristics to follow when performing this optimization on parallel programs. We propose strip-mining with variable block size as an alternative strategy that performs almost as well as a highly tuned .xed block strategy and has the advantage of being performance portable across systems and application input sets. We evaluate both techniques using synthetic benchmarks and an application from the NAS Parallel Benchmark suite.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. IBM SP – Seaborg, http://hpcf.nersc.gov/computers/SP/

  2. NERSC Alvarez Cluster, http://www.nersc.gov/alvarez

  3. The NAS Parallel Benchmarks, http://www.nas.nasa.gov/Software/NPB

  4. The UPC Runtime Specification, v1.0. http:/upc.lbl.gov/docs/system

    Google Scholar 

  5. UPC Language Specification, Version 1.0, Available at http://upc.gwu.edu

  6. Alexandrov, A., Ionescu, M.F., Schauser, K.E., Scheiman, C.: LogGP: Incorporating Long Messages into the LogP Model for Parallel Computation. Journal of Parallel and Distributed Computing 44(1), 71–79 (1997)

    Article  Google Scholar 

  7. Bell, C., Bonachea, D., Cote, Y., Duell, J., Husbands, P., Hargrove, P., Iancu, C., Welcome, M., Yelick, K.: An Evaluation of Current High-Performance Networks. In: Proceedings of 17th International Parallel and Distributed Processing Symposium, IPDPS (2003)

    Google Scholar 

  8. Bonachea, D.: GASNet Specification, v1.1. Technical Report CSD-02-1207, University of California at Berkeley (October 2002)

    Google Scholar 

  9. Bozkus, Z., Choudhary, A., Fox, G., Haupt, T., Ranka, S.: A Compilation Approach for Fortran 90D/HPF Compilers on Distributed Memory MIMD Computers. In: Proceedings of the Sixth Workshop on Languages and Compilers for Parallel Computing, Portland, OR (1993)

    Google Scholar 

  10. Culler, D.E., Karp, R.M., Patterson, D.A., Sahay, A., Schauser, K.E., Santos, E., Subramonian, R., von Eicken, T.: LogP: Towards a Realistic Model of Parallel Computation. In: Principles Practice of Parallel Programming, pp. 1–12 (1993)

    Google Scholar 

  11. Gupta, M., Banerjee, P.: Compile-Time Estimation of Communication Costs on Multicomputers. In: Proceedings of the 6th International Parallel Processing Symposium, Beverly Hills, CA (1992)

    Google Scholar 

  12. Gupta, M., Midkiff, S., Schonberg, E., Seshadri, V., Shields, D., Wang, K.-Y., Ching, W.-M., Ngo, T.: An HPF compiler for the IBM SP2. In: Proceedings of the 1995 ACM/IEEE conference on Supercomputing (CDROM), p. 71. ACM Press, New York (1995)

    Chapter  Google Scholar 

  13. Hiranandani, S., Kennedy, K., Tseng, C.-W.: Compiling Fortran D for MIMD Distributed-Memory Machines. Communications of the ACM 35(8), 66–80 (1992)

    Article  Google Scholar 

  14. Lemieux, http://www.psc.edu/machines/tcs/lemieux.html

  15. Prieto, M., Llorente, I.M., Tirado, F.: Data Locality Exploitation in the Descomposition of regular Domain Problems. IEEE Trans. on Parallel and Distributed Systens, 11, 1141–1150 (2000)

    Google Scholar 

  16. Prylli, L., Tourancheau, B., Westrelin, R.: Modeling of a high speed network to maximize throughput performance: the experience of BIP over myrinet. In: Parallel and Distributed Processing Techniques and Applications, PDPTA 1998 (1998)

    Google Scholar 

  17. Prylli, L., Tourancheau, B., Westrelin, R.: The Design for a High-Performance MPI Implementation on the Myrinet Network. In: PVM/MPI, pp. 223–230 (1999)

    Google Scholar 

  18. Wakatani, A., Wolfe, M.: A New Approach to Array Redistribution: Strip Mining Redistribution. In: Halatsis, C., Philokyprou, G., Maritsas, D., Theodoridis, S. (eds.) PARLE 1994. LNCS, vol. 817, pp. 323–335. Springer, Heidelberg (1994)

    Google Scholar 

  19. Wakatani, A., Wolfe, M.: Effectiveness of Message Strip-Mining for Regular and Irregular Communication. In: PDCS (Las Vegas) (October 1994)

    Google Scholar 

  20. Zima, H., Chapman, B.: Compiling for Distributed-Memory Systems. In: Proceedings of the IEEE (1993)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Iancu, C., Husbands, P., Chen, W. (2005). Message Strip-Mining Heuristics for High Speed Networks. In: Daydé, M., Dongarra, J., Hernández, V., Palma, J.M.L.M. (eds) High Performance Computing for Computational Science - VECPAR 2004. VECPAR 2004. Lecture Notes in Computer Science, vol 3402. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11403937_33

Download citation

  • DOI: https://doi.org/10.1007/11403937_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25424-9

  • Online ISBN: 978-3-540-31854-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics