Skip to main content

Asynchronous Distributed Broadcasting in Cluster Environment

  • Conference paper
Recent Advances in Parallel Virtual Machine and Message Passing Interface (EuroPVM/MPI 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3241))

Abstract

Improving communication performance is an important issue in cluster systems. This paper investigates the possibility of accelerating group communication at the level of message passing libraries. A new algorithm for implementing the broadcast communication primitive will be introduced. It enhances the performance of fully-switched cluster systems by using message decomposition and asynchronous communication. The new algorithm shows the dynamism and the portability of the software solutions, while it has a constant asymptotic time complexity achieved only with hardware support before. Test measurements show that the algorithm really has a constant time complexity, and in certain cases it can outperform the widely used binary tree approach by 100 percent. The presented algorithm can be used to increase the performance of broadcasting, and can also indirectly speed up various group communication primitives used in standard message passing libraries.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Martin, et al.: Effects of Communication Latency, Overhead and Bandwidth in a Cluster Architecture. In: 24th Annual Symposium on Computer Architecture, Denver, pp. 85–97 (1997)

    Google Scholar 

  2. Chiola, G., Ciaccio, G.: Efficient Parallel Processing on Low-Cost Clusters with GAMMA Active Ports. Parallel Computing 26, 333–354 (2000)

    Article  MATH  Google Scholar 

  3. Geist, A., Beguelin, A., Dongarra, J., Jiang, W., Sunderam, V.: Parallel Virtual Machine – A User’s Guide and Tutorial for Networked Parallel Computing. MIT Press, London (1994)

    Google Scholar 

  4. Snir, M., Otto, S., Huss-Lederman, S., Walker, D., Dongarra, J.: MPI–The Complete Reference, 2nd edn. The MPI-1 Core, vol. 1. MIT Press, Cambridge (1998)

    Google Scholar 

  5. Meyer, U., et al.: Algorithms for Memory Hierarchies. LNCS, vol. 2625, pp. 320–354. Springer, Heidelberg (2003)

    Book  MATH  Google Scholar 

  6. Baugh Jr., J.W., Konduri, R.K.S.: Discrete element modeling on a cluster of workstations. Engineering with Computers 17, 1–15 (2001)

    Article  MATH  Google Scholar 

  7. Gropp, W., Lusk, E., Doss, N.: A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard. Parallel Computing 22(6), 789–828 (1996)

    Article  MATH  Google Scholar 

  8. Indiana University, Indiana University’s Open Systems Lab: LAM/MPI, http://www.lam-mpi.org/

  9. Kielmann, T., Hofman, F.H., et al.: MagPIe: MPI’s collective communication operations for clustered wide area systems. ACM SIGPLAN Notices 34(8), 131–140 (1999)

    Article  Google Scholar 

  10. Barnett, M., Gupta, S., Payne, D., Shuler, L., van de Geijn, R., Watts, J.: Interprocessor Collective Communication Library (InterCom). In: Proceedings of Supercomputing 1994 (1994)

    Google Scholar 

  11. Thakur, R., Gropp, W.: Improving the Performance of Collective Operations in MPICH. In: Dongarra, J., Laforenza, D., Orlando, S. (eds.) EuroPVM/MPI 2003. LNCS, vol. 2840, pp. 257–267. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  12. Floyd, S., et al.: A Reliable Multicast Framework for Light-Weight Sessions and Application Level Framing. IEEE/ACM Transactions on Networking 5(6), 784–803 (1997)

    Article  Google Scholar 

  13. Pingali, S., Towsley, D., Kurose, J.F.: A Comparison of Sender-Initiated and Receiver- Initiated Reliable Multicast Protocols. In: Sigmetrics Conference on Measurement and Computer Systems, pp. 221–230. ACM Press, New York (1994)

    Chapter  Google Scholar 

  14. Buntinas, D., Panda, D.K., Brightwell, R.: Application-Bypass Broadcast in MPICH over GM. In: International Symposium on Cluster Computing and the Grid (2003)

    Google Scholar 

  15. Juhász, S., Charaf, H.: Exploiting Fast Ethernet Performance in Multiplatform Cluster Environment. In: 19th ACM Symposium on Applied Computing, Nicosia, pp. 1407–1411 (2004)

    Google Scholar 

  16. RWTH Aachen: Multi-Platform MPICH, http://www.lfbs.rwth-aachen.de/mp-mpich/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Juhász, S., Kovács, F. (2004). Asynchronous Distributed Broadcasting in Cluster Environment. In: Kranzlmüller, D., Kacsuk, P., Dongarra, J. (eds) Recent Advances in Parallel Virtual Machine and Message Passing Interface. EuroPVM/MPI 2004. Lecture Notes in Computer Science, vol 3241. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30218-6_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30218-6_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23163-9

  • Online ISBN: 978-3-540-30218-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics