- 1.D. Culler et. al. A Case for Networks of Workstations: NOW. IEEE Micro, Feb, 1995. Google ScholarDigital Library
- 2.Alan Mainwaring, David Culler. Active Messages: Organization and Applications Programming Interface. Berkeley Technical Report, 1995, http://now.cs.berkeley.edu/Papers/Papers/am_spec.ps. Google ScholarDigital Library
- 3.M. Lauria and A. Chien. MPI-FM: Higher Performance MPI on Workstation Clusters. Parallel and Distributed Computing, pp. 4-18, Jan. 1997. Google ScholarDigital Library
- 4.Steve Lumetta, Alan Mainwaring, David Culler. Multi-Protocol Active Messages on a Cluster of SMP's. Proceedings of SuperComputing '97. Google ScholarDigital Library
- 5.W. Gropp, E. Lusk, N. Doss, A. Skjellum. A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard. Parallel Computing, volume 22, number 6, pp. 789-828, Sep. 1996. Google ScholarDigital Library
- 6.Parry Husbands and James Hoe. MPI_StarT: Delivering Network Performance to Numerical Applications.Google Scholar
- 7.Alan Charlesworth. Benchmarking Starfire Memory Performance. Sun Microsystems internal report, unpublished.Google Scholar
- 8.Alan Charlesworth et. al. The Starfire SMP Interconnect. Proceedings of SuperComputing '97. Google ScholarDigital Library
- 9.Ruud van der Pas, Lisa Noordergraaf. Performance Experiences on Sun's WildFire Prototype. Proceedings of Super- Computing '99. Google ScholarDigital Library
Index Terms
- Optimization of MPI collectives on clusters of large-scale SMP's
Recommendations
MPI Collectives for Multi-core Clusters: Optimized Performance of the Hybrid MPI+MPI Parallel Codes
ICPP Workshops '19: Workshop Proceedings of the 48th International Conference on Parallel ProcessingThe advent of multi-/many-core processors in clusters advocates hybrid parallel programming, which combines Message Passing Interface (MPI) for inter-node parallelism with a shared memory model for on-node parallelism. Compared to the traditional hybrid ...
MPI-StarT: delivering network performance to numerical applications
SC '98: Proceedings of the 1998 ACM/IEEE conference on SupercomputingWe describe an MPI implementation for a cluster of SMPs interconnected by a high-performance interconnect. This work is a collaboration between a numerical applications programmer and a cluster interconnect architect. The collaboration started with the ...
Performance comparison of MPI and three openMP programming styles on shared memory multiprocessors
SPAA '03: Proceedings of the fifteenth annual ACM symposium on Parallel algorithms and architecturesWhen using a shared memory multiprocessor, the programmer faces the selection of the portable programming model which will deliver the best performance. Even if he restricts his choice to the standard programming environments (MPI and OpenMP), he has a ...
Comments