Abstract
The one-to-all broadcast is the most primary collective communication pattern in a multicomputer network. This paper studies this problem in a circuit-switched torus with α-port capability, where a node can simultaneously send and receive α messages at one time. This is a generalization of the one-port and all-port models. We show how to efficiently perform broadcast in tori of any dimension, any size, square or nonsquare, using near optimal numbers of steps. The main techniques used are: (i) a “span-by-dimension” approach, which makes our solution scalable to torus dimensions, and (ii) a “squeeze-then-expand” approach, which makes possible solving the difficult cases where tori are non-square. Existing results, as compared to ours, can only solve very restricted sizes or dimensions of tori, or use more numbers of steps.
Similar content being viewed by others
References
CM-5 technical summary. Thanking Machines Corp., 1991.
V. Bala, J. Bruck, R. Cypher, P. Elustondo, A. Ho, C. T. Ho, S. Kipmis, and Snir. CCL: A portable and tunable collective communication library for scalable parallel computers. In International Parallel Processing Symposium, Cancun, Mexico, pp. 835-843, April 1994.
C.-T. Ho and M.-Y. Kao. Optimal broadcasting in all-port wormhole-routed hypercubes. IEEE Transactions on Parallel andDistributedSystems, 6(2):200-204, 1995.
S.-K. Lee and J.-Y. Lee. Optimal broadcast in α-port wormhole-routed mesh networks. In International Conference on Parallel andDistributedSystems, pp. 109-114, 1997.
P. K. McKinley, Y.-J. Tsai, and D. F. Robinson. Collective communication in wormhole-routed massively parallel computers. IEEE Computers, 28(12):39-50, 1995.
Message Passing Interface Forum. Document for standard message-passing interface, November 1993.
W. K. Nicholson. Linear Algebra with Applications, 3rd ed. PWS Publishing Company, 1995.
S. F. Nugent. The iPSC/2 direct-connect technology. In Proceedings of 3rd ACM Conference on Hypercube Concurrent Computers andApplications, pp. 51-60, 1988.
J. L. Park and H.-A. Choi. Circuit-switched broadcasting in tori and meshes networks. IEEE Transactions on Parallel andDistributedSystems, 7(2):184-190, 1996.
J. L. Park, S.-K. Lee, and H.-A. Choi. Circuit-switched broadcasting in d-dimensional torus and mesh networks. In International Parallel Processing Symposium, pp. 26-29, 1994.
J. G. Peters and M. Syska. Circuit-switched broadcasting in torus networks. IEEE Transactions on Parallel andDistributedSystems, 7(3):246-255, 1996.
R. Ponnusamy, A. Choudhary, and G. Fox. Communication overhead on CM5: an experimental performance evaluation. In Symposium on Frontiers of Massively Parallel Computation, pp. 108-115, 1992.
D. F. Robinson, P. K. Mckinley, and B. H. C. Cheng. Optimal multicast communication in wormholerouted torus networks. IEEE Transactions on Parallel andDistributedSystems, 6(10):1029-1042, 1995.
Y.-J. Tsai and P. K. McKinley. A broadcasting algorithm for all-port wormhole-routed torus networks. IEEE Transactions on Parallel andDistributedSystems, 7(8):876-885, 1996.
Y.-C. Tseng. A dilated-diagonal-based scheme for broadcast in a wormhole-routed 2d torus. IEEE Transactions on Computing, 46:947-952, 1997.
Y.-C. Tseng, S.-Y, Ni, and J.-P. Sheu. Toward optimal complete exchange on wormhole-routed tori. IEEE Transactions on Computing, 48(10):1065-1082, 1999.
C.-M. Wang and C.-Y. Ku. A near-optimal broadcasting algorithm in all-port wormhole-routed hypercubes. In ACM International Conference on Supercomputing, pp. 147-153, 1995.
S.-Y. Wang and Y.-C. Tseng. Algebraic foundations and broadcasting algorithms for wormholerouted all-port tori. IEEE Transactions on Computing, 49(3):246-258, 2000.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Wang, SY., Tseng, YC., Ni, SY. et al. Circuit-Switched Broadcasting in Multi-Port Multi-Dimensional Torus Networks. The Journal of Supercomputing 20, 217–241 (2001). https://doi.org/10.1023/A:1011650615731
Issue Date:
DOI: https://doi.org/10.1023/A:1011650615731