Abstract
We consider parallel computing on a network of workstations using a connection-oriented protocol (e.g., Asynchronous Transfer Mode) for data communication. In a connection-oriented protocol, a virtual circuit of guaranteed bandwidth is established for each pair of communicating workstations. Since all virtual circuits do not have the same guaranteed bandwidth, a parallel application must deal with the unequal bandwidths between workstations. Since most works in the design of parallel algorithms assume equal bandwidths on all the communication links, they often do not perform well when executed on networks of workstations using connection-oriented protocols. In this paper, we first evaluate the performance degradation caused by unequal bandwidths on the execution of conventional parallel algorithms such as the fast Fourier transform and bitonic sort. We then present a strategy based on dynamic redistribution of data points to reduce the bottlenecks caused by unequal bandwidths. We also extend this strategy to deal with processor heterogeneity. Using analysis and simulation we show that there is a considerable reduction in the runtime if the proposed redistribution strategy is adopted. The basic idea presented in this paper can also be used to improve the runtimes of other parallel applications in connection-oriented environments.
Similar content being viewed by others
References
A. Abualsamid, R. Alqadi and P. Ramanathan, Distributed synthesis of real-time computer systems, in: Proceedings of the IEEE Real-Time Technology and Applications Symposium(May 1995) pp. 154-163.
T. Agerwala et al., SP2 system architecture, IBM Systems Journal 34(2) (1995) 152-184.
D.A. Bader and J.J. Já, Practical parallel algorithms for dynamic data redistribution, median finding and selection, in: Proceedings International Parallel Processing Symposium(April 1996) pp. 292-301.
J.-Y.L. Boudec, The Asynchronous Transfer Mode: A tutorial, Computer Networks and ISDN Systems 24 (1992) 279-309.
S.-L. Chang, D.H.-C. Du, J. Hsieh, R.P. Tsang and M. Lin, Enhanced PVM communications over a high-speed LAN, IEEE Parallel and Distributed Technology (1995) 20-32.
M. Cierniak, W. Li and M.J. Zaki, Loop scheduling for heterogeneity, in: Proceedings High Performance Distributed Computing(August 1995) 78-85.
D. Culler et al., LogP: Towards a realistic model of parallel computation, in: Proceedings ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming(May 1993).
A. Gupta and V. Kumar, The scalability of FFT on parallel computers, IEEE Transactions on Parallel and Distributed Systems 4(8) (August 1993) 922-932.
Y. Huang, C.C. Huang and P.K. McKinley, Multicast virtual topologies for collective communication in MPCs and ATM clusters, in: Proceedings Supercomputing(December 1995).
Y. Huang and P.K. McKinley, Efficient collective operations with ATM network interface support, in: Proceedings International Conference on Parallel Processing(August 1996).
V. Kumar, A. Grama, A. Gupta and G. Karypis, Introduction to Parallel Computing: Design and Analysis of Algorithms(Benjamin/Cummings, Redwood City, CA, 1994).
F.T. Leighton, Introduction to Parallel Algorithms and Architectures: Arrays, Trees and Hypercubes(Morgan-Kaufmann, San Mateo, CA, 1992).
C.E. Leiserson, Z.S. Abuhamdeh, D.C. Douglas, C.R. Feynman, M.N. Ganmukhi, J.V. Hill, W.D. Hillis, B.C. Kuszmaul, M.A.S. Pierre, D.S. Wells, M.C. Wong, S.-W. Yang and R. Zak, The network architecture of the connection machine CM-5, in: Proceedings ACM Symposium of Parallel Algorithms and Architectures(July 1992).
I. Pramanick and A.K. Pramanick, Distributed solutions to the delay fault test quality evaluation problem, in: Proceedings High Performance Distributed Computing(August 1994) pp. 177-185.
C.S. Raghavendra, S. Chalasani and R.V. Boppana, Improved algorithms for load balancing in circuit-switched hypercubes, in: Proceedings International Parallel Processing Symposium(April 1991) pp. 537-542.
M. ten Bruggencate and S. Chalasani, Parallel implementations of the power system transient stability problem on clusters of workstations, in: Proceedings Supercomputing(December 1995).
J. Woo and S. Sahni, Load balancing on a hypercube, in: Proceedings International Parallel Processing Symposium(April 1991) pp. 525-530.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Chalasani, S., Ramanathan, P. Parallel FFT on ATM‐based networks of workstations. Cluster Computing 1, 13–26 (1998). https://doi.org/10.1023/A:1019008726420
Issue Date:
DOI: https://doi.org/10.1023/A:1019008726420