Parallel FFT on ATM‐based networks of workstations

Chalasani, Suresh; Ramanathan, Parameswaran

doi:10.1023/A:1019008726420

Parallel FFT on ATM‐based networks of workstations

Published: May 1998

Volume 1, pages 13–26, (1998)
Cite this article

Cluster Computing Aims and scope Submit manuscript

Suresh Chalasani¹ &
Parameswaran Ramanathan¹

46 Accesses
1 Citation
Explore all metrics

Abstract

We consider parallel computing on a network of workstations using a connection-oriented protocol (e.g., Asynchronous Transfer Mode) for data communication. In a connection-oriented protocol, a virtual circuit of guaranteed bandwidth is established for each pair of communicating workstations. Since all virtual circuits do not have the same guaranteed bandwidth, a parallel application must deal with the unequal bandwidths between workstations. Since most works in the design of parallel algorithms assume equal bandwidths on all the communication links, they often do not perform well when executed on networks of workstations using connection-oriented protocols. In this paper, we first evaluate the performance degradation caused by unequal bandwidths on the execution of conventional parallel algorithms such as the fast Fourier transform and bitonic sort. We then present a strategy based on dynamic redistribution of data points to reduce the bottlenecks caused by unequal bandwidths. We also extend this strategy to deal with processor heterogeneity. Using analysis and simulation we show that there is a considerable reduction in the runtime if the proposed redistribution strategy is adopted. The basic idea presented in this paper can also be used to improve the runtimes of other parallel applications in connection-oriented environments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

A. Abualsamid, R. Alqadi and P. Ramanathan, Distributed synthesis of real-time computer systems, in: Proceedings of the IEEE Real-Time Technology and Applications Symposium(May 1995) pp. 154-163.
T. Agerwala et al., SP2 system architecture, IBM Systems Journal 34(2) (1995) 152-184.
Article Google Scholar
D.A. Bader and J.J. Já, Practical parallel algorithms for dynamic data redistribution, median finding and selection, in: Proceedings International Parallel Processing Symposium(April 1996) pp. 292-301.
J.-Y.L. Boudec, The Asynchronous Transfer Mode: A tutorial, Computer Networks and ISDN Systems 24 (1992) 279-309.
Article MATH Google Scholar
S.-L. Chang, D.H.-C. Du, J. Hsieh, R.P. Tsang and M. Lin, Enhanced PVM communications over a high-speed LAN, IEEE Parallel and Distributed Technology (1995) 20-32.
M. Cierniak, W. Li and M.J. Zaki, Loop scheduling for heterogeneity, in: Proceedings High Performance Distributed Computing(August 1995) 78-85.
D. Culler et al., LogP: Towards a realistic model of parallel computation, in: Proceedings ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming(May 1993).
A. Gupta and V. Kumar, The scalability of FFT on parallel computers, IEEE Transactions on Parallel and Distributed Systems 4(8) (August 1993) 922-932.
Article Google Scholar
Y. Huang, C.C. Huang and P.K. McKinley, Multicast virtual topologies for collective communication in MPCs and ATM clusters, in: Proceedings Supercomputing(December 1995).
Y. Huang and P.K. McKinley, Efficient collective operations with ATM network interface support, in: Proceedings International Conference on Parallel Processing(August 1996).
V. Kumar, A. Grama, A. Gupta and G. Karypis, Introduction to Parallel Computing: Design and Analysis of Algorithms(Benjamin/Cummings, Redwood City, CA, 1994).
MATH Google Scholar
F.T. Leighton, Introduction to Parallel Algorithms and Architectures: Arrays, Trees and Hypercubes(Morgan-Kaufmann, San Mateo, CA, 1992).
MATH Google Scholar
C.E. Leiserson, Z.S. Abuhamdeh, D.C. Douglas, C.R. Feynman, M.N. Ganmukhi, J.V. Hill, W.D. Hillis, B.C. Kuszmaul, M.A.S. Pierre, D.S. Wells, M.C. Wong, S.-W. Yang and R. Zak, The network architecture of the connection machine CM-5, in: Proceedings ACM Symposium of Parallel Algorithms and Architectures(July 1992).
I. Pramanick and A.K. Pramanick, Distributed solutions to the delay fault test quality evaluation problem, in: Proceedings High Performance Distributed Computing(August 1994) pp. 177-185.
C.S. Raghavendra, S. Chalasani and R.V. Boppana, Improved algorithms for load balancing in circuit-switched hypercubes, in: Proceedings International Parallel Processing Symposium(April 1991) pp. 537-542.
M. ten Bruggencate and S. Chalasani, Parallel implementations of the power system transient stability problem on clusters of workstations, in: Proceedings Supercomputing(December 1995).
J. Woo and S. Sahni, Load balancing on a hypercube, in: Proceedings International Parallel Processing Symposium(April 1991) pp. 525-530.

Download references

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, University of Wisconsin‐Madison, Madison, WI, 53706‐1691, USA
Suresh Chalasani & Parameswaran Ramanathan

Authors

Suresh Chalasani
View author publications
You can also search for this author in PubMed Google Scholar
Parameswaran Ramanathan
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chalasani, S., Ramanathan, P. Parallel FFT on ATM‐based networks of workstations. Cluster Computing 1, 13–26 (1998). https://doi.org/10.1023/A:1019008726420

Download citation

Issue Date: May 1998
DOI: https://doi.org/10.1023/A:1019008726420

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Parallel FFT on ATM‐based networks of workstations

Abstract

Access this article

Similar content being viewed by others

Mapping Parallel Computations to Distributed Systems Based on RapidIO Technology

Improving Parallel I/O Performance Using Multithreaded Two-Phase I/O with Processor Affinity Management

Architecture, Implementation and Performance Optimization in Organizing Parallel Computations for Simulation Environment

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Parallel FFT on ATM‐based networks of workstations

Abstract

Access this article

Similar content being viewed by others

Mapping Parallel Computations to Distributed Systems Based on RapidIO Technology

Improving Parallel I/O Performance Using Multithreaded Two-Phase I/O with Processor Affinity Management

Architecture, Implementation and Performance Optimization in Organizing Parallel Computations for Simulation Environment

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation