Skip to main content
Log in

Parallel FFT on ATM‐based networks of workstations

  • Published:
Cluster Computing Aims and scope Submit manuscript

Abstract

We consider parallel computing on a network of workstations using a connection-oriented protocol (e.g., Asynchronous Transfer Mode) for data communication. In a connection-oriented protocol, a virtual circuit of guaranteed bandwidth is established for each pair of communicating workstations. Since all virtual circuits do not have the same guaranteed bandwidth, a parallel application must deal with the unequal bandwidths between workstations. Since most works in the design of parallel algorithms assume equal bandwidths on all the communication links, they often do not perform well when executed on networks of workstations using connection-oriented protocols. In this paper, we first evaluate the performance degradation caused by unequal bandwidths on the execution of conventional parallel algorithms such as the fast Fourier transform and bitonic sort. We then present a strategy based on dynamic redistribution of data points to reduce the bottlenecks caused by unequal bandwidths. We also extend this strategy to deal with processor heterogeneity. Using analysis and simulation we show that there is a considerable reduction in the runtime if the proposed redistribution strategy is adopted. The basic idea presented in this paper can also be used to improve the runtimes of other parallel applications in connection-oriented environments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. A. Abualsamid, R. Alqadi and P. Ramanathan, Distributed synthesis of real-time computer systems, in: Proceedings of the IEEE Real-Time Technology and Applications Symposium(May 1995) pp. 154-163.

  2. T. Agerwala et al., SP2 system architecture, IBM Systems Journal 34(2) (1995) 152-184.

    Article  Google Scholar 

  3. D.A. Bader and J.J. Já, Practical parallel algorithms for dynamic data redistribution, median finding and selection, in: Proceedings International Parallel Processing Symposium(April 1996) pp. 292-301.

  4. J.-Y.L. Boudec, The Asynchronous Transfer Mode: A tutorial, Computer Networks and ISDN Systems 24 (1992) 279-309.

    Article  MATH  Google Scholar 

  5. S.-L. Chang, D.H.-C. Du, J. Hsieh, R.P. Tsang and M. Lin, Enhanced PVM communications over a high-speed LAN, IEEE Parallel and Distributed Technology (1995) 20-32.

  6. M. Cierniak, W. Li and M.J. Zaki, Loop scheduling for heterogeneity, in: Proceedings High Performance Distributed Computing(August 1995) 78-85.

  7. D. Culler et al., LogP: Towards a realistic model of parallel computation, in: Proceedings ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming(May 1993).

  8. A. Gupta and V. Kumar, The scalability of FFT on parallel computers, IEEE Transactions on Parallel and Distributed Systems 4(8) (August 1993) 922-932.

    Article  Google Scholar 

  9. Y. Huang, C.C. Huang and P.K. McKinley, Multicast virtual topologies for collective communication in MPCs and ATM clusters, in: Proceedings Supercomputing(December 1995).

  10. Y. Huang and P.K. McKinley, Efficient collective operations with ATM network interface support, in: Proceedings International Conference on Parallel Processing(August 1996).

  11. V. Kumar, A. Grama, A. Gupta and G. Karypis, Introduction to Parallel Computing: Design and Analysis of Algorithms(Benjamin/Cummings, Redwood City, CA, 1994).

    MATH  Google Scholar 

  12. F.T. Leighton, Introduction to Parallel Algorithms and Architectures: Arrays, Trees and Hypercubes(Morgan-Kaufmann, San Mateo, CA, 1992).

    MATH  Google Scholar 

  13. C.E. Leiserson, Z.S. Abuhamdeh, D.C. Douglas, C.R. Feynman, M.N. Ganmukhi, J.V. Hill, W.D. Hillis, B.C. Kuszmaul, M.A.S. Pierre, D.S. Wells, M.C. Wong, S.-W. Yang and R. Zak, The network architecture of the connection machine CM-5, in: Proceedings ACM Symposium of Parallel Algorithms and Architectures(July 1992).

  14. I. Pramanick and A.K. Pramanick, Distributed solutions to the delay fault test quality evaluation problem, in: Proceedings High Performance Distributed Computing(August 1994) pp. 177-185.

  15. C.S. Raghavendra, S. Chalasani and R.V. Boppana, Improved algorithms for load balancing in circuit-switched hypercubes, in: Proceedings International Parallel Processing Symposium(April 1991) pp. 537-542.

  16. M. ten Bruggencate and S. Chalasani, Parallel implementations of the power system transient stability problem on clusters of workstations, in: Proceedings Supercomputing(December 1995).

  17. J. Woo and S. Sahni, Load balancing on a hypercube, in: Proceedings International Parallel Processing Symposium(April 1991) pp. 525-530.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chalasani, S., Ramanathan, P. Parallel FFT on ATM‐based networks of workstations. Cluster Computing 1, 13–26 (1998). https://doi.org/10.1023/A:1019008726420

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1019008726420

Keywords

Navigation