Skip to main content
Log in

Mapping Parallel Application Communication Topology to Rhombic Overlapping-Cluster Multiprocessors

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

This paper extends research into rhombic overlapping-connectivity interconnection networks into the area of parallel applications. As a foundation for a shared-memory non-uniform access bus-based multiprocessor, these interconnection networks create overlapping groups of processors, buses, and memories, forming a clustered computer architecture where the clusters overlap. This overlapping-membership characteristic is shown to be useful for matching parallel application communication topology to the architecture's bandwidth characteristics. Many parallel applications can be mapped to the architecture topology so that most or all communication is localized within an overlapping cluster, at the low latency of processor direct to cache (or memory) over a bus. The latency of communication between parallel threads does not degrade parallel performance or limit the graininess of applications. Parallel applications can execute with good speedup and scaling on a proposed architecture which is designed to obtain maximum advantage from the overlapping-cluster characteristic, and also allows dynamic workload migration without moving the instructions or data. Scalability limitations of bus-based shared-memory multiprocessors are overcome by judicious workload allocation schemes, that take advantage of the overlapping-cluster memberships. Bus-based rhombic shared-memory multiprocessors are examined in terms of parallel speedup models to explain their advantages and justify their use as a foundation for the proposed computer architecture. Interconnection bandwidth is maximized with bi-directional circular and segmented overlapping buses. Strategies for mapping parallel application communication topologies to rhombic architectures are developed. Analytical models of enhanced rhombic multiprocessor performance are developed with a unique bandwidth modeling technique, and are compared with the results of simulation.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. X. BaiQiang and N. Ida. A dynamically segmented bus architecture. Computers and Electronics Engineering, 16(3), 1990.

  2. L. N. Bhuyan, R. R. Iyer, T. Askar, A. K. Nanda, and M. Kumar. Performance of multistage bus networks for a distributed shared memory multiprocessor. IEEE Transactions on Parallel and Distributed Systems, 8(1), January 1997.

  3. E. A. Carmona. Modeling the serial and parallel fractions of a parallel program. Journal of Parallel and Distributed Computing, 13: 286-298, 1991.

    Google Scholar 

  4. C. Evequoz. Cost-performance analysis of cascaded crossbar interconnected multiprocessors. IEEE Proceedings on Computers and Digital Technology, 142(2), March 1995.

  5. J. L. Gustafson. Reevaluating Amdahl's law. Communications of the ACM, 31(5), May 1988.

  6. K. E. Hoganson. Rhombus multiprocessor architecture and operating system. Master's thesis, University of North Carolina at Charlotte, 1990.

  7. K. E. Hoganson, B. Wilkinnson, and W. H. Carlisle. Applications of rhombic multiprocessors. In Proceedings of the 1997 International Conference on Parallel and Disributed Processing Techniques and Applications ('97), July 1997.

  8. K. E. Hoganson. Parallel applications and architectures utilizing bi-directional rhombic interconnection networks with circular and segmented buses. Ph.D. dissertation, Auburn University, 1997.

  9. T. Ishikawa. Hypercube multiprocessor with bus connections for improving communication performance. IEEE Transactions on Computers, 44(11), November 1995.

  10. C. Katsinis. A multicomputer architecture with a segmented shared bus. Computers and Electronics Engineering, 21(1), 1995.

  11. D. E. Lenoski, A. Gupta, J. Hennessy, et al. The Dash Prototype: Implementation and Performance. In Proceedings of the 19th Annual International Symposium on Computer Architecture, vol. 20, no. 2, ACM, 1992.

  12. D. E. Lenoski. Scalable Shared-Memory Multiprocessing., Morgan Kaufman Publishers, San Francisco, Calif., 1995.

    Google Scholar 

  13. T. Lovitt and T. Shreekant. The symmetry multiprocessor system. In Proceedings of the 1988 International Conference on Parallel Processing, IEEE, 1988.

  14. X. H. Sun and J. L. Gustafson. Toward a better parallel performance metric. Parallel Computing, 17: 1093-1109, 1991.

    Google Scholar 

  15. F. A. Van-Catledge. Toward a general model for evaluating the relative performance of computer systems. The International Journal of Supercomputer Applications, 3(2): 100-108, Summer 1989.

    Google Scholar 

  16. B. Wilkinson. Cascaded rhombic crossbar interconnection networks. Journal of Parallel and Distributed Computing, 10(1), 1990.

  17. B. Wilkinson. Comparative performance of overlapping connectivity multiprocessor interconnection networks. The Computer Journal, 34(3), 1991.

  18. B. Wilkinson. Multiple bus network with overlapping connectivity. IEEE Proceedings-E, 138(4), 1991.

  19. B. Wilkinson. Overlapping connectivity interconnection networks for shared memory multiprocessor systems. Journal of Parallel and Distributed Computing, 15, 1992.

  20. B. Wilkinson. On crossbar switch and multiple bus interconnection networks with overlapping connectivity. IEEE Transactions on Computers, 41(6), June 1992.

  21. B. Wilkinson and J. M. Farmer. Reflective interconnection networks. Computers and Electronics Engineering, 20(4), 1994.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hoganson, K. Mapping Parallel Application Communication Topology to Rhombic Overlapping-Cluster Multiprocessors. The Journal of Supercomputing 17, 67–90 (2000). https://doi.org/10.1023/A:1008171822063

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1008171822063

Navigation