Abstract
The study of interconnection networks is important because the overall performance of a distributed system is often critically hinged on the effectiveness of its interconnection network. This paper addresses the problem of interconnection networks performance modeling of large-scale distributed systems with emphases on heterogeneous multi-cluster computing systems. We present an analytical model to predict message latency in multi-cluster systems in the presence of node, network and system organization heterogeneity. The model is validated through comprehensive simulation, which demonstrates that the proposed model exhibits a good degree of accuracy for various system organizations and under different working conditions.
Similar content being viewed by others
References
Xu MQ (2001) Effective meta-computing using LSF multi-cluster. In: Proceedings of the IEEE international conference on cluster and grid, Brisbane, Australia, 15–18 May 2001, pp 100–106
Foster I (2002) The grid: a new infrastructure for 21st century science. Phys Today 55(2):42–48
Abawajy JH, Dandamudi SP (2003) Parallel job scheduling on multi-cluster computing systems. In: Proceedings of the IEEE international conference on cluster computing, Hong Kong, 1–4 Dec 2003, pp 11–18
DAS-2 (2002) The DAS-2 supercomputer. http://www.cs.vu.nl/das2
Boas B (2003) Storage on the lunatic fringe. Lawrence Livermore National Laboratory. In: Panel at Supercomputing Conference 2003, Phoenix, AZ, 15–21 Nov 2003
Bucur AID, Epema DHJ (2000) The influence of the structure and sizes of the jobs on the performance of co-allocation. In: Lecture notes in computer science, vol 1911. Springer, pp 154–173
Chun ATT, Wang CL (2000) Contention-free complete exchange algorithm on clusters. In: Proceedings of the IEEE international conference on cluster computing, Saxony, Germany, 28 Nov–1 Dec 2000, pp 57–64
Sarbazi-Azad H, Khonsari A, Ould-Khaoua M (2002) Performance analysis of deterministic routing in wormhole k-ary n-cubes with virtual channels. J Interconnect Netw 3(1–2):67–83
Ould-Khaoua M (1999) A performance model for Duato’s fully-adaptive routing algorithm in k-ary n-cubes. IEEE Trans Comput 42(12):1–8
Draper JT, Ghosh J (1994) A comprehensive analytical model for wormhole routing in multi-computer systems. J Parallel Distrib Comput 23(2):202–214
Boura YM, Das CR (1997) Performance analysis of buffering schemes in wormhole routers. IEEE Trans Comput 46(6):687–694
Hu PC, Kleinrock L (1995) A queuing model for wormhole routing with timeout. In: Proceedings of the 4th international conference on computer communications and networks, Las Vegas, NV, 20–23 Sep 1995, pp 584–593
Du X, Zhang X, Zhu Z (2000) Memory hierarchy consideration for cost-effective cluster computing. IEEE Trans Comput 49(5):915–933
Javadi B, Khorsandi S, Akbari MK (2004) Queuing network modeling of a cluster-based parallel systems. In: Proceedings of the 7th international conference on high performance computing and grids, Tokyo, Japan, 20–22 Jul 2004, pp 304–307
Javadi B, Khorsandi S, Akbari MK (2005) Study of cluster-based parallel systems using analytical modeling and simulation. In: Lecture notes in computer science, vol 3483, Springer, 2005, pp 1262–1271
Clematis A, Corana A (1999) Modeling performance of heterogeneous parallel computing systems. J Parallel Comput 25(9):1131–1145
Dally W, Towles B (2004) Principles and practices of interconnection networks. Morgan Kaufmann, San Francisco
Thunder Statement of Work (2003) University of California, Lawrence Livermore National Laboratory, Sep 2003.
InfiniBand clustering, delivering better price/performance than Ethernet (2005) White Paper, Mellanox Technologies Inc, Santa Clara
Building scalable, high performance cluster/grid networks: the role of Ethernet (2004) White Paper, Force10 Networks Inc, Milpitas
Lin X (2003) An efficient communication scheme for fat-tree topology on infiniband networks, MSc Thesis, Department of Information Engineering and Computer Science, Feng Chia University, Taiwan
Javadi B, Abawajy JH, Akbari MK (2006) Modeling and analysis of heterogeneous loosely-coupled distributed systems. Technical Report TR C06/1, School of Information Technology, Deakin University, Australia, Jan 2006
Schroeder MD et al (1990) Autonet: a high-speed, self configuring local area network using point-to-point links. SRC research report 59, Digital Equipment Corporation, Apr 1990
Dongarra J, Lastovetsky A (2006) An overview of heterogeneous high performance and grid computing. In: DiMartino B, Dongarra J, Hoisie A, Yang L, Zima H (eds) Engineering the grid: status and perspective. American Scientific
Kim J, Lilja DJ (1998) Characterization of communication patterns in message-passing parallel scientific application programs. In: Lecture notes in computer science, vol 1362. Springer, pp 202–216
Kleinrock L (1975) Queuing system: computer applications, vol 2. Wiley, New York
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Javadi, B., Abawajy, J.H. & Akbari, M. Analytical modeling of interconnection networks in heterogeneous multi-cluster systems. J Supercomput 40, 29–47 (2007). https://doi.org/10.1007/s11227-006-0011-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-006-0011-6