A cost-efficient congestion management methodology for fat-trees using traffic pattern detection

The Journal of Supercomputing

Abstract

Interconnection networks have a great impact on the performance of parallel systems: they provide the communication mechanism and framework that parallel applications need. One important such network is the fat-tree. Selection functions have been shown to have a great impact on the performance of fat-trees, and they perform differently under different traffic patterns. The stage and destination priority (SADP) selection function has been shown to perform better under uniform traffic, while the stage and origin priority (SAOP) selection function has been shown to perform better under hot-spot traffic. In this paper, we propose a cost-efficient congestion management mechanism for fat-trees that chooses the selection function suited to the current traffic pattern. The mechanism detects the current traffic pattern and switches to the selection function that has been shown to give better performance under that pattern, which directly decreases congestion in the network. First, we analyze hot-spot traffic in fat-trees when the SADP selection function is used, derive a condition for the existence of hot-spot traffic under SADP, and give an implementation for detecting this condition. Once the condition is detected, the network is forced to switch to the SAOP selection function. Then, we use the analysis of SAOP to derive a condition for detecting that non-hot-spot traffic exists in the fat-tree, and give an implementation for detecting this condition as well. When it is detected, the network switches back to the SADP selection function. We use synthetic workloads to show the accuracy of the proposed mechanism in detecting hot-spot traffic in the network. We show that the overhead of the proposed mechanism is a constant number of bits per physical link. Finally, we compare the proposed mechanism with other techniques.
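
The switching behaviour described in the abstract can be read as a two-state machine. The following is a minimal sketch of that reading only, not the authors' implementation: the abstract does not state the detection conditions, so they are passed in as plain boolean flags, and the names `SelectionFunction`, `next_selection_function`, `run`, `hot_spot_detected` and `non_hot_spot_detected` are hypothetical.

```python
from enum import Enum


class SelectionFunction(Enum):
    SADP = "stage and destination priority"
    SAOP = "stage and origin priority"


def next_selection_function(current, hot_spot_detected, non_hot_spot_detected):
    """Return the selection function to use after one detection step.

    Both flags are hypothetical stand-ins for the conditions derived in the
    paper; how they are computed is not specified in the abstract.
    """
    # Under SADP, a detected hot-spot condition forces a switch to SAOP.
    if current is SelectionFunction.SADP and hot_spot_detected:
        return SelectionFunction.SAOP
    # Under SAOP, a detected non-hot-spot condition switches back to SADP.
    if current is SelectionFunction.SAOP and non_hot_spot_detected:
        return SelectionFunction.SADP
    # Otherwise the current selection function stays in place.
    return current


def run(flags, start=SelectionFunction.SADP):
    """Apply a sequence of (hot_spot, non_hot_spot) flag pairs.

    Returns the name of the active selection function after each step.
    """
    current = start
    history = []
    for hot, non_hot in flags:
        current = next_selection_function(current, hot, non_hot)
        history.append(current.name)
    return history
```

For example, `run([(False, False), (True, False), (True, False), (False, True)])` starts under SADP, moves to SAOP when the hot-spot flag is raised, stays there while the non-hot-spot flag is clear, and returns to SADP once that flag is raised.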

Author information

Corresponding author

Correspondence to Hatem M. El-Boghdadi.

About this article

Cite this article

Farouk, A., El-Boghdadi, H.M. A cost-efficient congestion management methodology for fat-trees using traffic pattern detection. J Supercomput 71, 1249–1276 (2015). https://doi.org/10.1007/s11227-014-1359-7

  • DOI: https://doi.org/10.1007/s11227-014-1359-7
