Skip to main content
Log in

HPLB: High precision load balancing based on in-band network telemetry in data center networks

  • Published:
Peer-to-Peer Networking and Applications Aims and scope Submit manuscript

Abstract

In modern data centers, the congestion-aware load balancing makes rerouting decisions according to the traffic load. However, it is difficult to accurately obtain network load status using limited congestion information. Recently, In-band Network Telemetry (INT) has been embedded in the latest merchant silicones to support information collection of realtime traffic state from network equipments. In this paper, we present High Precision Load Balancing (HPLB), which is a load balancing method based on in-band network telemetry. With INT, HPLB collects the load information of each node on the transmission path and feeds it back to the near-source switch, which then selects the forwarding path according to the leaf-to-leaf congestion information. The test results of large-scale evaluation show that, compared with ECMP, RPS, DRILL and CONGA, HPLB reduces the flow completion times by \(\sim\)6-71% and improves link utilization by \(\sim\)13-80%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Hopps C (2000) Analysis of an Equal-Cost Multi-Path Algorithm in RFC 2992

  2. Dixit A, Prakash P, Hu YC, Kompella RR (2013) On the Impact of Packet Spraying in Data Center Networks in Proc. IEEE INFOCOMpp. 2130-2138

  3. Ghorbani S, Yang Z, Godfrey PB, Ganjali Y, Firoozshahian A (2017) DRILL: Micro Load Balancing for Low-latency Data Center Networks in Proc. ACM SIGCOMM pp. 225-238

  4. Alizadeh M, Edsall T, Dharmapurikar S et al (2014) CONGA: Distributed congestion-aware load balancing for datacenters in Proc. ACM SIGCOMMpp. 503-514

  5. Tan L, Su W, Miao J et al (2021) FindINT: Detect and Locate the Lost In-band Network Telemetry Packet. IEEE Networking Letters 4(1):20–24

    Article  Google Scholar 

  6. Pan T, Song E, Bian Z et al (2019) INT-path: Towards Optimal Path Planning for In-band Network-Wide Telemetry. In Proc, IEEE INFOCOM

    Google Scholar 

  7. Al-Fares M, Radhakrishnan S, Raghavan B, Huang N, Vahdat A (2010) Hedera: Dynamic flow scheduling for datacenter networks in Proc. USENIX NSDI pp.19-34

  8. Benson T, Anand A, Akella A, and Zhang M (2011) MicroTE: Fine grained traffic engineering for data centers in Proc. ACM CoNEXT pp. 1-12

  9. Wang W, Sun Y, Zheng K, Kaafar A M, Li D, Li Z (2014) Freeway: Adaptively isolating the elephant and mice flows on different transmission paths in Proc. IEEE ICNP pp. 362-367

  10. Perry J, Ousterhout A, Balakrishnan H et al (2014) Fastpass: A centralized zero-queue datacenter network in Proc. ACM SIGCOMM pp. 307-318

  11. Gao X, Kong L, Li W et al (2016) Traffic load balancing schemes for devolved controllers in mega data centers. IEEE Trans Parallel Distrib Syst 28(2):572–585

    Google Scholar 

  12. Raiciu C, Barre S, Pluntke C, Greenhalgh A, Wischik D and Handley M (2011) Improving datacenter performance and robustness with multipath TCP in Proc. ACM SIGCOMM pp. 266-277

  13. Vanini E, Pan R, Alizadeh M, Taheri P, Edsall T (2017) Let it Flow: Resilient Asymmetric Load Balancing with Flowlet Switching in Proc. USENIX NSDI pp. 407-420

  14. Katta N, Hira M, Kim C, Sivaraman A, Rexford J (2016) HULA: Scalable load balancing using programmable data planes in Proc. ACM SOSRpp.1-12

  15. He K, Rozner E, Agarwal K, Felter W, Carter J, Akellay A (2015) Presto: Edge-based Load Balancing for Fast Datacenter Networks in Proc. ACM SIGCOMM pp. 465-478

  16. Wang P, Trimponias G, Xu H, Geng YH (2019) Luopan: Sampling based load balancing in data center networks IEEE Trans. on Parallel and Distributed Systems 30(1):133-145

  17. Cao J, Xia R, Yang P et al (2013) Per-packetload-balanced low-latency routing for clos based data center networks in Proc. ACM CoNEXT pp.49-60

  18. Zhang H, Zhang J, Bai W et al (2017) Resilient datacenter load balancing in the wild in Proc. ACM SIGCOMM pp. 253-266

  19. Hu JB, Huang JW, Lv WJ, Zhou YT, Wang JX, He T (2019) CAPS: Coding based adaptive packet spraying to reduce flow completion time in data center. IEEE/ACM Trans. Netw. 27(6):2338–2353

    Article  Google Scholar 

  20. Liu J, Huang J, Li W et al AG (2019) Adaptive Switching Granularity for Load Balancing with Asymmetric Topology in Data Center Network in Proc. IEEE ICNP pp. 1-11

  21. Huang JW, Lv WJ, Li WH, Wang JX, He T (2018) QDAPS: Queueing Delay Aware Packet Spraying for Load Balancing in Data Center in Proc. IEEE ICNP 66-76

  22. Mittal R, Lam VT, Dukkipati N et al (2015) TIMELY: RTT-based Congestion Control for the Datacenter. ACM SIGCOMM Computer Communication Review 45(4):537–550

    Article  Google Scholar 

  23. Alizadeh M, Greenberg A, Maltz D A et al (2010) Data center tcp (dctcp) in Proc. ACM SIGCOMM 63-74

  24. Tan L. An article to understand in-band network telemetry. [Online]. Available: https://www.sdnlab.com/23822.html. Accessed 2 Jan 2020

  25. Bosshart P, Daly D, Gibb G, Izzard M, McKeown N, Rexford J, Schlesinger C, Talayco D, Vahdat A, Varghese G et al (2014) P4: Programming protocol-independent packet processors. ACM SIGCOMM Computer Communication Review 44(3):87–95

    Article  Google Scholar 

  26. Zhu Y, Eran H, Firestone D et al (2015) Congestion control for large-scale RDMA deployments in Proc. ACM SIGCOMM 523-536

  27. Gao X. How Much Network Telemetry Knows - INT. [Online]. Available: https://www.sdnlab.com/23822.html. Accessed 10 Jun 2020

  28. Tan L et al (2020) In-band Network Telemetry: A Survey. Comput Netw 186(107763):1–20

    Google Scholar 

  29. Kim C, Sivaraman A, Katta N et al (2015) In-band network telemetry via programmable dataplanes in Proc. ACM SIGCOMM pp. 1-3

  30. Kabbani A, Sharif M (2017) Flier: Flow-level congestion-aware routing for direct-connect data centers in Proc. IEEE INFOCOM pp. 1-9

  31. Gao, X, Xin Z, Li K (2015) Congestion Control Algorithm for Data Center Services Computing Conference IEEE pp. 156-161

  32. Perry J, Balakrishnan H, Shah D (2017) Flowtune: Flowlet Control for Datacenter Networks in Proc. USENIX NSDI pp. 421-435

  33. Chowdhury SR, Boutaba R, Franois J (2021) LINT: Accuracy-adaptive and Lightweight In-band Network Telemetry. In Proc, IFIP/IEEE International Symposium on Integrated Network Management

    Google Scholar 

  34. Cui Z, Hu Y, Hou S (2021) An INT-based Load Balancing Mechanism for Cloud Datacenters in Proc. 2nd International Conference on Electronics and Communication & Network and Computer Technology (ECNCT)pp. 12071-12080

  35. Rossi FD et al (2021) Near-Optimal Probing Planning for In-Band Network Telemetry. IEEE Commun Lett 25(5):1630–1634

    Article  Google Scholar 

  36. Ran BB et al (2020) PINT: Probabilistic In-band Network Telemetry in Proc. ACM SIGCOMM pp. 662-680

  37. Hu JB, Huang JW, Lv WJ, Li WH, Wang JX, He T (2021) TLB: Traffic-aware load balancing with adaptive granularity in data center networks. IEEE/ACM Trans. Netw. 29(5):2367–2384

    Article  Google Scholar 

  38. Bai W, Chen L, Chen K, Han D, Tian C, Wang H (2015) Informationagnostic flow scheduling for commodity data centers in Proc. USENIX NSDIpp. 455-468

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (62132022, 61872387, 62102047),  Key Research and Development Program of Hunan (2022WK2005), Natural Science Foundation of Hunan Province, China (2021JJ30867).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jiawei Huang.

Ethics declarations

Conflicts of interest

We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gao, W., Huang, J., Jiang, N. et al. HPLB: High precision load balancing based on in-band network telemetry in data center networks. Peer-to-Peer Netw. Appl. 15, 2503–2515 (2022). https://doi.org/10.1007/s12083-022-01381-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12083-022-01381-w

Keywords

Navigation