Skip to main content
Log in

An up-to-date survey in web load balancing

  • Published:
World Wide Web Aims and scope Submit manuscript

Abstract

This survey is an up-to-date state-of-the-art in Web load balancing mechanisms that includes all the possible classifications and focuses on the advantages of using load balancing solutions to increase the performance of the Web system. A general description of the Web load balancing solutions is included and organised by differentiating the OSI protocol stack layer the load balancing is based on. The most important request distributing polices that are proposed in the literature are also included. This article summarises all previous surveys on the Web load balancing subject and updates the state-of-the-art with the most recent load balancing proposals.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Andreolini, M., Colajanni, M., Nuccio, M.: Kernel-based web switches providing content-aware routing. In: Proc. of the 2nd IEEE International Symposium on Network Computing and Applications (NCA’03) (2003)

  2. Apostolopoulos, G., Aubespin, D., Peris, V.G.J., Pradhan, P., Saha, D.: Design, implementation and performance of a content-based switch. In: Proc. of INFOCOM (2000)

  3. Aron, M., Druschel, P., Zwaenepoel, W.: Efficient support for P-HTTP in cluster-based web servers. In: Proc. of the Annual Conference on USENIX Annual Technical Conference (1999)

  4. Aron, M., Druschel, P., Zwaenepoel, W.: Cluster reserves: a mechanism for resource management in cluster-based network servers. In: Proc. of ACM SIGMETRICS (2000)

  5. Aron, M., Sanders, D., Druschel, P., Zwaenepoel, W.: Scalable content-aware request distribution in cluster-based network servers. In: Proc. of the USENIX 2000 Annual Technical Conference (2000)

  6. Barroso, L., Dean, J., Hoelzle, U.: Web search for a planet: the google cluster architecture. IEEE Micro 23, 22–28 (2003)

    Article  Google Scholar 

  7. Bent, L., Rabinovich, M., Voelker, G.M., Xiao, Z.: Characterization of a large web site population with implications for content delivery. In: Proc. of the 13th International Conference on World Wide Web (2004)

  8. Borzemski, L., Zatwarnicki, K.: A fuzzy adaptive request distribution algorithm for cluster-based web systems. In: Proc. of the 11th Euromicro Conference on Parallel, Distributed and Network-Based Processing (Euro PDP) (2003)

  9. Brendel, J.: Client-side resource-based load-balancing with delayed-resource-binding using TCP state migration to WWW server farm. United States Patent 6,182,139. Resonate Inc (2001)

  10. Brisco, T.P.: DNS support for Load Balancing. RFC 1794 (1995)

  11. Cardellini, V., Casalicchio, E., Colajanni, M., Yu, P.S.: The state of the art in locally distributed web-server systems. ACM Comput. Surv. 34, 263–311 (2002). doi:10.1145/508352.508355

    Article  Google Scholar 

  12. Cardellini, V., Colajanni, M., Yu, P.S.: Dynamic load balancing on web-server systems. IEEE Int. Comp. 3(3), 28–39 (1999)

    Article  Google Scholar 

  13. Carrera, E., Bianchini, R.: Efficiency vs. portability in cluster-based network servers (2001)

  14. Casalicchio, E., Colajanni, M.: A client-aware dispatching algorithm for web clusters providing multiple services. In: Proc. of the 10th International Conference on World Wide Web (2001)

  15. Cavale, M.R.: Introducing Microsoft Cluster Service (MSCS) in the Windows Server 2003 Family. Microsoft Corporation (2002)

  16. Chang, Y.K., Cheng, W.H., Young, C.P.: Fully pre-splicing TCP for web switches. In: Proc. of the 1st International Conference on Innovative Computing, Information and Control (ICICIC) (2006)

  17. Cherkasova, L., DeSouza, M., Ponnekanti, S.: Performance analysis of “content-aware” load balancing strategy FLEX: two case studies. In: Proc. of the 34th Hawaii International Conference on System Sciences (2001)

  18. Cherkasova, L., Karlsson, M.: Scalable web server cluster design with workload-aware request distribution strategy WARD. In: Proc. of the Third International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems (WECWIS) (2001)

  19. Chiang, M.L., Lin, Y.C., Guo, L.F.: Design and implementation of an efficient web cluster with content-based request distribution and file caching. J. Syst. Softw. 81, 2044–2058 (2008)

    Article  Google Scholar 

  20. Chiang, M.L., Wu, C.H., Liao, Y.J., Chen, Y.F.: New content-aware request distribution policies in web clusters providing multiple services. In: Proc. of the ACM Symposium on Applied Computing (2009)

  21. Choi, E.: Performance test and analysis for an adaptive load balancing mechanism on distributed server cluster systems. Future Gener. Comput. Syst. 20, 237–247 (2004)

    Article  Google Scholar 

  22. Ciardo, G., Riska, A., Smirni, E.: EQUILOAD: a load balancing policy for clustered web servers. Perform. Eval. 46(2–3), 101–124 (2001)

    Article  MATH  Google Scholar 

  23. Cisco Systems, I.: Scalable Content Switching. A discussion of the cisco css 11500 series content services switch architecture. White Paper (2002)

  24. Cisco systems, inc. http://www.cisco.com/ (2010). Accessed 26 Nov 2010

  25. Cohen, A., Rangarajan, S., Slye, H.: On the performance of TCP splicing for URL-aware redirection. In: Proc. of the 2nd Conference on USENIX Symposium on Internet Technologies and Systems (1999)

  26. Colajanni, M., Yu, P.S.: A performance study of robust load sharing strategies for distributed heterogeneous web server systems. IEEE Trans. Knowl. Data Eng. 14(2), 398–414 (2002)

    Article  Google Scholar 

  27. Colby, S., Krawezyk, J.J., Nair, R.K., Royee, K., Siegel, K.P., Stevens, R.C., Wasson, S.: Method and System for Directing a Flow Between a Client and a Server. United States Patent 6,006,264 (2001). Arrowpoint Communications, Inc

  28. Dahlin, M.: Interpreting stale load information. IEEE Trans. Parallel Distrib. Syst. 11(10), 1033–1047 (2000)

    Article  Google Scholar 

  29. Damani, O.P., Chung, E., Huang, Y., Kintala, C., Wang, Y.M.: ONE-IP: techniques for hosting a service on a cluster of machines. Comput. Netw. ISDN Syst. 29, 1019–1027 (1997)

    Article  Google Scholar 

  30. (DARPA), D.A.R.P.A.: Transmission Control Protocol. RFC 793 (1981)

  31. F5 Networks, Inc.: http://www.f5.com/ (2010)

  32. Faour, A., Mansour, N.: Weblins: A scalable www cluster-based server. Adv. Eng. Softw. 37, 11–19 (2006)

    Article  Google Scholar 

  33. Fielding, R.T., Gettys, J., Mogul, J., Frystyk, H., Masinter, L., Leach, P.J., Berners-Lee, T.: Hypertext transfer protocol—HTTP/1.1. RFC 2616 (1999)

  34. Gan, X., Schroeder, T., Goddard, S., Ramamurthy, B.: Highly available and scalable cluster-based web servers. In: Proc. of the 8th IEEE International Conference on Computer Communications and Networks (1999)

  35. Gilly, K., Alcaraz, S., Juiz, C., Puigjaner, R.: Analysis of burstiness monitoring and detection in an adaptive web system. Comput. Networks 53, 668–679 (2009)

    Article  Google Scholar 

  36. Gilly, K., Juiz, C., Alcaraz, S., Puigjaner, R.: Adaptive admission control algorithm in a QoS-aware web system. In: Proc. of IEEE International Symposium on Modelling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS) (2009)

  37. Goddard, S., Schroeder, T.: The SASHA architecture for network-clustered web servers. In: Proc. of the 6th IEEE International Symposium on High Assurance Systems Engineering (2001)

  38. Hunt, G., Nahum, E., Tracey, J.: Enabling Content-based Load Distribution for Scalable Services. Tech. rep., IBM T.J. Watson Research Center (1997)

  39. IBM: http://www.ibm.com/ (2010)

  40. IBM: Application Switching with Nortel Networks Layer 2–7 gigabit Ethernet Switch Module for IBM Bladecenter. IBM Redbook (2006)

  41. Iyengar, A., Challenger, J., Dias, D., Dantzig, P.: High-performance web site design techniques. IEEE Int. Comp. 4, 17–26 (2000)

    Article  Google Scholar 

  42. Kachris, C., Vassiliadis, S.: Design of a web switch in a reconfigurable platform. In: Proc. of the 2006 ACM/IEEE Symposium on Architectures for Networking and Communications Systems (2006)

  43. Kerdlapanan, D., Khunkitti, A.: Content-based load balancing with multicast and tcp-handoff. In: Proc. of International Symposium on Circuits and Systems (2003)

  44. Kobayashi, M., Murase, T.: Asymmetric tcp splicing for content-based switches. In: Proc. of IEEE International Conference on Communications (ICC) (2002)

  45. Kokku, R., Rajamony, R., Harrick Vin, L.A.: Half-pipe anchoring: an efficient technique for multiple connection handoff. In: Proc. of the 10th IEEE International Conference on Network Protocols (2002)

  46. Kopparapu, C.: Load Balancing Servers, Firewalls and Caches. Wiley (2001)

  47. Krishnamurthy, B., Wills, C., Zhang, Y.: On the use and performance of content distribution networks. In: Proc. of the 1st ACM SIGCOMM Workshop on Internet Measurement (2001)

  48. Li, C., Peng, G., Gopalan, K., cker Chiueh, T.: Performance guarantee for cluster-based internet services. In: Proc. of the 23rd International Conference on Distributed Computing Systems (2003)

  49. Lin, Y.D., Tsai, P.T., Lin, P.C., Tien, C.M.: Direct web switch routing with state migration, TCP masquerade, and cookie name rewriting. In: Proc. of Global Telecommunications Conference (2003)

  50. Liu, H.H., Chiang, M.L.: Tcp rebuilding for content-aware request dispatching in web clusters. Journal of Internet Technology 6, 231–240 (2005)

    Google Scholar 

  51. Liu, H.H., Chiang, M.L., Wu, M.C.: Efficient support for content-aware request distribution and persistent connection in Web clusters. Softw. Pract. Exp. 37, 1215–1241 (2007)

    Article  Google Scholar 

  52. Luo, M.Y., Yang, C.S.: System support for scalable, reliable and highly manageable web hosting service. In: Proc. of the 3rd conference on USENIX Symposium on Internet Technologies and Systems (2001)

  53. Luo, M.Y., Yang, C.S., Tseng, C.W.: Analysis and improvement of content-aware routing mechanisms. IEICE Trans. Commun. E88, 227–238 (2005)

    Article  Google Scholar 

  54. Maltz, D.A., Bhagwat, P.: TCP Splicing for Application Layer Proxy Performance. Tech. Rep., IBM (1998)

  55. Marwah, M., Mishra, S., Fetzer, C.: Fault-tolerant and scalable TCP splice and web server architecture. In: Proc. of the 25th IEEE Symposium on Reliable Distributed Systems (2006)

  56. Mogul, J.C.: The case for persistent-connection HTTP. In: Proc. of SIGCOMM (1995)

  57. Networks, F.: http://www.foundrynet.com (2010)

  58. Networks, N.: http://www.nortel.com/ (2010)

  59. Ok, M., Park, M.S.: Distributing requests by (around k)-bounded load-balancing in web server cluster with high scalability. IEICE Trans. Inf. Sys. E89-D, 663–672 (2006)

    Article  Google Scholar 

  60. Pai, V.S., Aron, M., Banga, G., Svendsen, M., Druschel, P., Zwaenepoel, W., Nahum, E.M.: Locality-aware request distribution in cluster-based network servers. In: Proc. of Architectural Support for Programming Languages and Operating Systems (ASPLOS) (1998)

  61. Papathanasiou, A.E., Hensbergen, E.V.: KNITS: switch-based connection hand-off. In: Proc. of IEEE INFOCOM (2002)

  62. Park, S.Y., Park, D., Lee, J., Cho, J.W.: Efficient inter-backend prefetch algorithms in cluster-based web servers. In: Proc. of International Conference/Exhibition on High Performance Computing (2001)

  63. Radware: http://www.radware.com (2010)

  64. Resonate: Resonate central dispatch technology advantage: TCP connection HOP. White Paper (2001)

  65. Resonate, inc. http://www.resonate.com/ (2010)

  66. Riska, A., Sun, W., Smirni, E., Ciardo, G.: ADAPTLOAD: effective balancing in clustered web servers under transient load conditions. In: Proc. of the 22nd International Conference on Distributed Computing Systems (2002)

  67. Rosu, M.C., Rosu, D.: An evaluation of TCP splice benefits in web proxy servers. In: Proc. of WWW (2002)

  68. Satake, S., Inai, H.: Special issue on internet architecture technology papers: a nonprobabilistic server selection method based on periodically obtained load information for web server clusters. Electron. Commun. Jpn. 89, 1–12 (2006)

    Article  Google Scholar 

  69. Schroeder, T., Goddard, S., Ramamurthy, B.: Scalable web server ciustering technologies. IEEE Netw. May, 38–46 (2000)

    Article  Google Scholar 

  70. Server, L.V.: http://www.linuxvirtualserver.org/ (2006). Accessed 26 Nov 2010

  71. Shan, Z., Lin, C., Marinescu, D.C., Yang., Y.: Modeling and performance analysis of QoS-aware load balancing of web-server clusters. Comput. Networks 40, 235–256 (2002)

    Article  Google Scholar 

  72. Sharifian, S., Motamedi, S.A., Akbarib, M.K.: A content-based load balancing algorithm with admission control for cluster web servers. Future Gener. Comput. Syst. 24, 775–787 (2008)

    Article  Google Scholar 

  73. Sit, Y.F., Wang, C.L., Lau, F.: Socket cloning for cluster-based web servers. In: Proc. of IEEE International Conference on Cluster Computing (2002)

  74. Sit, Y.F., Wang, C.L., Lau, F.: Cyclone: a high-performance cluster-based web server with socket cloning. Cluster Comput. 7, 21–37 (2004)

    Article  Google Scholar 

  75. Snoeren, A.C., Andersen, D.G., Balakrishnan, H.: Fine-grained failover using connection migration. In: Proc. of 3rd USENIX Symp. on Internet Technologies and Systems (2001)

  76. Switching, L.L.: http://www.linuxvirtualserver.org/software/ktcpvs/ktcpvs.html (2010)

  77. Syme, M., Goldie, P.: Optimizing Network Performance with Content Switching. Server, Firewall and Cache Load Balancing. Prentice Hall (2004)

  78. Takahashi, M., Kohiga, A., Sugawara, T., Tanaka, A.: Tcp-migration with application-layer dispatching: a new http request distribution architecture in locally distributed web server systems. In: Proc. of the 1st International Conference on Communication System Software and Middleware (2006)

  79. Tang, W., Cherkasova, L., Russell, L., Mutka, M.W.: Modular tcp handoff design in streams-based tcp/ip implementation. In: Proc. of the 1st International Conference on Networking-Part 2 (2001)

  80. Tang, X., Chanson, S.T.: On caching effectiveness of web clusters under persistent connections. J. Parallel Distrib. Comput. 63, 981–995 (2003)

    Article  MATH  Google Scholar 

  81. TCPHA project. http://dragon.linux-vs.org/~dragonfly/htm/tcpha.htm (2004). Accessed 26 Nov 2010

  82. Teo, Y.M., Ayani, R.: Comparison of load balancing strategies on cluster-based web servers. Trans. of the Soc. for Model. and Sim. 77, 185–195 (2001)

    Google Scholar 

  83. Yang, C.S., Luo, M.Y.: Efficient support for content-based routing in web server clusters. In: Proc. of the 2nd Conference on USENIX Symposium on Internet Technologies and Systems, vol. 2 (1999)

  84. Yang, J., Jin, D., Li, Y., Hielscher, K.S., German, R.: Modeling and simulation of performance analysis for a cluster-based web server. Simulation Modelling Practice and Theory 14, 188–200 (2006)

    Article  Google Scholar 

  85. Yao, J., Ding, J.J., Bhuyan, L.N.: Intelligent message scheduling in application oriented networking systems. In: Proc. of IEEE International Conference on Communications (ICC) (2008)

  86. Zeng-Kai, D., Jiu-Bin, J.: A completely distributed architecture for cluster-based web servers. In: Proc. of the 4th International Conference on Parallel and Distributed Computing, Applications and Technologies (2003)

  87. Zhang, Q., Mi, N., Riska, A., Smirni, E.: Load unbalancing to improve performance under autocorrelated traffic. In: Proc. of the 26th IEEE International Conference on Distributed Computing Systems (2006)

  88. Zhang, Q., Riska, A., Sun, W., Smirni, E., Ciardo, G.: Workload-aware load balancing for clustered web servers. IEEE Trans. Parallel Distrib. Syst. 3, 219–233 (2005)

    Article  Google Scholar 

  89. Zhang, R., Abdelzaher, T.F., Stankovic, J.A.: Efficient TCP connection failover in web server clusters. In: Proc. of IEEE INFOCOM (2004)

  90. Zhang, W.: Linux virtual server for scalable network services. In: Proc. of OTTAWA Linux Symposium (2000)

  91. Zhang, X., Barrientos, M., Chen, J.B., Seltzer, M.: HACC: an architecture for cluster-based web servers. In: Proc. of the 3rd USENIX Windows NT Symposium (1999)

  92. Zhao, L., Luo, Y., Bhuyan, L., Iyer, R.: Design and implementation of a content-aware switch using a network processor. In: Proc. of the 13th Symposium on High Performance Interconnects (2005)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Katja Gilly.

Additional information

This work was partially funded by the Spanish Ministry of Education and Science under Grant TIN2006-02265.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gilly, K., Juiz, C. & Puigjaner, R. An up-to-date survey in web load balancing. World Wide Web 14, 105–131 (2011). https://doi.org/10.1007/s11280-010-0101-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11280-010-0101-5

Keywords

Navigation