Abstract
This survey presents the current state of the art in Web load balancing mechanisms. It covers the existing classifications and focuses on the advantages of using load balancing solutions to increase the performance of a Web system. A general description of Web load balancing solutions is included, organised by the OSI protocol stack layer on which the load balancing is based. The most important request distribution policies proposed in the literature are also included. This article summarises the previous surveys on Web load balancing and updates the state of the art with the most recent load balancing proposals.
Additional information
This work was partially funded by the Spanish Ministry of Education and Science under Grant TIN2006-02265.
About this article
Cite this article
Gilly, K., Juiz, C. & Puigjaner, R. An up-to-date survey in web load balancing. World Wide Web 14, 105–131 (2011). https://doi.org/10.1007/s11280-010-0101-5
DOI: https://doi.org/10.1007/s11280-010-0101-5