Scalable Web server architectures

A scalable web server architecture is key to enabling WWW sites to handle the ever increasing traffic loads. There is empirical evidence that, for the current generation of web server applications, multiprocessor platforms do not provide the needed scalability to handle large traffic volumes. This is primarily due to the fact that current TCP/IP implementations are single-threaded which severely limits the multiprocessor speedup for HTTP server applications. Therefore a distributed architecture consisting of independent servers sharing the load is more appropriate for implementing a scalable web server. Most current distributed server platforms typically use some form of Round-Robin or weighted Round-Robin implemented using the DNS server. This approach for distributing the load requires each HTTP server to handle requests for all the data stored at the site which implies that either each server must store its own copy of all the data available at the site or it must access the data from other HTTP servers or some dedicated back-end file or database servers thus generating significant back-end traffic. Neither of these options scale effectively because they lead to bottlenecks in the LAN or back-end servers or require significant additional resources. The "redirection-based" hierarchical server architecture eliminates bottlenecks in the server complex and allows for hardware to be added seamlessly to handle increases in load. In this approach, two levels of servers are used: redirection servers and normal HTTP servers. Data are partitioned according to their content and stored on different HTTP servers. Redirection servers are used to distribute the users' requests to the corresponding HTTP servers using the redirection mechanism supported by the HTTP protocol. This approach is completely transparent to the user, allows for intelligent distribution of load among servers, and achieves better caching efficiency compared with other load balancing schemes.

[1]  Michelle Butler,et al.  A Scalable HTTP Server: The NCSA Prototype , 1994, Comput. Networks ISDN Syst..

[2]  Virgílio A. F. Almeida,et al.  Characterizing reference locality in the WWW , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[3]  James E. Pitkow,et al.  Yet Robust Caching Algorithm Based on Dynamic Access Patterns , 1994, WWW Spring 1994.

[4]  Margo I. Seltzer,et al.  The case for geographical push-caching , 1995, Proceedings 5th Workshop on Hot Topics in Operating Systems (HotOS-V).