Scalable Web server cluster design with workload-aware request distribution strategy WARD

We consider a Web cluster in which the content-aware distribution is performed by each of the nodes in a Web cluster. Each server in the cluster may forward a request to another node based on the requested content. We propose a new workload-aware request distribution strategy WARD, that assigns a small set of most frequent files, called core, to be served locally, by any server in a cluster, while partitioning the rest of the files to be served by different cluster nodes. We propose an algorithm, called ward-analysis, to compute the nearly optimal core size. The algorithm takes into account workload access patterns and cluster parameters such as number of nodes, node RAM, TCP handoff overhead, and disk access overhead. Our simulations driven by a realistic workload show that WARD achieves super-linear speedup with increased cluster size. It shows superior performance compared with traditional round-robin strategy (up to 260% increased throughput for a cluster of 16 nodes), and outperforms a pure partitioning strategy based on a cache-affinity requests distribution (up to 50% increased throughput for a cluster of 16 nodes).

[1]  Haakon Bryhni,et al.  A comparison of load balancing techniques for scalable Web servers , 2000, IEEE Netw..

[2]  Magnus Karlsson,et al.  Dynamics and evolution of Web sites: analysis, metrics and design issues , 2001, Proceedings. Sixth IEEE Symposium on Computers and Communications.

[3]  H. Schwetman Object-oriented simulation modeling with C++/CSIM17 , 1995, Winter Simulation Conference Proceedings, 1995..

[4]  Margo Seltzer,et al.  HACC: an architecture for cluster-based web servers , 1999 .

[5]  Martin F. Arlitt,et al.  Web server workload characterization: the search for invariants , 1996, SIGMETRICS '96.

[6]  Willy Zwaenepoel,et al.  Scalable Content-aware Request Distribution in Cluster-based Network Servers , 2000, USENIX Annual Technical Conference, General Track.

[7]  Ludmila Cherkasova FLEX: load balancing and management strategy for scalable Web hosting service , 2000, Proceedings ISCC 2000. Fifth IEEE Symposium on Computers and Communications.

[8]  Herb Schwetman Object-oriented simulation modeling with C++/CSIM17 , 1995, WSC '95.

[9]  Sampath Rangarajan,et al.  On the Performance of TCP Splicing for URL-Aware Redirection , 1999, USENIX Symposium on Internet Technologies and Systems.

[10]  Byrav Ramamurthy,et al.  Scalable Web server clustering technologies , 2000, IEEE Netw..

[11]  ZwaenepoelWilly,et al.  Locality-aware request distribution in cluster-based network servers , 1998 .