QoS-based Architectures for Geographically Replicated Web Servers

In this paper, we discuss strategies for providing World Wide Web service users with adequate Quality of Service (QoS). We argue that QoS can be provided by distributing the service requests processing load among replicated Web servers (WSs), that can be geographically distributed across the Internet. In order to support our argument, we compare and contrast several load distribution strategies, and assess their effectivness when deployed within the context of a geographically replicated Web service; the principal figure of merit we use in this assessment is the response time experienced by the users of that service. As a result of this comparison, we propose a specific strategy, named QoS-based, that implements load distribution among WS replicas by binding a user to the replica that provides the shortest user response time. We examine several architectures that exploit our QoS-based strategy. Two of these architectures, named, respectively, Browser-based and Load Distribution-based, are described in detail as they are particularly appropriate for implementing our strategy.

[1]  Yi-Min Wang,et al.  ONE-IP: Techniques for Hosting a Service on a Cluster of Machines , 1997, Comput. Networks.

[2]  Piet Demeester,et al.  Performance measurements on the current internet , 1999 .

[3]  Edward D. Lazowska,et al.  Adaptive load sharing in homogeneous distributed systems , 1986, IEEE Transactions on Software Engineering.

[4]  Peter Scheuermann,et al.  Proxy Cache Algorithms: Design, Implementation, and Performance , 1999, IEEE Trans. Knowl. Data Eng..

[5]  Santosh K. Shrivastava,et al.  Constructing Dependable Web Services , 1999, IEEE Internet Comput..

[6]  Stephen Taylor,et al.  A Practical Approach to Dynamic Load Balancing , 1998, IEEE Trans. Parallel Distributed Syst..

[7]  Erich M. Nahum,et al.  Locality-aware request distribution in cluster-based network servers , 1998, ASPLOS VIII.

[8]  Steffen Rothkugel,et al.  Enhancing the Web's Infrastructure: From Caching to Replication , 1997, IEEE Internet Comput..

[9]  Enrico Gregori,et al.  Load distribution among replicated Web servers: a QoS-based approach , 2000, PERV.

[10]  Asser N. Tantawi,et al.  Optimal static load balancing in distributed computer systems , 1985, JACM.

[11]  Philip S. Yu,et al.  Analysis of Task Assignment Policies in Scalable Distributed Web-Server Systems , 1998, IEEE Trans. Parallel Distributed Syst..

[12]  Larry L. Peterson,et al.  TCP Vegas: End to End Congestion Avoidance on a Global Internet , 1995, IEEE J. Sel. Areas Commun..

[13]  Reinhard Klemm WebCompanion: A Friendly Client-Side Web Prefetching Agent , 1999, IEEE Trans. Knowl. Data Eng..

[14]  Paul Francis,et al.  The IP Network Address Translator (NAT) , 1994, RFC.

[15]  Neil G. Smith The UK National Web Cache - The State of the Art , 1996, Comput. Networks.

[16]  Philip S. Yu,et al.  Dynamic load balancing in geographically distributed heterogeneous Web servers , 1998, Proceedings. 18th International Conference on Distributed Computing Systems (Cat. No.98CB36183).

[17]  Jie Li,et al.  Load Balancing Problems for Multiclass Jobs in Distributed/Parallel Computer Systems , 1998, IEEE Trans. Computers.

[18]  Michelle Butler,et al.  A Scalable HTTP Server: The NCSA Prototype , 1994, Comput. Networks ISDN Syst..

[19]  Bruce S. Davie,et al.  Computer Networks: A Systems Approach , 1996 .

[20]  James E. Pitkow,et al.  Summary of WWW characterizations , 1998, World Wide Web.

[21]  Guido M. Schuster,et al.  Real-time voice over packet-switched networks , 1998, IEEE Netw..

[22]  Carey Williamson,et al.  Achieving Load Balance and Efiective Caching in Clustered Web Servers , 1999 .