Approximate Algorithms for Document Placement in Distributed Web Servers

In this paper, we study approximate algorithms for placing a set of documents on M distributed Web servers. We define the load of a server as the sum of the loads induced by all documents stored on it; the size of a server is defined analogously. We propose five algorithms. Algorithm 1 balances the loads and sizes of the servers by limiting the loads to $k_l$ and the sizes to $k_s$ times their optimal values, where $1/(k_l - 1) + 1/(k_s - 1) \le 1$. This result improves the bounds on server load and size given in (L.C. Chen et al., 2001). Algorithm 2 further reduces the load bound on each server by using partial document replication, and Algorithm 3 does so by sorting. Algorithm 4 employs both partial replication and sorting. Finally, Algorithm 5 handles dynamic placement without sorting or replication, at the cost of a factor of $\Theta(\log M)$ in the time complexity.
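To make the definitions of server load and server size concrete, the following minimal sketch places documents one at a time on the currently least-loaded server. It is a generic greedy baseline and is not any of the five algorithms proposed here; the names `Server` and `place_greedy`, the `(name, load, size)` tuple layout, and the sample data are assumptions made for illustration. A binary heap keeps each placement at O(log M), which shows one way a logarithmic factor in M can arise when documents arrive dynamically.

```python
import heapq
from dataclasses import dataclass, field


@dataclass
class Server:
    load: float = 0.0   # sum of the loads of the documents stored here
    size: float = 0.0   # sum of the sizes of the documents stored here
    docs: list = field(default_factory=list)


def place_greedy(documents, m):
    """Place each (name, load, size) document on the least-loaded server.

    Illustrative baseline only (hypothetical helper, not from the paper).
    It balances load and merely records size; it enforces no size bound.
    """
    servers = [Server() for _ in range(m)]
    # Heap of (current load, server index); ties go to the lower index.
    heap = [(0.0, i) for i in range(m)]
    heapq.heapify(heap)
    for name, load, size in documents:
        _, i = heapq.heappop(heap)          # O(log M)
        server = servers[i]
        server.docs.append(name)
        server.load += load
        server.size += size
        heapq.heappush(heap, (server.load, i))  # O(log M)
    return servers


# Hypothetical documents: (name, load, size)
docs = [("a", 5, 10), ("b", 3, 40), ("c", 3, 5), ("d", 2, 20), ("e", 1, 5)]
servers = place_greedy(docs, 2)
for idx, s in enumerate(servers):
    print(idx, s.docs, s.load, s.size)
```

In this sample run the loads come out equal while the sizes differ noticeably, which illustrates why load and size have to be bounded jointly rather than balancing one quantity alone.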

[1] Cho-Li Wang, et al. Load balancing in distributed Web server systems with partial document replication, 2002, Proceedings International Conference on Parallel Processing.

[2] Giancarlo Martella, et al. Optimal File Allocation in a Computer Network: a Solution Method Based on the Knapsack Problem, 1982, Comput. Networks.

[3] Donald E. Knuth. The Art of Computer Programming, Volume 3: Sorting and Searching (2nd ed.), 1998.

[4] Erich M. Nahum, et al. Locality-aware request distribution in cluster-based network servers, 1998, ASPLOS VIII.

[5] Thomas P. Brisco. DNS Support for Load Balancing, 1995, RFC.

[6] Hyeong-Ah Choi, et al. Approximation algorithms for data distribution with load balancing of web servers, 2001, Proceedings 42nd IEEE Symposium on Foundations of Computer Science.

[7] Sampath Rangarajan, et al. Data distribution algorithms for load balanced fault-tolerant Web access, 1997, Proceedings of SRDS'97: 16th IEEE Symposium on Reliable Distributed Systems.

[8] Philip S. Yu, et al. Dynamic Load Balancing on Web-Server Systems, 1999, IEEE Internet Comput.

[9] Hyeong-Ah Choi, et al. File allocation for a parallel Webserver, 1996, Proceedings of 3rd International Conference on High Performance Computing (HiPC).

[10] Dorit S. Hochbaum, et al. Database Location in Computer Networks, 1980, JACM.

[11] Philip S. Yu, et al. The state of the art in locally distributed Web-server systems, 2002, CSUR.

[12] Oscar H. Ibarra, et al. SWEB: towards a scalable World Wide Web server on multicomputers, 1996, Proceedings of International Conference on Parallel Processing.

[13] Philip S. Yu, et al. DNS dispatching algorithms with state estimators for scalable Web-server clusters, 1999, World Wide Web.

[14] Michelle Butler, et al. A Scalable HTTP Server: The NCSA Prototype, 1994, Comput. Networks ISDN Syst.

[15] Martin Arlitt, et al. A workload characterization study of the 1998 World Cup Web site, 2000, IEEE Netw.

[16] Philip S. Yu, et al. Dynamic load balancing in geographically distributed heterogeneous Web servers, 1998, Proceedings. 18th International Conference on Distributed Computing Systems (Cat. No.98CB36183).

[17] Philip S. Yu, et al. Analysis of Task Assignment Policies in Scalable Distributed Web-Server Systems, 1998, IEEE Trans. Parallel Distributed Syst.

[18] Lawrence W. Dowdy, et al. Comparative Models of the File Assignment Problem, 1982, CSUR.