Replicated servers allocation for multiple information sources in a distributed environment

Recently, we have witnessed a phenomenal growth in the Internet/intranet coupled with rapid deployment of new services. Information dissemination over the network has become one of the most important activities in our daily life. The existing systems, however, often suffer from notorious long delay experienced by clients, especially during the peak hour. This is caused by the combination of a variety of factors including inadequate link bandwidth, server overload, and network congestion. Server replication has been shown to be one of the most effective mechanisms to cope with this problem. The basic idea is to replicate the information across a network so that clients' requests can be spread out. The major issue is in which locations inside the network these replications should take place, i.e., where to place the replicated servers. In this paper we investigate the server replication in a controlled distributed environment. The salient feature of this environment is that the decision that where to replicate information can be determined by a single authority, the intranet being the typical example. We consider the problem of placing multiple replicated servers within a network given there exists multiple target Web servers as information providers. We formulate this as an optimization problem by taking into consideration the characteristics of the network topology. We first show that this is a NP-complete problem, and then present a number of heuristics-based algorithms for server replications. Finally, in order to investigate various tradeoffs in terms of cost and algorithm complexity, we carry out comparison studies among different heuristic algorithms.

[1]  Bo Li,et al.  On the optimal placement of web proxies in the Internet , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[2]  Lalit M. Patnaik,et al.  Genetic algorithms: a survey , 1994, Computer.

[3]  Steven Glassman,et al.  A Caching Relay for the World Wide Web , 1994, Comput. Networks ISDN Syst..

[4]  Ellen W. Zegura,et al.  Self-organizing wide-area network caches , 1998, Proceedings. IEEE INFOCOM '98, the Conference on Computer Communications. Seventeenth Annual Joint Conference of the IEEE Computer and Communications Societies. Gateway to the 21st Century (Cat. No.98.

[5]  Carey L. Williamson,et al.  Internet Web servers: workload characterization and performance implications , 1997, TNET.

[6]  Ari Luotonen,et al.  World-Wide Web Proxies , 1994, Comput. Networks ISDN Syst..

[7]  G.J. Minden,et al.  A survey of active network research , 1997, IEEE Communications Magazine.

[8]  Kimberly C. Claffy,et al.  Web Traffic Characterization: An Assesment of the Impact of Caching Documents from NCSA's Web Server , 1995, Comput. Networks ISDN Syst..

[9]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[10]  Bo Li,et al.  On the Optimal Placement of Web Proxies in the Internet: The Linear Topology , 1998, HPN.

[11]  Azer Bestavros,et al.  WWW traffic reduction and load balancing through server-based caching , 1997, IEEE Concurrency.

[12]  Edward A. Fox,et al.  Removal Policies in Network Caches for World-Wide Web Documents , 1996, SIGCOMM.