A Quantitative Justification to Partial Replication of Web Contents

Traditionally, the alternatives for Web content storage have been full replication and full distribution. More recently, partial replication has been proposed as a hybrid strategy. This paper gives a quantitative justification for the advantages of this approach in terms of storage capacity usage and reliability. Our analytical study proves that partial replication offers much higher storage capacity than full replication, and that its reliability is much higher than that of full distribution, reaching levels equivalent to those provided by full replication.
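
The trade-off described above can be illustrated with a minimal sketch. It is not the paper's analytical model: it assumes a hypothetical cluster of identical nodes that fail independently, with each file stored on a fixed number of nodes. The function names, the node count, the per-node capacity, and the node availability are all illustrative assumptions. Under those assumptions, full distribution corresponds to one copy per file, full replication to one copy on every node, and partial replication to any number of copies in between.

```python
# Illustrative model only (an assumption, not the paper's analysis):
# identical nodes, independent failures, a fixed number of copies per file.

def storage_capacity(nodes: int, node_capacity: float, replicas: int) -> float:
    """Amount of distinct content the cluster can hold with `replicas` copies per file."""
    if not 1 <= replicas <= nodes:
        raise ValueError("replicas must be between 1 and the number of nodes")
    return nodes * node_capacity / replicas


def file_availability(node_availability: float, replicas: int) -> float:
    """Probability that at least one node holding a given file is up."""
    if not 0.0 <= node_availability <= 1.0:
        raise ValueError("node_availability must be a probability")
    return 1.0 - (1.0 - node_availability) ** replicas


if __name__ == "__main__":
    # Hypothetical parameters chosen only for illustration.
    nodes, node_capacity, node_availability = 10, 100.0, 0.9
    strategies = {
        "full distribution": 1,        # one copy of each file
        "partial replication": 3,      # a few copies of each file
        "full replication": nodes,     # a copy on every node
    }
    for name, replicas in strategies.items():
        capacity = storage_capacity(nodes, node_capacity, replicas)
        availability = file_availability(node_availability, replicas)
        print(f"{name:20s} capacity={capacity:8.1f} GB  availability={availability:.10f}")
```

In this simplified model, capacity shrinks in proportion to the number of copies, while the probability that a file is unavailable shrinks exponentially with it. A small number of copies therefore keeps most of the capacity of full distribution while bringing availability close to that of full replication.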
