Analysis of web caching architectures: hierarchical and distributed caching

Cache cooperation improves the performance of isolated caches, especially for caches with small cache populations. To make caches cooperate on a large scale and effectively increase the cache population, several caches are usually federated in caching architectures. In this paper, we discuss and compare the performance of different caching architectures. In particular, we consider hierarchical and distributed caching. We derive analytical models to study important performance parameters of hierarchical and distributed caching, i.e., client's perceived latency, bandwidth usage, load in the caches, and disk space usage. Additionally, we consider a hybrid caching architecture that combines hierarchical caching with distributed caching at every level of a caching hierarchy. We evaluate the performance of a hybrid scheme and determine the optimal number of caches that should cooperate at each caching level to minimize client's retrieval latency.

[1]  Alex Rousskov,et al.  On performance of caching proxies (extended abstract) , 1998, SIGMETRICS '98/PERFORMANCE '98.

[2]  Eric A. Brewer,et al.  System Design Issues for Internet Middleware Services: Deductions from a Large Client Trace , 1997, USENIX Symposium on Internet Technologies and Systems.

[3]  Syam Gadde,et al.  Reduce, reuse, recycle: an approach to building large Internet caches , 1997, Proceedings. The Sixth Workshop on Hot Topics in Operating Systems (Cat. No.97TB100133).

[4]  Steffen Rothkugel,et al.  World Wide Web caching: the application-level view of the Internet , 1997, IEEE Commun. Mag..

[5]  Paul Francis,et al.  Yoid: Extending the Internet Multicast Architec-ture , 2000 .

[6]  Michael Dahlin,et al.  Design considerations for distributed caching on the Internet , 1999, Proceedings. 19th IEEE International Conference on Distributed Computing Systems (Cat. No.99CB37003).

[7]  Li Fan,et al.  Summary cache: a scalable wide-area web cache sharing protocol , 2000, TNET.

[8]  Keith Ross,et al.  Cache Array Routing Protocol v1.1 , 1998 .

[9]  Pablo Rodriguez,et al.  Improving the WWW: Caching or Multicast? , 1998, Comput. Networks.

[10]  Anirban Mahanti,et al.  Traffic analysis of a Web proxy caching hierarchy , 2000 .

[11]  Michael J. Feeley,et al.  The Measured Access Characteristics of World-Wide-Web Client Proxy Caches , 1997, USENIX Symposium on Internet Technologies and Systems.

[12]  Alex Rousskov On Performance of Caching Proxies , 1998, SIGMETRICS 1998.

[13]  ShenkerScott,et al.  Scaling of multicast trees , 1999 .

[14]  Patrick R. McManus,et al.  A Passive System for Server Selection within Mirrored Resource Environments Using AS Path Length Heuristics , 2000 .

[15]  John Harrison,et al.  A Distributed Internet Cache , 1997 .

[16]  Syam Gadde,et al.  Not all Hits are Created Equal: Cooperative Proxy Caching Over a Wide-Area Network , 1998, Comput. Networks.

[17]  Peter Scheuermann,et al.  Selection algorithms for replicated Web servers , 1998, PERV.

[18]  Hongsuda Tangmunarunkit,et al.  Scaling of multicast trees: comments on the Chuang-Sirbu scaling law , 1999, SIGCOMM '99.

[19]  Duane Wessels,et al.  Cache Digests , 1998, Comput. Networks.

[20]  David R. Karger,et al.  Web Caching with Consistent Hashing , 1999, Comput. Networks.

[21]  Ernst W. Biersack,et al.  Performance modelling of reliable multicast transmission , 1997, Proceedings of INFOCOM '97.

[22]  Duane Wessels,et al.  Application of Internet Cache Protocol (ICP), version 2 , 1997, RFC.

[23]  Leonard Kleinrock,et al.  Queueing Systems: Volume I-Theory , 1975 .

[24]  Ping Pan,et al.  Internet Engineering Task Force , 1995 .

[25]  Duane Wessels,et al.  Hyper Text Caching Protocol (HTCP/0.0) , 2000, RFC.

[26]  CachingLee,et al.  On the Implications of Zipf ' s Law for Web , 1998 .

[27]  Andrei Z. Broder,et al.  A Technique for Measuring the Relative Size and Overlap of Public Web Search Engines , 1998, Comput. Networks.

[28]  Yuen Ren Chao,et al.  Human Behavior and the Principle of Least Effort: An Introduction to Human Ecology , 1950 .

[29]  Leonard Kleinrock,et al.  Theory, Volume 1, Queueing Systems , 1975 .

[30]  Neil G. Smith The UK National Web Cache - The State of the Art , 1996, Comput. Networks.

[31]  Guillaume Pierre,et al.  Replicated directory service for weakly consistent distributed caches , 1999, Proceedings. 19th IEEE International Conference on Distributed Computing Systems (Cat. No.99CB37003).

[32]  Peter B. Danzig,et al.  A Hierarchical Internet Object Cache , 1996, USENIX ATC.

[33]  Kimberly C. Claffy,et al.  Web Traffic Characterization: An Assesment of the Impact of Caching Documents from NCSA's Web Server , 1995, Comput. Networks ISDN Syst..

[34]  Duane Wessels Squid internet object cache , 1996 .