Directory Structures for Scalable Internet Caches

Use of Internet caches is a cheap and eeective way to improve performance for all Internet users. Distributed caches ooer the potential to serve larger user communities and to deliver higher hit ratios on shared Web documents. The key to building eeective distributed caches is a directory structure that allows individual caching servers to locate objects cached at neighboring sites, combining them into a logically uniied collective cache. This paper uses Web traces to evaluate a range of alternatives for managing directories in distributed Internet caches. We use trace-driven executions and simulations of prototype caches to compare multicast-based queries of local maps (Harvest) with unicast queries of a global map (CRISP). We then use properties of the traces to predict performance of CRISP variants in which the global map is partitioned or replicated. Finally, we propose a novel lazy CRISP structure based on weakly consistent replication of the most valuable subset of the global map. We show that this scheme combines the best aspects of several distributed cache structures, while maintaining a near-ideal hit ratio.

[1]  Margo I. Seltzer,et al.  World Wide Web Cache Consistency , 1996, USENIX Annual Technical Conference.

[2]  Chengjie Liu,et al.  Maintaining Strong Cache Consistency in the World Wide Web , 1998, IEEE Trans. Computers.

[3]  Barron C. Housel,et al.  WebExpress: a system for optimizing Web browsing in a wireless environment , 1996, MobiCom '96.

[4]  Fred Douglis,et al.  HPP: HTML Macro-Preprocessing to Support Dynamic Document Caching , 1997, USENIX Symposium on Internet Technologies and Systems.

[5]  Syam Gadde,et al.  Reduce, reuse, recycle: an approach to building large Internet caches , 1997, Proceedings. The Sixth Workshop on Hot Topics in Operating Systems (Cat. No.97TB100133).

[6]  Keith Ross,et al.  Cache Array Routing Protocol v1.1 , 1998 .

[7]  Anna R. Karlin,et al.  Implementing global memory management in a workstation cluster , 1995, SOSP.

[8]  Ari Luotonen,et al.  World-Wide Web Proxies , 1994, Comput. Networks ISDN Syst..

[9]  Marvin Theimer,et al.  Flexible update propagation for weakly consistent replication , 1997, SOSP.

[10]  Michael Dahlin,et al.  A quantitative analysis of cache policies for scalable network file systems , 1994, SIGMETRICS.

[11]  Margo I. Seltzer,et al.  A New Hashing Package for UNIX , 1991, USENIX Winter.

[12]  Stephen E. Deering,et al.  Host extensions for IP multicasting , 1986, RFC.

[13]  Duane Wessels,et al.  Internet Cache Protocol (ICP), version 2 , 1997, RFC.

[14]  Peter B. Danzig,et al.  A Hierarchical Internet Object Cache , 1996, USENIX ATC.