Analysis and design of hierarchical Web caching systems

This paper aims at finding fundamental design principles for hierarchical Web caching. An analytical modeling technique is developed to characterize an uncooperative two-level hierarchical caching system where the least recently used (LRU) algorithm is locally run at each cache. With this modeling technique, we are able to identify a characteristic time for each cache, which plays a fundamental role in understanding the caching processes. In particular, a cache can be viewed roughly as a lowpass filter with its cutoff frequency equal to the inverse of the characteristic time. Documents with access frequencies lower than this cutoff frequency will have good chances to pass through the cache without cache hits. This viewpoint enables us to take any branch of the cache tree as a tandem of lowpass filters at different cutoff frequencies, which further results in the finding of two fundamental design principles. Finally, to demonstrate how to use the principles to guide the caching algorithm design, we propose a cooperative hierarchical Web caching architecture based on these principles. The simulation study shows that the proposed cooperative architecture results in 50% saving of the cache resource compared with the traditional uncooperative hierarchical caching architecture.

[1]  Peter B. Danzig,et al.  A Hierarchical Internet Object Cache , 1996, USENIX Annual Technical Conference.

[2]  Jia Wang,et al.  A survey of web caching schemes for the Internet , 1999, CCRV.

[3]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[4]  D. Reininger,et al.  Web caching framework: analytical models and beyond , 1999, Proceedings 1999 IEEE Workshop on Internet Applications (Cat. No.PR00197).

[5]  Mark Crovella,et al.  Characteristics of WWW Client-based Traces , 1995 .

[6]  Y. Xia,et al.  Model for magnetic phase transitions of RBa2Cu3O6+x compounds , 1990 .

[7]  Michael Dahlin,et al.  Coordinated Placement and Replacement for Large-Scale Distributed Caches , 2002, IEEE Trans. Knowl. Data Eng..

[8]  Marc Abrams,et al.  Proxy Caching That Estimates Page Load Delays , 1997, Comput. Networks.

[9]  Louis O. Hertzberger,et al.  Dealing with one-timer-documents in Web caching , 1998, Proceedings. 24th EUROMICRO Conference (Cat. No.98EX204).

[10]  Edward A. Fox,et al.  Caching Proxies: Limitations and Potentials , 1995, WWW.

[11]  Francis Patrick Kelly,et al.  The Clifford Paterson Lecture, 1995 Modelling communication networks, present and future , 1996, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[12]  Dimitri P. Bertsekas,et al.  Data Networks , 1986 .

[13]  Sandy Irani,et al.  Cost-Aware WWW Proxy Caching Algorithms , 1997, USENIX Symposium on Internet Technologies and Systems.

[14]  Clinton L. Jeffery,et al.  Taxonomy and design analysis for distributed Web caching , 1999, Proceedings of the 32nd Annual Hawaii International Conference on Systems Sciences. 1999. HICSS-32. Abstracts and CD-ROM of Full Papers.

[15]  Keith W. Ross,et al.  Hash routing for collections of shared Web caches , 1997, IEEE Netw..