Hierarchical Web caching systems: modeling, design and experimental results

This paper aims at finding fundamental design principles for hierarchical Web caching. An analytical modeling technique is developed to characterize an uncooperative two-level hierarchical caching system where the least recently used (LRU) algorithm is locally run at each cache. With this modeling technique, we are able to identify a characteristic time for each cache, which plays a fundamental role in understanding the caching processes. In particular, a cache can be viewed roughly as a low-pass filter with its cutoff frequency equal to the inverse of the characteristic time. Documents with access frequencies lower than this cutoff frequency have good chances to pass through the cache without cache hits. This viewpoint enables us to take any branch of the cache tree as a tandem of low-pass filters at different cutoff frequencies, which further results in the finding of two fundamental design principles. Finally, to demonstrate how to use the principles to guide the caching algorithm design, we propose a cooperative hierarchical Web caching architecture based on these principles. Both model-based and real trace simulation studies show that the proposed cooperative architecture results in more than 50% memory saving and substantial central processing unit (CPU) power saving for the management and update of cache entries compared with the traditional uncooperative hierarchical caching architecture.

[1]  G. Grimmett,et al.  Probability and random processes , 2002 .

[2]  Dimitri P. Bertsekas,et al.  Data Networks , 1986 .

[3]  Mark Crovella,et al.  Characteristics of WWW Client-based Traces , 1995 .

[4]  Edward A. Fox,et al.  Caching Proxies: Limitations and Potentials , 1995, WWW.

[5]  Peter B. Danzig,et al.  A Hierarchical Internet Object Cache , 1996, USENIX ATC.

[6]  Francis Patrick Kelly,et al.  The Clifford Paterson Lecture, 1995 Modelling communication networks, present and future , 1996, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[7]  Sandy Irani,et al.  Cost-Aware WWW Proxy Caching Algorithms , 1997, USENIX Symposium on Internet Technologies and Systems.

[8]  Keith W. Ross,et al.  Hash routing for collections of shared Web caches , 1997, IEEE Netw..

[9]  Marc Abrams,et al.  Proxy Caching That Estimates Page Load Delays , 1997, Comput. Networks.

[10]  Louis O. Hertzberger,et al.  Dealing with one-timer-documents in Web caching , 1998, Proceedings. 24th EUROMICRO Conference (Cat. No.98EX204).

[11]  Jia Wang,et al.  A survey of web caching schemes for the Internet , 1999, CCRV.

[12]  Clinton L. Jeffery,et al.  Taxonomy and design analysis for distributed Web caching , 1999, Proceedings of the 32nd Annual Hawaii International Conference on Systems Sciences. 1999. HICSS-32. Abstracts and CD-ROM of Full Papers.

[13]  D. Reininger,et al.  Web caching framework: analytical models and beyond , 1999, Proceedings 1999 IEEE Workshop on Internet Applications (Cat. No.PR00197).

[14]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[15]  Hao Che,et al.  Analysis and design of hierarchical Web caching systems , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[16]  Michael Dahlin,et al.  Coordinated Placement and Replacement for Large-Scale Distributed Caches , 2002, IEEE Trans. Knowl. Data Eng..