Effective caching of Web objects using Zipf's law

Web accesses follow Zipf's law with a good approximation, as measurements and observations indicate. This property provides an important tool in the design of Web caching architectures, because it allows designers to calculate appropriate cache sizes to achieve the desired hit ratios. The appropriate cache size combined with an LFU replacement policy achieves high cache hit rates. However, LFU replaces objects based on frequency measurements of past accesses. Thus, the system achieves high hit rates only after these measurements are reliable and converge to the final Zipf distribution. In this paper, we provide an analysis using Chernoff's bound and a calculation of an upper bound of the number of initial requests that need to be processed in order to obtain measurements of popularity with high confidence and a measured Zipf distribution which converges to the correct one.

[1]  Mark Crovella,et al.  Characteristics of WWW Client-based Traces , 1995 .

[2]  Peter B. Danzig,et al.  The Harvest Information Discovery and Access System , 1995, Comput. Networks ISDN Syst..

[3]  Wayne Wolf,et al.  Caching Web objects using Zipf's law , 1998, Other Conferences.

[4]  Noga Alon,et al.  The Probabilistic Method , 2015, Fundamentals of Ramsey Theory.

[5]  Andrew Wolfe,et al.  An approach to network caching for multimedia objects , 1997, Proceedings International Conference on Computer Design VLSI in Computers and Processors.

[6]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).