On Staleness and the Delivery of Web Pages

A popular technique to improve the scalability of a web based system is caching at proxy servers. Caching has the drawback that a cached page becomes stale when the page is updated at the web server. In some cases, staleness may not be completely avoided because the server may not wish to expend the processing and communication resources required to transmit all the updates immediately. In general, if updates are transmitted less frequently, the staleness will tend to increase, but the amount of resources consumed will be reduced. The tradeoff between resource consumption and staleness is investigated. A measure of staleness is defined and optimization problems are formulated. The solutions to these problems allow one to come up with an optimal strategy for transmitting page updates. Numerical examples showing the resource consumption/staleness tradeoff are presented.

[1]  Hector Garcia-Molina,et al.  Synchronizing a database to improve freshness , 2000, SIGMOD 2000.

[2]  Stephen E. Deering,et al.  Host extensions for IP multicasting , 1986, RFC.

[3]  Jörg Nonnenmacher Asynchronous multicast push : AMP , 1997 .

[4]  Mun Choon Chan,et al.  Cache-based compaction: a new technique for optimizing Web transfer , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[5]  Mostafa H. Ammar,et al.  The Design of Teletext Broadcast Cycles , 1985, Perform. Evaluation.

[6]  Virgílio A. F. Almeida,et al.  Characterizing reference locality in the WWW , 1996, Fourth International Conference on Parallel and Distributed Information Systems.

[7]  Margo I. Seltzer,et al.  The case for geographical push-caching , 1995, Proceedings 5th Workshop on Hot Topics in Operating Systems (HotOS-V).

[8]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[9]  Pablo Rodriguez,et al.  Improving the WWW: Caching or Multicast? , 1998, Comput. Networks.

[10]  William C. Fenner Internet Group Management Protocol, Version 2 , 1997, RFC.

[11]  Margo I. Seltzer,et al.  World Wide Web Cache Consistency , 1996, USENIX Annual Technical Conference.

[12]  Scott Shenker,et al.  A scalable Web cache consistency architecture , 1999, SIGCOMM '99.

[13]  Adam Dingle,et al.  Web Cache Coherence , 1996, Comput. Networks.