Demand-based Document Dissemination for the World-Wide Web

Abstract We analyzed the logs of the cs-www.bu.edu HTTP server for the month of January 1995. Our analysis showed that remote HTTP accesses were confined to a small subset of documents. Using an analytical model of server popularity and file access profiles, we show that by disseminating the most popular documents on servers (proxies) closer to the clients, network traffic could be reduced considerably, while server loads are balanced. We argue that this process could be generalized so as to provide for an automated demand-based duplication of documents. We believe that such server-based information dissemination protocols will be more effective at reducing both network bandwidth and document retrieval times than client-based caching protocols.

[1]  Mahadev Satyanarayanan,et al.  Scale and performance in a distributed file system , 1988, TOCS.

[2]  Richard S. Hall,et al.  A case for caching file objects inside internetworks , 1993, SIGCOMM '93.

[3]  J. Howard Et El,et al.  Scale and performance in a distributed file system , 1988 .

[4]  Mahadev Satyanarayanan,et al.  Coda: A Highly Available File System for a Distributed Workstation Environment , 1990, IEEE Trans. Computers.

[5]  Stanley B. Zdonik,et al.  An Efficient Scheme for Dynamic Data Replication , 1993 .

[6]  Matthew Addison Blaze Caching in large-scale distributed file systems , 1993 .

[7]  Azer Bestavros,et al.  Application-level document caching in the Internet , 1995, Second International Workshop on Services in Distributed and Networked Environments.

[8]  Margo I. Seltzer,et al.  The case for geographical push-caching , 1995, Proceedings 5th Workshop on Hot Topics in Operating Systems (HotOS-V).

[9]  Mahadev Satyanarayanan,et al.  Andrew: a distributed personal computing environment , 1986, CACM.

[10]  Michael F. Schwartz,et al.  Locating nearby copies of replicated Internet servers , 1995, SIGCOMM '95.

[11]  Peter Honeyman,et al.  Multi-level Caching in Distributed File Systems or Your cache ain't nuthin' but trash , 1992 .

[12]  Michael Dahlin,et al.  Cooperative caching: using remote client memory to improve file system performance , 1994, OSDI '94.

[13]  P. Venkat Rangan,et al.  Information caching for delivery of personalized video programs on home entertainment channels , 1994, 1994 Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[14]  David A. Patterson,et al.  Cooperative Caching: Using Remote Client Memory , 1994 .