Abstract As distributed information services like the World Wide Web become increasingly popular on the Internet, problems of scale are clearly evident. A promising technique that addresses many of these problems is service (or document) replication. However, when a service is replicated, clients then need the additional ability to find a ``good'''' provider of that service. In this paper we report on techniques for finding good service providers without a priori knowledge of server location or network topology. We consider the use of two principal metrics for measuring distance in the Internet: hops, and round-trip latency. We show that these two metrics yield very different results in practice. Surprisingly, we show data indicating that the number of hops between two hosts in the Internet is {\em not\/} strongly correlated to round-trip latency. Thus, the distance in hops between two hosts is not necessarily a good predictor of the expected latency of a document transfer. Instead of using known or measured distances in hops, we show that the extra cost at runtime incurred by dynamic latency measurement is well justified based on the resulting improved performance. In addition we show that selection based on dynamic latency measurement performs much better in practice that any static selection scheme. Finally, the difference between the distribution of hops and latencies is fundamental enough to suggest differences in algorithms for server replication. We show that conclusions drawn about service replication based on the distribution of hops need to be revised when the distribution of latencies is considered instead.
[1]
Michael F. Schwartz,et al.
Locating nearby copies of replicated Internet servers
,
1995,
SIGCOMM '95.
[2]
Margo Seltzer,et al.
VINO: The 1994 Fall Harvest
,
1994
.
[3]
Peter B. Danzig,et al.
Harvest: A Scalable, Customizable Discovery and Access System
,
1994
.
[4]
Azer Bestavros,et al.
Demand-based document dissemination to reduce traffic and balance load in distributed information systems
,
1995,
Proceedings.Seventh IEEE Symposium on Parallel and Distributed Processing.
[5]
Margo I. Seltzer,et al.
The case for geographical push-caching
,
1995,
Proceedings 5th Workshop on Hot Topics in Operating Systems (HotOS-V).
[6]
Peter B. Danzig,et al.
A Hierarchical Internet Object Cache
,
1996,
USENIX ATC.