Network Performance Implications of Variability in Data Traffic

World Wide Web (WWW) traffic will dominate network traffic for the foreseeable future. Accurate predictions of network performance can only be achieved if network models reflect WWW traffic statistics. Through analysis of usage logs at a range of caches it is shown that WWW traffic is not a Poisson arrival process, and that it displays significant levels of self-similarity. It is also shown for the first time that the self-similar variability extends to demand for individual pages, and is far more pervasive than previously thought. These measurements are used as the basis for a cache-modelling tool-kit. Using this software the impact of the variability on predictive planning is illustrated. The model predicts that optimisations based on predictive algorithms (such as least recently used discard) are likely to reduce performance very quickly. This means that, far from improving the efficiency of the network, conventional approaches to network planning and engineering will tend to reduce efficiency and increase costs.

[1]  Anja Feldmann,et al.  Web proxy caching: the devil is in the details , 1998, PERV.

[2]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[3]  Ian W. Marshall,et al.  Linking Cache Performance to User Behaviour , 1998, Comput. Networks.

[4]  Walter Willinger,et al.  Self-similarity through high-variability: statistical analysis of Ethernet LAN traffic at the source level , 1997, TNET.

[5]  Mark E. Crovella,et al.  Effect of traffic self-similarity on network performance , 1997, Other Conferences.

[6]  Paul Barford,et al.  Generating representative Web workloads for network and server performance evaluation , 1998, SIGMETRICS '98/PERFORMANCE '98.

[7]  Chris Roadknight,et al.  Modelling and Performance Analysis of Cache Networks. , 1999 .

[8]  J. Beran Statistical methods for data with long-range dependence , 1992 .

[9]  W. Willinger,et al.  ESTIMATORS FOR LONG-RANGE DEPENDENCE: AN EMPIRICAL STUDY , 1995 .

[10]  Steffen Rothkugel,et al.  Enhancing the Web's Infrastructure: From Caching to Replication , 1997, IEEE Internet Comput..

[11]  Ian W. Marshall,et al.  File popularity characterisation , 2000, PERV.