Multiscale Predictability of Network Traffic

Distributed applications use predictions of network traffic to sustain their performance by adapting their behavior. The timescale of interest is application-dependent and thus it is natural to ask how predictability depends on the resolution, or degree of smoothing, of the network traffic signal. To help answer this question we empirically study the onestep-ahead predictability, measured by the ratio of mean squared error to signal variance, of network traffic at different resolutions. A one-step-ahead prediction at a low resolution is a prediction of the average behavior over a long interval. We apply a wide range of linear time series models to a large number of packet traces, generating different resolution views of the traces through two methods: the simple binning approach used by several extant network measurement tools, and by wavelet-based approximations. The wavelet-based approach is a natural way to provide multiscale prediction to applications. We find that predictability seems to be highly situational in practice---it varies widely from trace to trace. Unexpectedly, predictability does not always increase as the signal is smoothed. Half of the time there is a “sweet spot” at which the ratio is minimized and predictability is clearly best. We conclude by describing plans for an online waveletbased prediction system. Effort sponsored by the National Science Foundation under Grants ANI-0093221, ACI0112891, and EIA-0130869. The NLANR PMA traces are provided to the community by the National Laboratory for Applied Network Research under NSF Cooperative Agreement ANI-9807479. Any opinions, findings and conclusions or recommendations expressed in this material are those of the author and do not necessarily reflect the views of the National Science Foundation (NSF).}

[1]  Peter A. Dinda Online Prediction of the Running Time of Tasks , 2004, Cluster Computing.

[2]  Walter Willinger,et al.  On the self-similar nature of Ethernet traffic , 1993, SIGCOMM '93.

[3]  N. C. Nigam Introduction to Random Vibrations , 1983 .

[4]  George C. Polyzos,et al.  A time series model of long-term NSFNET backbone traffic , 1994, Proceedings of ICC/SUPERCOMM'94 - 1994 International Conference on Communications.

[5]  Gwilym M. Jenkins,et al.  Time series analysis, forecasting and control , 1972 .

[6]  Amarnath Mukherjee,et al.  Time series models for internet traffic , 1996, Proceedings of IEEE INFOCOM '96. Conference on Computer Communications.

[7]  Ian T. Foster,et al.  Predicting the performance of wide area data transfers , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[8]  J. Beran Statistical methods for data with long-range dependence , 1992 .

[9]  Stephen F. Bush,et al.  Active Virtual Network Management Prediction , 2001 .

[10]  Ingrid Daubechies,et al.  Ten Lectures on Wavelets , 1992 .

[11]  Peter A. Dinda,et al.  Multi-resolution resource behavior queries using wavelets , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[12]  Patrice Abry,et al.  Long‐range Dependence: Revisiting Aggregation with Wavelets , 1998 .

[13]  Matthew Roughan,et al.  On-line estimation of the parameters of long-range dependence , 1998, IEEE GLOBECOM 1998 (Cat. NO. 98CH36250).

[14]  Yi Qiao,et al.  Network Traffic Analysis, Classification, and Prediction , 2003 .

[15]  Walter Willinger,et al.  Self-similarity through high-variability: statistical analysis of Ethernet LAN traffic at the source level , 1997, TNET.

[16]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[17]  Dean Sutherland,et al.  The architecture of the Remos system , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[18]  Larry L. Peterson,et al.  TCP Vegas: End to End Congestion Avoidance on a Global Internet , 1995, IEEE J. Sel. Areas Commun..

[19]  Peter B. Danzig,et al.  A measurement-based admission control algorithm for integrated services packet networks , 1995, SIGCOMM '95.

[20]  Richard Wolski,et al.  Forecasting network performance to support dynamic scheduling using the network weather service , 1997, Proceedings. The Sixth IEEE International Symposium on High Performance Distributed Computing (Cat. No.97TB100183).

[21]  Peter A. Dinda,et al.  Virtualized Audio: A Highly Adaptive Interactive High Performance Computing Application , 2002 .

[22]  Anja Feldmann,et al.  Data networks as cascades: investigating the multifractal nature of Internet WAN traffic , 1998, SIGCOMM '98.

[23]  Anja Feldmann,et al.  Dynamics of IP traffic: a study of the role of variability and the impact of control , 1999, SIGCOMM '99.

[24]  San-qi Li,et al.  A predictability analysis of network traffic , 2002, Comput. Networks.

[25]  V. Jacobson,et al.  Congestion avoidance and control , 1988, CCRV.

[26]  Francine Berman,et al.  Scheduling from the perspective of the application , 1996, Proceedings of 5th IEEE International Symposium on High Performance Distributed Computing.

[27]  Kavitha Chandra,et al.  Time series models for Internet data traffic , 1999, Proceedings 24th Conference on Local Computer Networks. LCN'99.

[28]  Murad S. Taqqu,et al.  On the Self-Similar Nature of Ethernet Traffic , 1993, SIGCOMM.

[29]  Peter Steenkiste,et al.  Collecting network status information for network-aware applications , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[30]  Richard G. Baraniuk,et al.  A Multifractal Wavelet Model with Application to Network Traffic , 1999, IEEE Trans. Inf. Theory.

[31]  Hung Man Tong,et al.  Threshold models in non-linear time series analysis. Lecture notes in statistics, No.21 , 1983 .

[32]  Brian D. Noble,et al.  Mobile network estimation , 2001, MobiCom '01.

[33]  Srinivasan Seshan,et al.  A network measurement architecture for adaptive applications , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[34]  Donald F. Towsley,et al.  A new algorithm for measurement-based admission control in integrated services packet networks , 1996, Protocols for High-Speed Networks.

[35]  C. Granger,et al.  AN INTRODUCTION TO LONG‐MEMORY TIME SERIES MODELS AND FRACTIONAL DIFFERENCING , 1980 .

[36]  Richard Wolski,et al.  The network weather service: a distributed resource performance forecasting service for metacomputing , 1999, Future Gener. Comput. Syst..

[37]  San-qi Li,et al.  Predictive Dynamic Bandwidth Allocation for Efficient Transport of Real-Time VBR Video over ATM , 1995, IEEE J. Sel. Areas Commun..