On the Use of Principle Component Analysis for the Hurst Parameter Estimation of Long-Range Dependent Network Traffic

Long-range dependency and self-similarity are the major characteristics of the Internet traffic. The degree of self-similarity is measured by the Hurst parameter (H). Various methods have been proposed to estimate H. One of the recent methods is an eigen domain estimator which is based on Principle Component Analysis (PCA); a popular signal processing tool. The PCA-based Method (PCAbM) uses the progression of the eigenvalues which are extracted from the autocorrelation matrix. For a self-similar process, this progression obeys a power-law relationship from which H can be estimated. In this paper, we compare PCAbM with some of the well-known estimation methods, namely; periodogram-based, wavelet-based estimation methods and show that PCAbM is reliable only when the process is long-range dependent (LRD), i.e., H is greater than 0.5. We also apply PCAbM and the other estimators to real network traces.

[1]  Süleyman Baykut,et al.  Principal Component Analysis of the Fractional Brownian Motion for 0 < H < 0.5 , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[2]  Murad S. Taqqu,et al.  Theory and applications of long-range dependence , 2003 .

[3]  Walter Willinger,et al.  On the self-similar nature of Ethernet traffic , 1993, SIGCOMM '93.

[4]  Rachid Harba,et al.  Fast and exact synthesis for 1-D fractional Brownian motion and fractional Gaussian noises , 2002, IEEE Signal Processing Letters.

[5]  C. R. Dietrich,et al.  Fast and Exact Simulation of Stationary Gaussian Processes through Circulant Embedding of the Covariance Matrix , 1997, SIAM J. Sci. Comput..

[6]  Patrice Abry,et al.  Wavelet Analysis of Long-Range-Dependent Traffic , 1998, IEEE Trans. Inf. Theory.

[7]  Vern Paxson,et al.  Fast, approximate synthesis of fractional Gaussian noise for generating self-similar network traffic , 1997, CCRV.

[8]  Azer Bestavros,et al.  Self-similarity in World Wide Web traffic: evidence and possible causes , 1996, SIGMETRICS '96.

[9]  Ilkka Norros,et al.  A storage model with self-similar input , 1994, Queueing Syst. Theory Appl..

[10]  W. Willinger,et al.  ESTIMATORS FOR LONG-RANGE DEPENDENCE: AN EMPIRICAL STUDY , 1995 .

[11]  Jan Beran,et al.  Statistics for long-memory processes , 1994 .

[12]  B. Mandelbrot,et al.  Fractional Brownian Motions, Fractional Noises and Applications , 1968 .

[13]  Jianbo Gao,et al.  Principal component analysis of 1/fα noise , 2003 .

[14]  G. Wornell Wavelet-based representations for the 1/f family of fractal processes , 1993, Proc. IEEE.

[15]  Walter Willinger,et al.  Self-similarity through high-variability: statistical analysis of Ethernet LAN traffic at the source level , 1997, TNET.

[16]  A. Erramilli,et al.  Self-similar traffic generation: the random midpoint displacement algorithm and its properties , 1995, Proceedings IEEE International Conference on Communications ICC '95.