Network traffic prediction using PCA and K-means

The growing demand for link bandwidth and node capacity is a frequent phenomenon in IP network backbones. Within this context, traffic prediction is essential for the network operator. Traffic prediction can be undertaken based on link traffic or on origin-destination (OD) traffic which presents better results. This work investigates a methodology for traffic prediction based on multidimensional OD traffic, focusing on the stage of short-term traffic prediction using Principal Components Analysis as a technique for dimensionality reduction and a Local Linear Model based on K-means as a technique for prediction and trend analysis. The results validated with data on a real network present a satisfactory margin of error for use in practical situations.

[1]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[2]  Jun Lv,et al.  Network Traffic Prediction Algorithm and its Practical Application in real network , 2007, 2007 IFIP International Conference on Network and Parallel Computing Workshops (NPC 2007).

[3]  Brian Everitt,et al.  Cluster analysis , 1974 .

[4]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[5]  Henry X. Liu,et al.  Use of Local Linear Regression Model for Short-Term Traffic Forecasting , 2003 .

[6]  Sally Floyd,et al.  Wide-area traffic: the failure of Poisson modeling , 1994 .

[7]  Steve Uhlig,et al.  Providing public intradomain traffic matrices to the research community , 2006, CCRV.

[8]  Henry X. Liu,et al.  Short Term Traffic Forecasting Using the Local Linear Regression Model , 2002 .

[9]  Konstantina Papagiannaki,et al.  Long-term forecasting of Internet backbone traffic: observations and initial models , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[10]  Tong Li,et al.  A New Algorithm for Network Traffic Prediction , 2006, 11th IEEE Symposium on Computers and Communications (ISCC'06).

[11]  Desmond P. Taylor,et al.  On the SelfSimilar Nature of Ethernet Traffic (Extended Version) , 2007 .

[12]  Jean-Sébastien Bedo,et al.  Internet Traffic Mid-term Forecasting: A Pragmatic Approach Using Statistical Analysis Tools , 2006, Networking.

[13]  Konstantina Papagiannaki,et al.  Structural analysis of network traffic flows , 2004, SIGMETRICS '04/Performance '04.

[14]  M. Alam Long-Term Forecasting of Internet Traffic for Pakistan Internet Exchange , 2007, 2007 International Conference on Electrical Engineering.

[15]  B. Tabachnick,et al.  Using Multivariate Statistics (5th Edition) , 2006 .

[16]  Walter Willinger,et al.  On the self-similar nature of Ethernet traffic , 1993, SIGCOMM '93.

[17]  P. Sopp Cluster analysis. , 1996, Veterinary immunology and immunopathology.

[18]  Miguel Rio,et al.  Topology Aware Internet Traffic Forecasting Using Neural Networks , 2007, ICANN.