Towards Fine Grained Network Flow Prediction

One main challenge for the design of networks is that traffic load is not generally known in advance. This makes it hard to adequately devote resources such as to best prevent or mitigate bottlenecks. While several authors have shown how to predict traffic in a coarse grained manner by aggregating flows, fine grained prediction of traffic at the level of individual flows, including bursty traffic, is widely considered to be impossible. This paper shows, to the best of our knowledge, the first approach to fine grained per flow traffic prediction. In short, we introduce the Frequency-based Kernel Kalman Filter (FKKF), which predicts individual flows' behavior based on measurements. Our FKKF relies on the well known Kalman Filter in combination with a kernel to support the prediction of non linear functions. Furthermore we change the operating space from time to frequency space. In this space, into which we transform the input data via a Short-Time Fourier Transform (STFT), the peak structures of flows can be predicted after gleaning their key characteristics, with a Principal Component Analysis (PCA), from past and ongoing flows that stem from the same socket-to-socket connection. We demonstrate the effectiveness of our approach on popular benchmark traces from a university data center. Our approach predicts traffic on average across 17 out of 20 groups of flows with an average prediction error of 6.43% around 0.49 (average) seconds in advance, whilst existing coarse grained approaches exhibit prediction errors of 77% at best.

[1]  Poo Kuan Hoong,et al.  Bittorrent Network Traffic Forecasting With ARMA , 2012, ArXiv.

[2]  Mark Handley,et al.  Congestion control for high bandwidth-delay product networks , 2002, SIGCOMM '02.

[3]  George Varghese,et al.  High Speed Networks Need Proactive Congestion Control , 2015, HotNets.

[4]  Klaus Wehrle,et al.  Network Latency Control in Data Centres (Dagstuhl Seminar 16281) , 2016, Dagstuhl Reports.

[5]  David A. Maltz,et al.  Network traffic characteristics of data centers in the wild , 2010, IMC '10.

[6]  Benoit Claise,et al.  Specification of the IP Flow Information Export (IPFIX) Protocol for the Exchange of IP Traffic Flow Information , 2008, RFC.

[7]  Min Zhu,et al.  B4: experience with a globally-deployed software defined wan , 2013, SIGCOMM.

[8]  Peter Phaal,et al.  InMon Corporation's sFlow: A Method for Monitoring Traffic in Switched and Routed Networks , 2001, RFC.

[9]  Amin Vahdat,et al.  Hedera: Dynamic Flow Scheduling for Data Center Networks , 2010, NSDI.

[10]  Murali S. Kodialam,et al.  Traffic engineering in software defined networks , 2013, 2013 Proceedings IEEE INFOCOM.

[11]  Mo Dong,et al.  PCC: Re-architecting Congestion Control for Consistent High Performance , 2014, NSDI.

[12]  Badong Chen,et al.  Learning Nonlinear Generative Models of Time Series With a Kalman Filter in RKHS , 2014, IEEE Transactions on Signal Processing.

[13]  Poo Kuan Hoong,et al.  Impact of utilizing forecasted network traffic for data transfers , 2011, 13th International Conference on Advanced Communication Technology (ICACT2011).

[14]  Jilali Antari,et al.  Identification and Prediction of Internet Traffic Using Artificial Neural Networks , 2010, J. Intell. Learn. Syst. Appl..

[15]  L. Ralaivola,et al.  Time series filtering, smoothing and learning using the kernel Kalman filter , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[16]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[17]  Andreas Kassler,et al.  Predicting expected TCP throughput using genetic algorithm , 2016, Comput. Networks.

[18]  Hao Chen,et al.  Prediction of traffic in a public safety network , 2006, 2006 IEEE International Symposium on Circuits and Systems.

[19]  Ming Zhang,et al.  MicroTE: fine grained traffic engineering for data centers , 2011, CoNEXT '11.

[20]  Lirong Qiu,et al.  Network Traffic Prediction Based on SVR Improved By Chaos Theory and Ant Colony Optimization , 2015 .

[21]  Yanhua Yu,et al.  Traffic prediction in 3G mobile networks based on multifractal exploration , 2013 .

[22]  Praveen Yalagandula,et al.  Mahout: Low-overhead datacenter traffic management using end-host-based elephant detection , 2011, 2011 Proceedings IEEE INFOCOM.

[23]  P. Welch The use of fast Fourier transform for the estimation of power spectra: A method based on time averaging over short, modified periodograms , 1967 .

[24]  Gerhard Neumann,et al.  The Kernel Kalman Rule - Efficient Nonparametric Inference with Recursive Least Squares , 2017, AAAI.

[25]  Miguel Rio,et al.  Internet Traffic Forecasting using Neural Networks , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[26]  Balasubramaniam Natarajan,et al.  GARCH — non-linear time series model for traffic modeling and prediction , 2008, NOMS 2008 - 2008 IEEE Network Operations and Management Symposium.

[27]  Jilali Antari,et al.  ANFIS method for forecasting internet traffic time series , 2009, 2009 Mediterrannean Microwave Symposium (MMS).

[28]  David A. Maltz,et al.  Data center TCP (DCTCP) , 2010, SIGCOMM 2010.

[29]  Ming Zhang,et al.  Understanding data center traffic characteristics , 2010, CCRV.

[30]  Nick McKeown,et al.  Why flow-completion time is the right metric for congestion control , 2006, CCRV.

[31]  Srikanth Kandula,et al.  Achieving high utilization with software-driven WAN , 2013, SIGCOMM.

[32]  B. E. Reddy,et al.  Performance Comparison of Four New ARIMA-ANN Prediction Models on Internet Traffic Data , 2015 .

[33]  Sophia Daskalaki,et al.  Comparing forecasting approaches for Internet traffic , 2015, Expert Syst. Appl..

[34]  Konstantina Papagiannaki,et al.  A pragmatic definition of elephants in internet backbone traffic , 2002, IMW '02.

[35]  Dong-Chul Park,et al.  Prediction of Network Traffic Using Dynamic Bilinear Recurrent Neural Network , 2009, 2009 Fifth International Conference on Natural Computation.