Internet Performance Prediction Framework Based on PingER Dataset

The Internet performance directly affects the scalability, reliability and availability of the online applications. Delay of a few millisecond may cause companies lose millions of dollars. Therefore, Internet measurements are carried out to capture the performance of the Internet links worldwide. Most of the Internet performance monitoring frameworks are active in nature i.e., they can only capture the real-time performance of the Internet links. Thus, these monitoring frameworks are unable to forecast the near future performance of the Internet links in a region. Such estimates are quite critical for the network administrators to carry out bandwidth extensive experiments between different sites, policy makers to suggest future upgrades to the Internet infrastructures or streaming service providers to enhance the quality of service to their customers. Therefore, we analyze different machine learning algorithms including Multiple Linear regression, Random Forest algorithm, Gradient Boosting, and eXtreme Gradient Boosting to predict the performance of the Internet links using PingER (Ping End-to-End Reporting) dataset for the countries like China, India and Japan. Our experimental results show that the Multiple Linear regression has improved Internet performance prediction accuracy compared with the other methods. Our work can be utilized by the Internet service providers, streaming service providers or policymakers for the design, deployment, and evaluation of next-generation Internet infrastructure.

[1]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[2]  C. L. Wu,et al.  Methods to improve neural network performance in daily flows prediction , 2009 .

[3]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[4]  Leo Guelman,et al.  Gradient boosting trees for auto insurance loss cost modeling and prediction , 2012, Expert Syst. Appl..

[5]  K. Hasan,et al.  Comparative study of wavelet-ARIMA and wavelet-ANN models for temperature time series data in northeastern Bangladesh , 2017 .

[6]  Vaibhav Bajpai,et al.  Lessons Learned From Using the RIPE Atlas Platform for Measurement Research , 2015, CCRV.

[7]  Jukka Manner,et al.  Netradar - Measuring the wireless world , 2013, 2013 11th International Symposium and Workshops on Modeling and Optimization in Mobile, Ad Hoc and Wireless Networks (WiOpt).

[8]  Shi Dong,et al.  Network traffic prediction based on ARFIMA model , 2013, ArXiv.

[9]  Bo Li,et al.  Network traffic prediction based on a new time series model , 2005, Int. J. Commun. Syst..

[10]  Saqib Ali,et al.  Internet Performance Analysis of South Asian Countries Using End-to-End Internet Performance Measurements , 2017, 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC).

[11]  Fengying Cui Study of Traffic Flow Prediction Based on BP Neural Network , 2010, 2010 2nd International Workshop on Intelligent Systems and Applications.

[12]  Nick Feamster,et al.  BISmark: A Testbed for Deploying Measurements and Applications in Broadband Access Networks , 2014, USENIX ATC.

[13]  Sanjay Kumar,et al.  Intuitionistic Fuzzy Time Series: An Approach for Handling Nondeterminism in Time Series Forecasting , 2016, IEEE Transactions on Fuzzy Systems.

[14]  Chen Hongxin Network traffic prediction based on Extreme Learning Machine and Least Square Support Vector Machine , 2015 .

[15]  D. Martin Swany,et al.  PerfSONAR: A Service Oriented Architecture for Multi-domain Network Monitoring , 2005, ICSOC.

[16]  Les Cottrell,et al.  The PingER project: active Internet performance monitoring for the HENP community , 2000, IEEE Commun. Mag..

[17]  Abhay Bansal,et al.  Analysis and clustering of PingER network data , 2016, 2016 6th International Conference - Cloud System and Big Data Engineering (Confluence).

[18]  Bruce M. Maggs,et al.  The Internet at the Speed of Light , 2014, HotNets.

[19]  Zheng Weiyon Network traffic combination forecasting based on encompassing tests and Support Vector Machine , 2013 .

[20]  Saqib Ali,et al.  Substituting Missing Values in End-to-End Internet Performance Measurements Using k-Nearest Neighbors , 2018, 2018 IEEE 16th Intl Conf on Dependable, Autonomic and Secure Computing, 16th Intl Conf on Pervasive Intelligence and Computing, 4th Intl Conf on Big Data Intelligence and Computing and Cyber Science and Technology Congress(DASC/PiCom/DataCom/CyberSciTech).

[21]  Xiaosheng Wang,et al.  Uncertain linear regression model and its application , 2017, J. Intell. Manuf..

[22]  Alan Fern,et al.  Gradient Boosting for Sequence Alignment , 2006, AAAI.

[23]  Henry X. Liu,et al.  Short Term Traffic Forecasting Using the Local Linear Regression Model , 2002 .

[24]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[25]  Saqib Ali,et al.  A Blockchain-Based Decentralized Data Storage and Access Framework for PingER , 2018, 2018 17th IEEE International Conference On Trust, Security And Privacy In Computing And Communications/ 12th IEEE International Conference On Big Data Science And Engineering (TrustCom/BigDataSE).

[26]  Vern Paxson End-to-end internet packet dynamics , 1999, TNET.

[27]  Pengjian Shang,et al.  Nonlinear analysis of traffic time series at different temporal scales , 2006 .

[28]  Dulakshi S. K. Karunasinghe,et al.  Chaotic time series prediction with a global model: Artificial neural network , 2006 .

[29]  Enrico Gregori,et al.  Smartphone-based crowdsourcing for network monitoring: Opportunities, challenges, and a case study , 2014, IEEE Communications Magazine.

[30]  Henry X. Liu,et al.  Use of Local Linear Regression Model for Short-Term Traffic Forecasting , 2003 .

[31]  Saqib Ali,et al.  Detecting Anomalies from End-to-End Internet Performance Measurements (PingER) Using Cluster Based Local Outlier Factor , 2017, 2017 IEEE International Symposium on Parallel and Distributed Processing with Applications and 2017 IEEE International Conference on Ubiquitous Computing and Communications (ISPA/IUCC).

[32]  Abdelfettah Belghith,et al.  Hybrid Live P2P Streaming Protocol , 2014, ANT/SEIT.