Neural network architecture based on gradient boosting for IoT traffic prediction

Abstract Network traffic forecasting is an operational and management function that is critical for any data network. It is even more important for IoT networks given the number of connected elements and the real-time nature of many connections. This work presents a novel deep learning architecture applicable to this supervised regression problem. It is based on an additive network model formed by ‘learning blocks’ that are stacked iteratively following, in part, the principles of gradient boosting models. The resulting architecture is trained end-to-end using stochastic gradient descent. This new architecture has connections with residual, stacked and boosted networks, being different from any of them. Like residual networks, it shows excellent convergence behavior during training and allows for deeper models. It has a regularization effect similar to stacked models and presents excellent prediction results as gradient boosting models do. The building elements of the architecture are neural network blocks or learning blocks, that can be constituted by a sequence of simple fully connected layers or by more elaborate dispositions of recurrent and convolutional layers. The resulting architecture is a generic additive network (gaNet) applicable to any supervised regression problem. To obtain experimental results on a hard prediction problem, the model is applied to the forecasting of network traffic using IoT traffic volume real data from a mobile operator. The paper presents a comprehensive comparison of results between the proposed new model and many alternative algorithms, showing important improvements in terms of prediction performance metrics and training/prediction processing times.

[1]  Mohamed Faten Zhani,et al.  Analysis and Prediction of Real Network Traffic , 2009, J. Networks.

[2]  San-qi Li,et al.  A predictability analysis of network traffic , 2002, Comput. Networks.

[3]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[4]  Amit P. Sheth,et al.  Machine learning for Internet of Things data analysis: A survey , 2017, Digit. Commun. Networks.

[5]  Jaime Lloret,et al.  Network Traffic Classifier With Convolutional and Recurrent Neural Networks for Internet of Things , 2017, IEEE Access.

[6]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[7]  Xin Wang,et al.  Machine Learning for Networking: Workflow, Advances and Opportunities , 2017, IEEE Network.

[8]  Tjeng Wawan Cenggoro,et al.  Dynamic bandwidth management based on traffic prediction using Deep Long Short Term Memory , 2016, 2016 2nd International Conference on Science in Information Technology (ICSITech).

[9]  Concha Bielza,et al.  A survey on multi‐output regression , 2015, WIREs Data Mining Knowl. Discov..

[10]  Raouf Boutaba,et al.  A comprehensive survey on machine learning for networking: evolution, applications and research opportunities , 2018, Journal of Internet Services and Applications.

[11]  Jaime Lloret,et al.  Deep Learning Model for Multimedia Quality of Experience Prediction Based on Network Flow Packets , 2018, IEEE Communications Magazine.

[12]  Shipeng Xie,et al.  Alcoholism Identification Based on an AlexNet Transfer Learning Model , 2019, Front. Psychiatry.

[13]  Kun Zhu,et al.  A seasonal ARIMA model with exogenous variables for elspot electricity prices in Sweden , 2013, 2013 10th International Conference on the European Energy Market (EEM).

[14]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[15]  Yantai Shu,et al.  Study on network traffic prediction techniques , 2005, Proceedings. 2005 International Conference on Wireless Communications, Networking and Mobile Computing, 2005..

[16]  Jun Lv,et al.  Network traffic prediction and fault detection based on adaptive linear model , 2004, 2004 IEEE International Conference on Industrial Technology, 2004. IEEE ICIT '04..

[17]  Shuihua Wang,et al.  Cerebral Micro-Bleeding Detection Based on Densely Connected Neural Network , 2019, Front. Neurosci..

[18]  Zhitang Chen,et al.  Predicting future traffic using Hidden Markov Models , 2016, 2016 IEEE 24th International Conference on Network Protocols (ICNP).

[19]  Alberto Mozo,et al.  Forecasting short-term data center network traffic load with convolutional neural networks , 2018, PloS one.

[20]  Yoshua Bengio,et al.  Boosting Neural Networks , 2000, Neural Computation.

[21]  Nei Kato,et al.  State-of-the-Art Deep Learning: Evolving Machine Intelligence Toward Tomorrow’s Intelligent Network Traffic Control Systems , 2017, IEEE Communications Surveys & Tutorials.

[22]  Rob J Hyndman,et al.  Another look at measures of forecast accuracy , 2006 .

[23]  Oliver W. W. Yang,et al.  Wireless traffic modeling and prediction using seasonal ARIMA models , 2003, IEEE International Conference on Communications, 2003. ICC '03..

[24]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[25]  Jian Yang,et al.  Boosted Convolutional Neural Networks , 2016, BMVC.

[26]  Manuel López Martín,et al.  Review of Methods to Predict Connectivity of IoT Wireless Devices , 2017, Ad Hoc Sens. Wirel. Networks.

[27]  Stefano Ermon,et al.  Deep Transfer Learning for Crop Yield Prediction with Remote Sensing Data , 2018, COMPASS.

[28]  Germain Forestier,et al.  Transfer learning for time series classification , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[29]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[30]  ZhiHui Lv,et al.  RPPS: A Novel Resource Prediction and Provisioning Scheme in Cloud Data Center , 2012, 2012 IEEE Ninth International Conference on Services Computing.

[31]  Nei Kato,et al.  A Survey on Network Methodologies for Real-Time Analytics of Massive IoT Data and Open Research Issues , 2017, IEEE Communications Surveys & Tutorials.

[32]  Qun Dai,et al.  A novel transfer learning framework for time series forecasting , 2018, Knowl. Based Syst..

[33]  Mikel Izal,et al.  Computation of Traffic Time Series for Large Populations of IoT Devices , 2019, Sensors.

[34]  Dongfeng Yuan,et al.  Citywide Cellular Traffic Prediction Based on Densely Connected Convolutional Neural Networks , 2018, IEEE Communications Letters.