Anomaly prediction in mobile networks : A data driven approach for machine learning algorithm selection

In this paper, we propose a model for proactive anomaly detection in mobile networks. We show that when Key Performance Indicators (KPIs) are highly correlated, the linear regression gives a good accuracy in anomaly detection for a short prediction horizon. When the prediction horizon is far, KPIs become weakly correlated. We propose to transform discrete measurements into functional data and apply a functional data regression method to perform the prediction. We compare pro-posed models using KPI measurements obtained from a real Long Term Evolution (LTE) network. We show that an improvement in prediction performance can be obtained by using functional data analysis.

[1]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[2]  Pedro Casas,et al.  Machine-Learning Based Approaches for Anomaly Detection and Classification in Cellular Networks , 2016, TMA.

[3]  W. Marsden I and J , 2012 .

[4]  Georg Heinze,et al.  Variable selection – A review and recommendations for the practicing statistician , 2018, Biometrical journal. Biometrische Zeitschrift.

[5]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[6]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[7]  Gareth M. James Generalized linear models with functional predictors , 2002 .

[8]  Jun Wu,et al.  CellPAD: Detecting Performance Anomalies in Cellular Networks via Regression Analysis , 2018, 2018 IFIP Networking Conference (IFIP Networking) and Workshops.

[9]  Hao Helen Zhang,et al.  Consistent Group Identification and Variable Selection in Regression With Correlated Predictors , 2013, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[10]  Tomasz Górecki,et al.  Classification Problems Based on Regression Models for Multi-Dimensional Functional Data , 2015, Statistics in Transition new series.

[11]  Anja Vogler,et al.  An Introduction to Multivariate Statistical Analysis , 2004 .

[12]  Dallas S. Batten,et al.  Polynomial distributed lags and the estimation of the St. Louis equation , 1983 .

[13]  Guy Shani,et al.  A Survey of Accuracy Evaluation Metrics of Recommendation Tasks , 2009, J. Mach. Learn. Res..