Feature-Based Time Series Classification for Service Request Opening Prediction in the Telecom Industry

Telecommunication companies face the challenge to reduce the number of service request openings (SROs). A predictive behavior able to reduce this number can improve customers experience and decrease operational costs. This paper proposes a machine learning (ML) based approach to reduce the number of SROs. For such, it uses real data from a Brazilian telecom operator. The proposed approach uses feature-based time series extracted from network equipment’s signals, modeling the problem as a binary classification task. We carry out experiments to investigate the impact of long-term and short-term windows in the predictive performance. After pre-processing the data, we apply different classifiers algorithms. According to experimental results, a high predictive performance was obtained, mainly when long-term network behavior data was used. These results have a positive impact in the company costs.

[1]  Zheng Hu,et al.  Not Too Late to Identify Potential Churners: Early Churn Prediction in Telecommunication Industry , 2016, 2016 IEEE/ACM 3rd International Conference on Big Data Computing Applications and Technologies (BDCAT).

[2]  Matteo Terzi,et al.  Time-Series Classification Methods: Review and Applications to Power Systems Data , 2018 .

[3]  Caroline Kleist,et al.  Time Series Data Mining Methods: A Review , 2015 .

[4]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[5]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[6]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[7]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.

[8]  Shi Jin,et al.  Failure Prediction Based on Anomaly Detection for Complex Core Routers , 2018, 2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD).

[9]  F ROSENBLATT,et al.  The perceptron: a probabilistic model for information storage and organization in the brain. , 1958, Psychological review.

[10]  Alexander A. Totok,et al.  Modern Internet Services: Exploiting Service Usage Information for Optimizing Service Management , 2009 .

[11]  Hai Jin,et al.  A Cross-Platform Consumer Behavior Analysis of Large-Scale Mobile Shopping Data , 2018, WWW.

[12]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[13]  Mohak Shah,et al.  Evaluating Learning Algorithms: A Classification Perspective , 2011 .

[14]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[15]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[16]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[17]  Chih-Ping Wei,et al.  Turning telecommunications call details to churn prediction: a data mining approach , 2002, Expert Syst. Appl..

[18]  Yiannis Gkoufas,et al.  Towards real-time customer experience prediction for telecommunication operators , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[19]  R. Prashanth,et al.  High Accuracy Predictive Modelling for Customer Churn Prediction in Telecom Industry , 2017, MLDM.

[20]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[21]  Àngela Nebot,et al.  Intelligent data analysis approaches to churn as a business problem: a survey , 2017, Knowledge and Information Systems.