Time series for early churn detection: Using similarity based classification for dynamic networks

Abstract The success of retention campaigns in fast-moving and saturated markets, such as the telecommunication industry, often depends on accurately predicting potential churners. Being able to identify certain behavioral patterns that lead to churn is important, because it allows the organization to make arrangements for retention in a timely manner. Moreover, previous research has shown that the decision to leave one operator for another, is often influenced by the customer’s social circle. Therefore, features that represent the churn status of their connections are usually good predictors of churn when it is treated as a binary classification problem, which is the traditional approach. We propose a novel method to extract time series data from call networks to represent dynamic customer behavior. More precisely, we use call detail records of the customers of a telecommunication provider to build call networks on a weekly basis over the period of six months. From each network, we extract features based on each customer’s connections within the network, resulting in individual time series of link-based measures. The time series are then classified using the recently proposed similarity forests method, which we further extend in various ways to accommodate multivariate time series. We show that predicting churn with customer behavior represented by time series is a suitable option. According to our results, the similarity forests method together with some of our proposed extensions, perform better than the one-nearest neighbor benchmark for time series classification. Using a time series of a single feature only, the similarity forests method performs as good as traditional churn prediction methods using more features. In fact, compared to traditional methods, similarity forests based approaches perform better when predicting further in the future, and are therefore better at detecting churn early, allowing organizations to make arrangements for retention in a timely manner.

[1]  Bart Baesens,et al.  Social network analysis for customer churn prediction , 2014, Appl. Soft Comput..

[2]  Ruggero G. Pensa,et al.  Introduction to the special issue on dynamic networks and knowledge discovery , 2017, Machine Learning.

[3]  Minghe Sun,et al.  A hierarchical multiple kernel support vector machine for customer churn prediction using longitudinal behavioral data , 2012, Eur. J. Oper. Res..

[4]  Claude Sammut,et al.  Classification of Multivariate Time Series and Structured Data Using Constructive Induction , 2005, Machine Learning.

[5]  Junyi Shen,et al.  Classification of multivariate time series using locality preserving projections , 2008, Knowl. Based Syst..

[6]  Esteban Moro Egido,et al.  Temporal patterns behind the strength of persistent ties , 2017, EPJ Data Science.

[7]  Gueorgi Kossinets,et al.  Empirical Analysis of an Evolving Social Network , 2006, Science.

[8]  S. García,et al.  An Extension on "Statistical Comparisons of Classifiers over Multiple Data Sets" for all Pairwise Comparisons , 2008 .

[9]  Arun Sundararajan,et al.  Distinguishing influence-based contagion from homophily-driven diffusion in dynamic networks , 2009, Proceedings of the National Academy of Sciences.

[10]  Bart Baesens,et al.  A comparative study of social network classifiers for predicting churn in the telecommunication industry , 2016, 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM).

[11]  Gerhard Thonhauser,et al.  Multivariate Time Series Classification by Combining Trend-Based and Value-Based Approximations , 2012, ICCSA.

[12]  Marco Fiore,et al.  Large-Scale Mobile Traffic Analysis: A Survey , 2016, IEEE Communications Surveys & Tutorials.

[13]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[14]  Chi-Hyuck Jun,et al.  Improved churn prediction in telecommunication industry by analyzing a large network , 2014, Expert Syst. Appl..

[15]  Eamonn J. Keogh,et al.  The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances , 2016, Data Mining and Knowledge Discovery.

[16]  Dino Pedreschi,et al.  Tiles: an online algorithm for community discovery in dynamic social networks , 2017, Machine Learning.

[17]  Arno De Caigny,et al.  A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees , 2018, Eur. J. Oper. Res..

[18]  Nagiza F. Samatova,et al.  Efficient Outlier Detection in Hyperedge Streams Using MinHash and Locality-Sensitive Hashing , 2017, COMPLEX NETWORKS.

[19]  Ji Zhu,et al.  Predicting customer churn through interpersonal influence , 2012, Knowl. Based Syst..

[20]  Bart Baesens,et al.  A Novel Profit Maximizing Metric for Measuring Classification Performance of Customer Churn Prediction Models , 2013, IEEE Transactions on Knowledge and Data Engineering.

[21]  Denis Helic,et al.  Modeling Peer Influence in Time-Varying Networks , 2017, COMPLEX NETWORKS.

[22]  Shan Liu,et al.  An effective multivariate time series classification approach using echo state network and adaptive differential evolution algorithm , 2016, Expert Syst. Appl..

[23]  Tomasz Górecki,et al.  Multivariate time series classification with parametric derivative dynamic time warping , 2015, Expert Syst. Appl..

[24]  Charu C. Aggarwal,et al.  Similarity Forests , 2017, KDD.

[25]  Yun Chi,et al.  Facetnet: a framework for analyzing communities and their evolutions in dynamic networks , 2008, WWW.

[26]  Bart Baesens,et al.  Predicting time-to-churn of prepaid mobile telephone customers using social network analysis , 2016, J. Oper. Res. Soc..

[27]  Nicola Santoro,et al.  Time-Varying Graphs and Social Network Analysis: Temporal Indicators and Metrics , 2011, ArXiv.

[28]  Milos Hauskrecht,et al.  Multivariate Time Series Classification with Temporal Abstractions , 2009, FLAIRS.

[29]  Carlo Vercellis,et al.  Combining discrete SVM and fixed cardinality warping distances for multivariate time series classification , 2010, Pattern Recognit..

[30]  Mohammad Khansari,et al.  An Opportunistic Network Approach Towards Disease Spreading , 2017, COMPLEX NETWORKS.

[31]  Bart Baesens,et al.  Development and application of consumer credit scoring models using profit-based classification measures , 2014, Eur. J. Oper. Res..

[32]  Bart Baesens,et al.  Social network analytics for churn prediction in telco: Model building, evaluation and network architecture , 2017, Expert Syst. Appl..

[33]  Jun Wang,et al.  Generalizing DTW to the multi-dimensional case requires an adaptive approach , 2016, Data Mining and Knowledge Discovery.

[34]  Bart Baesens,et al.  Building comprehensible customer churn prediction models with advanced rule induction techniques , 2011, Expert Syst. Appl..

[35]  Nicola Barbieri,et al.  Evolution of Ego-networks in Social Media with Link Recommendations , 2017, WSDM.

[36]  Hüseyin Uzunalioglu,et al.  Prediction of subscriber churn using social network analysis , 2013, Bell Labs Technical Journal.

[37]  Yong Duan,et al.  Early classification on multivariate time series , 2015, Neurocomputing.