Telco Churn Prediction with Big Data

We show that telco big data can make churn prediction much more easier from the $3$V's perspectives: Volume, Variety, Velocity. Experimental results confirm that the prediction performance has been significantly improved by using a large volume of training data, a large variety of features from both business support systems (BSS) and operations support systems (OSS), and a high velocity of processing new coming data. We have deployed this churn prediction system in one of the biggest mobile operators in China. From millions of active customers, this system can provide a list of prepaid customers who are most likely to churn in the next month, having $0.96$ precision for the top $50000$ predicted churners in the list. Automatic matching retention campaigns with the targeted potential churners significantly boost their recharge rates, leading to a big business value.

[1]  Kristof Coussement,et al.  Faculteit Economie En Bedrijfskunde Hoveniersberg 24 B-9000 Gent Churn Prediction in Subscription Services: an Application of Support Vector Machines While Comparing Two Parameter-selection Techniques Churn Prediction in Subscription Services: an Application of Support Vector Machines While Comparin , 2022 .

[2]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[3]  Dirk Van den Poel,et al.  Handling class imbalance in customer churn prediction , 2009, Expert Syst. Appl..

[4]  João Bártolo Gomes,et al.  Predicting Near-Future Churners and Win-Backs in the Telecommunications Industry , 2012, ArXiv.

[5]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[6]  Li Xiu,et al.  The Application ofAdaBoost in Customer Churn Prediction , 2007, 2007 International Conference on Service Systems and Service Management.

[7]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[8]  Foster J. Provost,et al.  Predictive Modeling With Big Data: Is Bigger Really Better? , 2013, Big Data.

[9]  Marcin Owczarczuk,et al.  Churn models for prepaid customers in the cellular telecommunication industry using large data marts , 2010, Expert Syst. Appl..

[10]  Jiming Liu,et al.  Learning Topic Models by Belief Propagation , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Peter van der Putten,et al.  The Impact of Experimental Setup in Prepaid Churn Prediction for Mobile Telecommunications: What to Predict, for Whom and Does the Customer Experience Matter? , 2010, Trans. Mach. Learn. Data Min..

[12]  Jia Zeng,et al.  A topic modeling toolbox using belief propagation , 2012, J. Mach. Learn. Res..

[13]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[14]  Yong Seog Kim,et al.  Uniformly subsampled ensemble (USE) for churn management: Theory and implementation , 2012, Expert Syst. Appl..

[15]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[16]  Pushpa An Efficient Method of Building the Telecom Social Network for Churn Prediction , 2012 .

[17]  Guangquan Zhang,et al.  A Customer Churn Prediction Model in Telecom Industry Using Boosting , 2014, IEEE Transactions on Industrial Informatics.

[18]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[19]  Isabelle Guyon,et al.  Design and analysis of the KDD cup 2009: fast scoring on a large orange customer database , 2009, SKDD.

[20]  Bart Baesens,et al.  New insights into churn prediction in the telecommunication sector: A profit driven data mining approach , 2012, Eur. J. Oper. Res..

[21]  Mehmet Sabih Aksoy,et al.  A Survey On Data Mining Techniques In Customer Churn Analysis For Telecom Industry , 2014 .

[22]  Christophe Croux,et al.  Bagging and Boosting Classification Trees to Predict Churn , 2006 .

[23]  Matthew Rowe,et al.  The effect of user features on churn in social networks , 2011, WebSci '11.

[24]  Chih-Ping Wei,et al.  Turning telecommunications call details to churn prediction: a data mining approach , 2002, Expert Syst. Appl..

[25]  Bin Li,et al.  Automated Cellular Modeling and Prediction on a Large Scale , 2000, Artificial Intelligence Review.

[26]  Steffen Rendle Scaling Factorization Machines to Relational Data , 2013, Proc. VLDB Endow..

[27]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[28]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[29]  Sougata Mukherjea,et al.  Social ties and their relevance to churn in mobile telecom networks , 2008, EDBT '08.

[30]  Ashutosh Tiwari,et al.  Computer assisted customer churn management: State-of-the-art and future trends , 2007, Comput. Oper. Res..

[31]  Xin Yao,et al.  A novel evolutionary data mining algorithm with applications to churn prediction , 2003, IEEE Trans. Evol. Comput..

[32]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[33]  David C. Yen,et al.  Applying data mining to telecom churn management , 2006, Expert Syst. Appl..

[34]  Evangelos Xevelonakis Developing retention strategies based on customer profitability in telecommunications: An empirical study , 2005 .

[35]  Emilio Frazzoli,et al.  A review of urban computing for mobile phone traces: current methods, challenges and opportunities , 2013, UrbComp '13.

[36]  Shou-De Lin,et al.  Feature Engineering and Classifier Ensemble for KDD Cup 2010 , 2010, KDD 2010.

[37]  Zoubin Ghahramani,et al.  Learning from labeled and unlabeled data with label propagation , 2002 .

[38]  Bart Baesens,et al.  Domain knowledge integration in data mining using decision tables: case studies in churn prediction , 2009, J. Oper. Res. Soc..

[39]  Wagner A. Kamakura,et al.  Defection Detection: Measuring and Understanding the Predictive Accuracy of Customer Churn Models , 2006 .

[40]  Bart Baesens,et al.  Building comprehensible customer churn prediction models with advanced rule induction techniques , 2011, Expert Syst. Appl..