Churn Prediction System for Telecom using Filter-Wrapper and Ensemble Classification

Churn prediction in telecom is a challenging data mining task for retaining customers, especially, when we have imbalanced class distribution, high dimensionality and large number of samples in training set. To cope with this challenging task of churn prediction, we propose a new intelligent churn prediction system for telecom, named FW-ECP. The novelty of the FW-ECP lies in its ability to combine both filterand wrapper-based feature selection as well as exploit the learning capability of an ensemble classifier built using diverse base classifiers. In the filter phase, Particle Swarm Optimization-based undersampling and mRMR feature selection are employed to reduce the effect of imbalanced class distribution and large dimensionality. In Wrapper phase, we employ Genetic Algorithm that further discards irrelevant and redundant features. Random Forest, Rotation Forest, RotBoost and SVMs are then employed to exploit the new feature space. Finally, the ensemble classifier is constructed using both majority voting and stacking. We have tested and compared the performance of proposed FW-ECP system on two publicly available standard telecom datasets: Orange and Cell2Cell. FW-ECP takes into account both the imbalanced nature and large dimensionality of the training sets and yields better prediction performances compared with existing state-of-the-art approaches. The feature spaces for the Orange and Cell2Cell datasets are reduced to 24D and 18D, from 260D and 76D, respectively. The AUCs obtained by FW-ECP are 0.85 and 0.82 for Orange and Cell2Cell datasets, respectively.

[1]  Sougata Mukherjea,et al.  Analyzing the Structure and Evolution of Massive Telecom Graphs , 2008, IEEE Transactions on Knowledge and Data Engineering.

[2]  Chun-Xia Zhang,et al.  RotBoost: A technique for combining Rotation Forest and AdaBoost , 2008, Pattern Recognit. Lett..

[3]  Asifullah Khan,et al.  Churn prediction in telecom using Random Forest and PSO based data balancing in combination with various feature selection strategies , 2012, Comput. Electr. Eng..

[4]  Guangquan Zhang,et al.  A Customer Churn Prediction Model in Telecom Industry Using Boosting , 2014, IEEE Transactions on Industrial Informatics.

[5]  張 毓騰,et al.  APPLYING DATA MINING TO TELECOM CHURN MANAGEMENT , 2009 .

[6]  Juan José Rodríguez Diez,et al.  An Experimental Study on Rotation Forest Ensembles , 2007, MCS.

[7]  Chi-Hyuck Jun,et al.  Improved churn prediction in telecommunication industry by analyzing a large network , 2014, Expert Syst. Appl..

[8]  A. Athanassopoulos Customer Satisfaction Cues To Support Market Segmentation and Explain Switching Behavior , 2000 .

[9]  Dan Steinberg,et al.  Stochastic Gradient Boosting: An Introduction to TreeNet™ , 2002, AusDM.

[10]  Eric Johnson,et al.  Predicting subscriber dissatisfaction and improving retention in the wireless telecommunications industry , 2000, IEEE Trans. Neural Networks Learn. Syst..

[11]  Asifullah Khan,et al.  Intelligent churn prediction in telecom: employing mRMR feature selection and RotBoost based ensemble classification , 2013, Applied Intelligence.

[12]  M. Tahar Kechadi,et al.  A new feature set with new window techniques for customer churn prediction in land-line telecommunications , 2010, Expert Syst. Appl..

[13]  Chun-Xia Zhang,et al.  A variant of Rotation Forest for constructing ensemble classifiers , 2010, Pattern Analysis and Applications.

[14]  Ching-Wei Wang,et al.  Boosting-SVM: effective learning with reduced data dimension , 2013, Applied Intelligence.

[15]  Francisco Herrera,et al.  A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[16]  M. Tahar Kechadi,et al.  Customer churn prediction in telecommunications , 2012, Expert Syst. Appl..

[17]  Daria Sorokina,et al.  Application of Additive Groves Ensemble with Multiple Counts Feature Evaluation to KDD Cup'09 Small Data Set , 2009, KDD Cup.

[18]  Marcin Owczarczuk,et al.  Churn models for prepaid customers in the cellular telecommunication industry using large data marts , 2010, Expert Syst. Appl..

[19]  Andrej Kosir,et al.  A diffusion model for churn prediction based on sociometric theory , 2015, Adv. Data Anal. Classif..

[20]  Dirk Van den Poel,et al.  Handling class imbalance in customer churn prediction , 2009, Expert Syst. Appl..

[21]  Bin Li,et al.  CHAMP: A Prototype for Automated Cellular Churn Prediction , 1999, Data Mining and Knowledge Discovery.

[22]  Abbas Keramati,et al.  Improved churn prediction in telecommunication industry using data mining techniques , 2014, Appl. Soft Comput..

[23]  Koen W. De Bock,et al.  An empirical evaluation of rotation-based ensemble classifiers for customer churn prediction , 2011, Expert Syst. Appl..

[24]  Brian J. d'Auriol,et al.  A novel feature selection method based on normalized mutual information , 2011, Applied Intelligence.

[25]  Jane Labadin,et al.  Feature selection based on mutual information , 2015, 2015 9th International Conference on IT in Asia (CITA).

[26]  Balázs Kégl,et al.  Accelerating AdaBoost using UCB , 2009, KDD Cup.

[27]  Thanaruk Theeramunkong,et al.  Multi-Label Classification Using Dependent and Independent Dual Space Reduction , 2013, Comput. J..

[28]  Saharon Rosset,et al.  Customer Lifetime Value Models for Decision Support , 2003, Data Mining and Knowledge Discovery.

[29]  Thomas G. Dietterich An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization , 2000, Machine Learning.

[30]  Hugh Miller,et al.  Predicting customer behaviour: The University of Melbourne's KDD Cup report , 2009, KDD Cup.

[31]  Bernard Zenko,et al.  Is Combining Classifiers with Stacking Better than Selecting the Best One? , 2004, Machine Learning.

[32]  YongSeog Kim,et al.  Toward a successful CRM: variable selection, sampling, and ensemble , 2006, Decis. Support Syst..

[33]  David C. Yen,et al.  Applying data mining to telecom churn management , 2006, Expert Syst. Appl..

[34]  Juan José Rodríguez Diez,et al.  Rotation Forest: A New Classifier Ensemble Method , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Torsten J. Gerpott,et al.  Regaining drifting mobile communication customers: Predicting the odds of success of winback efforts with competing risks regression , 2015, Expert Syst. Appl..

[36]  Sotiris B. Kotsiantis Cascade Generalization with Reweighting Data for Handling Imbalanced Problems , 2011, Comput. J..

[37]  Bart Baesens,et al.  New insights into churn prediction in the telecommunication sector: A profit driven data mining approach , 2012, Eur. J. Oper. Res..

[38]  Bart Baesens,et al.  Social network analysis for customer churn prediction , 2014, Appl. Soft Comput..

[39]  Antanas Verikas,et al.  Mining data with random forests: A survey and results of new tests , 2011, Pattern Recognit..

[40]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[41]  Mohand Tahar Kechadi,et al.  A new filter feature selection approach for customer churn prediction in telecommunications , 2010, 2010 IEEE International Conference on Industrial Engineering and Engineering Management.

[42]  Eric W. T. Ngai,et al.  Customer churn prediction using improved balanced random forests , 2009, Expert Syst. Appl..

[43]  Sinisa Todorovic,et al.  Local-Learning-Based Feature Selection for High-Dimensional Data Analysis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.