Offering a hybrid approach of data mining to predict the customer churn based on bagging and boosting methods

Purpose – Churn management is a fundamental process in firms to keep their customers. Therefore, predicting the customer’s churn is essential to facilitate such processes. The literature has introduced data mining approaches for this purpose. On the other hand, results indicate that performance of classification models increases by combining two or more techniques. The purpose of this paper is to propose a combined model based on clustering and ensemble classifiers. Design/methodology/approach – Based on churn data set in Cell2Cell, single baseline classifiers, ensemble classifiers are used for comparisons. Specifically, self-organizing map (SOM) clustering technique, and four other classifier techniques including decision tree, artificial neural networks, support vector machine, and K-nearest neighbors were used. Moreover, for reduced dimensions of the features, principal component analysis (PCA) method was employed. Findings – As results 14 models are compared with each other regarding accuracy, sensiti...

[1]  Yen-Liang Chen,et al.  Constructing a multi-valued and multi-labeled decision tree , 2003, Expert Syst. Appl..

[2]  Gwo-Hshiung Tzeng,et al.  Combined rough set theory and flow network graph to predict customer churn in credit card accounts , 2011, Expert Syst. Appl..

[3]  Nan-Chen Hsieh,et al.  Hybrid mining approach in the design of credit scoring models , 2005, Expert Syst. Appl..

[4]  Asifullah Khan,et al.  Intelligent churn prediction in telecom: employing mRMR feature selection and RotBoost based ensemble classification , 2013, Applied Intelligence.

[5]  Kristof Coussement,et al.  Customer churn prediction in the online gambling industry: The beneficial effect of ensemble learning , 2013 .

[6]  Asifullah Khan,et al.  Customer churn prediction for telecommunication: Employing various various features selection techniques and tree based ensemble classifiers , 2012, 2012 15th International Multitopic Conference (INMIC).

[7]  Stanley F. Slater,et al.  Intelligence generation and superior customer value , 2000 .

[8]  Li Xiu,et al.  Application of data mining techniques in customer relationship management: A literature review and classification , 2009, Expert Syst. Appl..

[9]  Yong Shi,et al.  Finding the Hidden Pattern of Credit Card Holder's Churn: A Case of China , 2009, ICCS.

[10]  David C. Yen,et al.  Predicting stock returns by classifier ensembles , 2011, Appl. Soft Comput..

[11]  Chih-Fong Tsai,et al.  Modeling credit scoring using neural network ensembles , 2014, Kybernetes.

[12]  Asifullah Khan,et al.  Genetic Programming and Adaboosting based churn prediction for Telecom , 2012, 2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC).

[13]  Reza Tavakkoli-Moghaddam,et al.  Customer churn prediction using a hybrid method and censored data , 2013 .

[14]  Yong Shi,et al.  Credit card churn forecasting by logistic regression and decision tree , 2011, Expert Syst. Appl..

[15]  Fan Li,et al.  Model Selection Strategy for Customer Attrition Risk Prediction in Retail Banking , 2011, AusDM.

[16]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[17]  Chih-Fong Tsai,et al.  Credit rating by hybrid machine learning techniques , 2010, Appl. Soft Comput..

[18]  Chih-Fong Tsai,et al.  Customer churn prediction by hybrid neural networks , 2009, Expert Syst. Appl..

[19]  Reza Tavakkoli-Moghaddam,et al.  Hierarchical Neural Regression Models for Customer Churn Prediction , 2013 .

[20]  David West,et al.  Neural network ensemble strategies for financial decision applications , 2005, Comput. Oper. Res..

[21]  Francisco Herrera,et al.  A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[22]  Vadlamani Ravi,et al.  Data Mining Using Rules Extracted from SVM: An Application to Churn Prediction in Bank Credit Cards , 2009, RSFDGrC.

[23]  Abbas Keramati,et al.  Improved churn prediction in telecommunication industry using data mining techniques , 2014, Appl. Soft Comput..

[24]  Wooju Kim,et al.  Combination of multiple classifiers for the customer's purchase behavior prediction , 2003, Decis. Support Syst..

[25]  Chih-Fong Tsai,et al.  Combining cluster analysis with classifier ensembles to predict financial distress , 2014, Inf. Fusion.

[26]  David G. Stork,et al.  Pattern Classification , 1973 .

[27]  Chih-Fong Tsai,et al.  Dimensionality and data reduction in telecom churn prediction , 2014, Kybernetes.

[28]  Christophe Croux,et al.  Bagging and Boosting Classification Trees to Predict Churn , 2006 .