Comparison of supervised machine learning techniques for customer churn prediction based on analysis of customer behavior

Purpose This paper aims to provide a predictive framework of customer churn through six stages for accurate prediction and preventing customer churn in the field of business. Design/methodology/approach The six stages are as follows: first, collection of customer behavioral data and preparation of the data; second, the formation of derived variables and selection of influential variables, using a method of discriminant analysis; third, selection of training and testing data and reviewing their proportion; fourth, the development of prediction models using simple, bagging and boosting versions of supervised machine learning; fifth, comparison of churn prediction models based on different versions of machine-learning methods and selected variables; and sixth, providing appropriate strategies based on the proposed model. Findings According to the results, five variables, the number of items, reception of returned items, the discount, the distribution time and the prize beside the recency, frequency and monetary (RFM) variables (RFMITSDP), were chosen as the best predictor variables. The proposed model with accuracy of 97.92 per cent, in comparison to RFM, had much better performance in churn prediction and among the supervised machine learning methods, artificial neural network (ANN) had the highest accuracy, and decision trees (DT) was the least accurate one. The results show the substantially superiority of boosting versions in prediction compared with simple and bagging models. Research limitations/implications The period of the available data was limited to two years. The research data were limited to only one grocery store whereby it may not be applicable to other industries; therefore, generalizing the results to other business centers should be used with caution. Practical implications Business owners must try to enforce a clear rule to provide a prize for a certain number of purchased items. Of course, the prize can be something other than the purchased item. Business owners must accept the items returned by the customers for any reasons, and the conditions for accepting returned items and the deadline for accepting the returned items must be clearly communicated to the customers. Store owners must consider a discount for a certain amount of purchase from the store. They have to use an exponential rule to increase the discount when the amount of purchase is increased to encourage customers for more purchase. The managers of large stores must try to quickly deliver the ordered items, and they should use equipped and new transporting vehicles and skilled and friendly workforce for delivering the items. It is recommended that the types of services, the rules for prizes, the discount, the rules for accepting the returned items and the method of distributing the items must be prepared and shown in the store for all the customers to see. The special services and reward rules of the store must be communicated to the customers using new media such as social networks. To predict the customer behaviors based on the data, the future researchers should use the boosting method because it increases efficiency and accuracy of prediction. It is recommended that for predicting the customer behaviors, particularly their churning status, the ANN method be used. To extract and select the important and effective variables influencing customer behaviors, the discriminant analysis method can be used which is a very accurate and powerful method for predicting the classes of the customers. Originality/value The current study tries to fill this gap by considering five basic and important variables besides RFM in stores, i.e. prize, discount, accepting returns, delay in distribution and the number of items, so that the business owners can understand the role services such as prizes, discount, distribution and accepting returns play in retraining the customers and preventing them from churning. Another innovation of the current study is the comparison of machine-learning methods with their boosting and bagging versions, especially considering the fact that previous studies do not consider the bagging method. The other reason for the study is the conflicting results regarding the superiority of machine-learning methods in a more accurate prediction of customer behaviors, including churning. For example, some studies introduce ANN (Huang et al., 2010; Hung and Wang, 2004; Keramati et al., 2014; Runge et al., 2014), some introduce support vector machine ( Guo-en and Wei-dong, 2008; Vafeiadis et al., 2015; Yu et al., 2011) and some introduce DT (Freund and Schapire, 1996; Qureshi et al., 2013; Umayaparvathi and Iyakutti, 2012) as the best predictor, confusing the users of the results of these studies regarding the best prediction method. The current study identifies the best prediction method specifically in the field of store businesses for researchers and the owners. Moreover, another innovation of the current study is using discriminant analysis for selecting and filtering variables which are important and effective in predicting churners and non-churners, which is not used in previous studies. Therefore, the current study is unique considering the used variables, the method of comparing their accuracy and the method of selecting effective variables.

[1]  Ali Tamaddoni Jahromi,et al.  Managing B2B customer churn, retention and profitability , 2014 .

[2]  Peng Gao,et al.  Churn prediction for high-value players in casual social games , 2014, 2014 IEEE Conference on Computational Intelligence and Games.

[3]  Nikola Simidjievski,et al.  Predicting long-term population dynamics with bagging and boosting of process-based models , 2015, Expert Syst. Appl..

[4]  Kristof Coussement,et al.  Improving customer attrition prediction by integrating emotions from client/company interaction emails and evaluating multiple classifiers , 2009, Expert Syst. Appl..

[5]  K. Iyakutti,et al.  Applications of Data Mining Techniques in Telecom Churn Prediction , 2012 .

[6]  Cheng-Jung Lin,et al.  Goal-oriented sequential pattern for network banking churn analysis , 2003, Expert Syst. Appl..

[7]  Guangquan Zhang,et al.  A Customer Churn Prediction Model in Telecom Industry Using Boosting , 2014, IEEE Transactions on Industrial Informatics.

[8]  Alan Dick,et al.  Customer loyalty: Toward an integrated conceptual framework , 1994 .

[9]  Xin-an Lai Segmentation Study on Enterprise Customers Based on Data Mining Technology , 2009, 2009 First International Workshop on Database Technology and Applications.

[10]  S. Tsay,et al.  INTEGRATING OF SOM AND K-MEAN IN DATA MINING CLUSTERING: AN EMPIRICAL STUDY OF CRM AND PROFITABILITY EVALUATION , 2004 .

[11]  Pennie Frow,et al.  The role of multichannel integration in customer relationship management , 2004 .

[12]  David C. Yen,et al.  Applying data mining to telecom churn management , 2006, Expert Syst. Appl..

[13]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[14]  Amir Khanlari,et al.  CUSTOMER LIFETIME VALUE (CLV) MEASUREMENT BASED ON RFM MODEL , 2007 .

[15]  Koen W. De Bock,et al.  An empirical evaluation of rotation-based ensemble classifiers for customer churn prediction , 2011, Expert Syst. Appl..

[16]  Dirk Van den Poel,et al.  Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting , 2005, Eur. J. Oper. Res..

[17]  Harsh Vardhan Samalia,et al.  A Business Intelligence Perspective for Churn Management , 2014 .

[18]  Abbas Keramati,et al.  Improved churn prediction in telecommunication industry using data mining techniques , 2014, Appl. Soft Comput..

[19]  Ashutosh Tiwari,et al.  Computer assisted customer churn management: State-of-the-art and future trends , 2007, Comput. Oper. Res..

[20]  Jun Guo,et al.  An extended support vector machine forecasting framework for customer churn in e-commerce , 2011, Expert Syst. Appl..

[21]  Bart Baesens,et al.  Modeling churn using customer lifetime value , 2009, Eur. J. Oper. Res..

[22]  Judith W. Kincaid,et al.  Customer Relationship Management: Getting It Right! , 2002 .

[23]  Stephen F. King Citizens as customers: Exploring the future of CRM in UK local government , 2007, Gov. Inf. Q..

[24]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[25]  J. Sheth,et al.  Customer Relationship Management: Emerging Practice, Process, and Discipline , 2002 .

[26]  Chih-Fong Tsai,et al.  Variable selection by association rules for customer churn prediction of multimedia on demand , 2010, Expert Syst. Appl..

[27]  Philip H. Williams,et al.  Plant MicroRNA Prediction by Supervised Machine Learning Using C5.0 Decision Trees , 2012, Journal of nucleic acids.

[28]  Sami Madani,et al.  Mining changes in customer purchasing behavior : a data mining approach , 2009 .

[29]  Meltem Caber,et al.  Using data mining techniques for profiling profitable hotel customers: An application of RFM analysis , 2016 .

[30]  J. Bowen,et al.  Loyalty: A Strategic Commitment , 1998 .

[31]  David C. Yen,et al.  Data mining techniques for customer relationship management , 2002 .

[32]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[33]  Der-Chiang Li,et al.  A two-stage clustering method to analyze customer characteristics to build discriminative customer management: A case of textile manufacturing business , 2011, Expert Syst. Appl..

[34]  Hee-Su Kim,et al.  Determinants of subscriber churn and customer loyalty in the Korean mobile telephony market , 2004 .

[35]  Dirk Van den Poel,et al.  Customer attrition analysis for financial services using proportional hazard models , 2004, Eur. J. Oper. Res..

[36]  Kristof Coussement,et al.  Customer churn prediction in the online gambling industry: The beneficial effect of ensemble learning , 2013 .

[37]  J. Miglautsch Thoughts on RFM scoring , 2000 .

[38]  Gülhayat Gölbaşı Şimşek,et al.  The Antecedents of Customer Loyalty , 2014 .

[39]  John A. McCarty,et al.  SEGMENTATION APPROACHES IN DATA MINING: A COMPARISON OF RFM, CHAID, AND LOGISTIC REGRESSION , 2007 .

[40]  Yinghong Li,et al.  Predicting customer purchase behavior in the e-commerce context , 2015, Electron. Commer. Res..

[41]  Abdul Kadir Othman,et al.  The Relationship between Loyalty Program, Customer Satisfaction and Customer Loyalty in Retail Industry: A Case Study , 2014 .

[42]  M. Tahar Kechadi,et al.  Customer churn prediction in telecommunications , 2012, Expert Syst. Appl..

[43]  Li Xiu,et al.  Application of data mining techniques in customer relationship management: A literature review and classification , 2009, Expert Syst. Appl..

[44]  Sven F. Crone,et al.  The impact of preprocessing on data mining: An evaluation of classifier sensitivity in direct marketing , 2006, Eur. J. Oper. Res..

[45]  M. Tahar Kechadi,et al.  A new feature set with new window techniques for customer churn prediction in land-line telecommunications , 2010, Expert Syst. Appl..

[46]  Guo-en Xia,et al.  Model of Customer Churn Prediction on Support Vector Machine , 2008 .

[47]  Repeat patronage: Cultivating alliances with customers , 1988 .

[48]  U. Devi Prasad,et al.  Prediction of Churn Behaviour of Bank Customers Using Data Mining Tools , 2012 .

[49]  Hsin-Hung Wu,et al.  A review of the application of RFM model , 2010 .

[50]  Ali Mustafa Qamar,et al.  Telecommunication subscribers' churn prediction model using machine learning , 2013, Eighth International Conference on Digital Information Management (ICDIM 2013).

[51]  Slobodan Ivanović,et al.  CRM DEVELOPMENT IN HOSPITALITY COMPANIES FOR THE PURPOSE OF INCREASING THE COMPETITIVENESS IN THE TOURIST MARKET , 2011 .

[52]  Chu-Chai Henry Chan,et al.  Intelligent value-based customer segmentation method for campaign management: A case study of automobile retailer , 2008, Expert Syst. Appl..

[53]  Özden Gür Ali,et al.  Dynamic churn prediction framework with more effective use of rare event data: The case of private banking , 2014, Expert Syst. Appl..

[54]  Michael Braun,et al.  Modeling Customer Lifetimes with Multiple Causes of Churn , 2011, Mark. Sci..

[55]  Bart Baesens,et al.  New insights into churn prediction in the telecommunication sector: A profit driven data mining approach , 2012, Eur. J. Oper. Res..

[56]  Chih-Hsuan Wang,et al.  Apply robust segmentation to the service industry using kernel induced fuzzy clustering techniques , 2010, Expert Syst. Appl..

[57]  I-Cheng Yeh,et al.  Knowledge discovery on RFM model using Bernoulli sequence , 2009, Expert Syst. Appl..

[58]  Divya Tomar,et al.  A comparison on multi-class classification methods based on least squares twin support vector machine , 2015, Knowl. Based Syst..

[59]  Seyed Mohammad Seyedhosseini,et al.  Cluster analysis using data mining approach to develop CRM methodology to assess the customer loyalty , 2010, Expert Syst. Appl..

[60]  Jia Hong-bo,et al.  Type 2 diabetes data processing with EM and C4.5 algorithm , 2007, 2007 IEEE/ICME International Conference on Complex Medical Engineering.

[61]  C. Marcus A practical yet meaningful approach to customer segmentation , 1998 .

[62]  Hsuan-Kai Chen,et al.  Customer relationship management in the hairdressing industry: An application of data mining techniques , 2013, Expert Syst. Appl..

[63]  F. F. Reichheld,et al.  Zero defections: quality comes to services. , 1990, Harvard business review.

[64]  Konstantinos I. Diamantaras,et al.  A comparison of machine learning techniques for customer churn prediction , 2015, Simul. Model. Pract. Theory.

[65]  Ssu-Han Chen,et al.  The gamma CUSUM chart method for online customer churn prediction , 2016, Electron. Commer. Res. Appl..

[66]  D. Christodoulakis,et al.  Customer clustering using RFM analysis , 2005 .

[67]  Chi-Hyuck Jun,et al.  Improved churn prediction in telecommunication industry by analyzing a large network , 2014, Expert Syst. Appl..