Customer attrition in retailing: An application of Multivariate Adaptive Regression Splines

The profit resulting from customer relationship is essential to ensure companies viability, so an improvement in customer retention is crucial for competitiveness. As such, companies have recognized the importance of customer centered strategies and consequently customer relationship management (CRM) is often at the core of their strategic plans. In this context, a priori knowledge about the risk of a given customer to mitigate or even end the relationship with the provider is valuable information that allows companies to take preventive measures to avoid defection. This paper proposes a model to predict partial defection, using two classification techniques: Logistic regression and Multivariate Adaptive Regression Splines (MARS). The main objective is to compare the performance of MARS with Logistic regression in modeling customer attrition. This paper considers the general form of Logistic regression and Logistic regression combined with a wrapper feature selection approach, such as stepwise approach. The empirical results showed that MARS performs better than Logistic regression when variable selection procedures are not used. However, MARS loses its superiority when Logistic regression is conducted with stepwise feature selection.

[1]  Ian Witten,et al.  Data Mining , 2000 .

[2]  Manish K. Desai Customer Churn - A Reason to Churn , 2006 .

[3]  Nissan Levin,et al.  Segmentation analysis with managerial judgment , 1996 .

[4]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[5]  Dirk Van den Poel,et al.  Separating financial from commercial customer churn: A modeling step towards resolving the conflict between the sales and credit department , 2008, Expert Syst. Appl..

[6]  Kathy Hammond,et al.  Fact and Fallacy in Retention Marketing , 2006 .

[7]  Jayanthi Ranjan,et al.  Data mining tools: a CRM perspective , 2008 .

[8]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[9]  Oguzhan Alagoz,et al.  Informatics in radiology: comparison of logistic regression and artificial neural network models in breast cancer risk estimation. , 2010, Radiographics : a review publication of the Radiological Society of North America, Inc.

[10]  Li Xiu,et al.  Application of data mining techniques in customer relationship management: A literature review and classification , 2009, Expert Syst. Appl..

[11]  Chih-Ping Wei,et al.  Turning telecommunications call details to churn prediction: a data mining approach , 2002, Expert Syst. Appl..

[12]  Ali Al-Shahib,et al.  Feature Selection and the Class Imbalance Problem in Predicting Protein Function from Sequence , 2005, Applied bioinformatics.

[13]  Couchen Wu,et al.  Counting your customers: Compounding customer's in-store decisions, interpurchase time and repurchasing behavior , 2000, Eur. J. Oper. Res..

[14]  F. Dwyer Customer lifetime valuation to support marketing decision making , 1997 .

[15]  Shiwei Tang,et al.  A Mixed Process Neural Network and its Application to Churn Prediction in Mobile Communications , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[16]  J N Weinstein,et al.  Quantitative structure-antitumor activity relationships of camptothecin analogues: cluster analysis and genetic algorithm-based studies. , 2001, Journal of medicinal chemistry.

[17]  Kent Eriksson,et al.  Customer Retention, Purchasing Behavior and Relationship Substance in Professional Services , 2000 .

[18]  Xiuqin Deng,et al.  Customer Churn Prediction of China Telecom Based on Cluster Analysis and Decision Tree Algorithm , 2012, AICI 2012.

[19]  Dirk Seifert,et al.  Collaborative Customer Relationship Management: Taking CRM to the Next Level , 2004 .

[20]  Don Peppers,et al.  CRM at the Speed of Light: Capturing and Keeping Customers in Internet Real Time , 2001 .

[21]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[22]  Lyle H. Ungar,et al.  A comparison of two nonparametric estimation schemes: MARS and neural networks , 1993 .

[23]  Alan Dick,et al.  Customer loyalty: Toward an integrated conceptual framework , 1994 .

[24]  Bart Baesens,et al.  Building comprehensible customer churn prediction models with advanced rule induction techniques , 2011, Expert Syst. Appl..

[25]  Lin Ma,et al.  Mining the customer credit using hybrid support vector machine technique , 2009, Expert Syst. Appl..

[26]  Nikolaos Tzokas,et al.  Some dangerous axioms of relationship marketing , 1998 .

[27]  J. Sheth,et al.  Customer Relationship Management: Emerging Practice, Process, and Discipline , 2002 .

[28]  Dirk Van den Poel,et al.  Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting , 2005, Eur. J. Oper. Res..

[29]  Adrian Payne,et al.  Relationship marketing for competitive advantage : winning and keeping customers , 1998 .

[30]  L. Fahrmeir,et al.  Correction: Consistency and Asymptotic Normality of the Maximum Likelihood Estimator in Generalized Linear Models , 1985 .

[31]  Michael J. A. Berry,et al.  Mastering Data Mining: The Art and Science of Customer Relationship Management , 1999 .

[32]  Gerhard-Wilhelm Weber,et al.  A new approach to multivariate adaptive regression splines by using Tikhonov regularization and continuous optimization , 2010 .

[33]  M. Shashi,et al.  Applicability of Data Mining Techniques for Churn Prediction , 2012 .

[34]  Ashutosh Tiwari,et al.  Computer assisted customer churn management: State-of-the-art and future trends , 2007, Comput. Oper. Res..

[35]  Xin Yao,et al.  A novel evolutionary data mining algorithm with applications to churn prediction , 2003, IEEE Trans. Evol. Comput..

[36]  Jun Guo,et al.  An extended support vector machine forecasting framework for customer churn in e-commerce , 2011, Expert Syst. Appl..

[37]  J. Friedman Multivariate adaptive regression splines , 1990 .

[38]  Dirk Van den Poel,et al.  Predicting customer retention and profitability by using random forests and regression forests techniques , 2005, Expert Syst. Appl..

[39]  Magnus Pettersson,et al.  SPC with Applications to Churn Management , 2004 .

[40]  R. Srivastava,et al.  Applying Latent Trait Analysis in the Evaluation of Prospects For Cross-Selling of Financial Services , 1991 .

[41]  Michael J. A. Berry,et al.  Data mining techniques - for marketing, sales, and customer support , 1997, Wiley computer publishing.

[42]  F. F. Reichheld,et al.  Zero defections: quality comes to services. , 1990, Harvard business review.

[43]  Haslina Mohd,et al.  A unified definition of CRM towards the successful adoption and implementation , 2010 .

[44]  Eric Johnson,et al.  Predicting subscriber dissatisfaction and improving retention in the wireless telecommunications industry , 2000, IEEE Trans. Neural Networks Learn. Syst..

[45]  Ronald Swift,et al.  Accelerating Customer Relationships: Using Crm and Relationship Technologies , 2000 .

[46]  Dirk Van den Poel,et al.  Customer attrition analysis for financial services using proportional hazard models , 2004, Eur. J. Oper. Res..

[47]  Bart Baesens,et al.  Faculteit Economie En Bedrijfskunde Hoveniersberg 24 B-9000 Gent Bayesian Network Classifiers for Identifying the Slope of the Customer Lifecycle of Long-life Customers Bayesian Network Classifiers for Identifying the Slope of the Customer Lifecycle of Long-life Customers , 2022 .

[48]  Tian-Shyug Lee,et al.  Mining the customer credit using classification and regression tree and multivariate adaptive regression splines , 2006, Comput. Stat. Data Anal..

[49]  Chih-Fong Tsai,et al.  Customer churn prediction by hybrid neural networks , 2009, Expert Syst. Appl..

[50]  Mevlut Ture,et al.  Comparing classification techniques for predicting essential hypertension , 2005, Expert Syst. Appl..

[51]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[52]  Soung Hie Kim,et al.  A Personalized Defection Detection and Prevention Procedure based on the Self-Organizing Map and Association Rule Mining: Applied to Online Game Site , 2004, Artificial Intelligence Review.

[53]  Erniel B. Barrios,et al.  Profiling Poverty with Multivariate Adaptive Regression Splines , 2009 .

[54]  Michel Ballings,et al.  Customer event history for churn prediction: How long is long enough? , 2012, Expert Syst. Appl..

[55]  João Falcão e Cunha,et al.  Modeling partial customer churn: On the value of first product-category purchase sequences , 2012, Expert systems with applications.

[56]  Charles X. Ling,et al.  Using AUC and accuracy in evaluating learning algorithms , 2005, IEEE Transactions on Knowledge and Data Engineering.

[57]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[58]  Robert A. Peterson,et al.  Relationship marketing and the consumer , 1995 .

[59]  Dirk Van den Poel,et al.  Faculteit Economie En Bedrijfskunde Hoveniersberg 24 B-9000 Gent Predicting Mail-order Repeat Buying: Which Variables Matter? Predicting Mail-order Repeat Buying: Which Variables Matter? , 2022 .

[60]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[61]  Asifullah Khan,et al.  Churn prediction in telecom using Random Forest and PSO based data balancing in combination with various feature selection strategies , 2012, Comput. Electr. Eng..

[62]  Minghe Sun,et al.  A hierarchical multiple kernel support vector machine for customer churn prediction using longitudinal behavioral data , 2012, Eur. J. Oper. Res..

[63]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[64]  Bart Baesens,et al.  New insights into churn prediction in the telecommunication sector: A profit driven data mining approach , 2012, Eur. J. Oper. Res..

[65]  D. Quinn Mills,et al.  Customer Management as the Origin of Collaborative Customer Relationship Management , 2004 .

[66]  João Falcão e Cunha,et al.  Predicting partial customer churn using Markov for discrimination for modeling first purchase sequences , 2012, Adv. Data Anal. Classif..

[67]  M. Tahar Kechadi,et al.  Customer churn prediction in telecommunications , 2012, Expert Syst. Appl..

[68]  Michael J. A. Berry,et al.  Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management , 2004 .

[69]  Tammo H. A. Bijmolt,et al.  Staying Power of Churn Prediction Models , 2010 .

[70]  Gwo-Hshiung Tzeng,et al.  Combined rough set theory and flow network graph to predict customer churn in credit card accounts , 2011, Expert Syst. Appl..

[71]  Dominique Haughton,et al.  Application of multiple adaptive regression splines (MARS) in direct response modeling , 2002 .

[72]  Yong Shi,et al.  Credit card churn forecasting by logistic regression and decision tree , 2011, Expert Syst. Appl..