Why you should stop predicting customer churn and start using uplift models

Abstract Uplift modeling has received increasing interest in both the business analytics research community and the industry as an improved paradigm for predictive analytics for data-driven operational decision-making. The literature, however, does not provide conclusive empirical evidence that uplift modeling outperforms predictive modeling. Case studies that directly compare both approaches are lacking, and the performance of predictive models and uplift models as reported in various experimental studies cannot be compared indirectly since different evaluation measures are used to assess their performance. Therefore, in this paper, we introduce a novel evaluation metric called the maximum profit uplift (MPU) measure that allows assessing the performance in terms of the maximum potential profit that can be achieved by adopting an uplift model. This measure, developed for evaluating customer churn uplift models, extends the maximum profit measure for evaluating customer churn prediction models. While introducing the MPU measure, we describe the generally applicable liftup curve and liftup measure for evaluating uplift models as counterparts of the lift curve and lift measure that are broadly used to evaluate predictive models. These measures are subsequently applied to assess and compare the performance of customer churn prediction and uplift models in a case study that applies uplift modeling to customer retention in the financial industry. We observe that uplift models outperform predictive models and lead to improved profitability of retention campaigns.

[1]  Patrick D. Surry,et al.  Real-World Uplift Modelling with Significance-Based Uplift Trees , 2012 .

[2]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[3]  Kathleen Kane,et al.  Mining for the truly responsive customers and prospects using true-lift modeling: Comparison of new and existing methods , 2014 .

[4]  L. Lai INFLUENTIAL MARKETING: A NEW DIRECT MARKETING STRATEGY ADDRESSING THE EXISTENCE OF VOLUNTARY BUYERS , 2006 .

[5]  David Maxwell Chickering,et al.  A Decision Theoretic Approach to Targeted Advertising , 2000, UAI.

[6]  G. V. Kass An Exploratory Technique for Investigating Large Quantities of Categorical Data , 1980 .

[7]  Nicholas Radcliffe,et al.  Using control groups to target on predicted lift: Building and assessing uplift model , 2007 .

[8]  Bart Baesens,et al.  New Insights into Churn Prediction in the Telco Sector , 2012 .

[9]  P. Holland Statistics and Causal Inference , 1985 .

[10]  Kristof Coussement,et al.  Customer churn prediction in the online gambling industry: The beneficial effect of ensemble learning , 2013 .

[11]  CoussementKristof,et al.  A comparative analysis of data preparation algorithms for customer churn prediction , 2017 .

[12]  David C. Yen,et al.  Applying data mining to telecom churn management , 2006, Expert Syst. Appl..

[13]  S. Jaroszewicz,et al.  Uplift modeling for clinical trial data , 2012 .

[14]  Minghe Sun,et al.  A hierarchical multiple kernel support vector machine for customer churn prediction using longitudinal behavioral data , 2012, Eur. J. Oper. Res..

[15]  D. Rubin Causal Inference Using Potential Outcomes , 2005 .

[16]  Szymon Jaroszewicz,et al.  Decision trees for uplift modeling with single and multiple treatments , 2011, Knowledge and Information Systems.

[17]  Dirk Van den Poel,et al.  Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting , 2005, Eur. J. Oper. Res..

[18]  Talel Abdessalem,et al.  Pessimistic Uplift Modeling , 2016, KDD 2016.

[19]  Stefan Lessmann,et al.  A comparative analysis of data preparation algorithms for customer churn prediction: A case study in the telecommunication industry , 2017, Decis. Support Syst..

[20]  Bart Baesens,et al.  Building comprehensible customer churn prediction models with advanced rule induction techniques , 2011, Expert Syst. Appl..

[21]  D. Sculley,et al.  Hidden Technical Debt in Machine Learning Systems , 2015, NIPS.

[22]  Bart Baesens,et al.  Predicting time-to-churn of prepaid mobile telephone customers using social network analysis , 2016, J. Oper. Res. Soc..

[23]  Leo Guelman,et al.  Random Forests for Uplift Modeling: An Insurance Customer Retention Case , 2012, MS.

[24]  Bart Baesens,et al.  Analytics in a Big Data World: The Essential Guide to Data Science and its Applications , 2014 .

[25]  Bart Baesens,et al.  Domain knowledge integration in data mining using decision tables: case studies in churn prediction , 2009, J. Oper. Res. Soc..

[26]  Bart Baesens,et al.  Data Mining Techniques for Software Effort Estimation: A Comparative Study , 2012, IEEE Transactions on Software Engineering.

[27]  Bj Hansotia,et al.  Direct marketing for multichannel retailers: Issues, challenges and solutions , 2002 .

[28]  Eva Ascarza Retention Futility: Targeting High-Risk Customers Might be Ineffective , 2018 .

[29]  Bin Li,et al.  Automated Cellular Modeling and Prediction on a Large Scale , 2000, Artificial Intelligence Review.

[30]  Jaishankar Ganesh,et al.  Understanding the Customer Base of Service Providers: An Examination of the Differences between Switchers and Stayers , 2000, Journal of Marketing.

[31]  Bart Baesens,et al.  Development and application of consumer credit scoring models using profit-based classification measures , 2014, Eur. J. Oper. Res..

[32]  R. Rust,et al.  Customer satisfaction, customer retention, and market share , 1993 .

[33]  Wagner A. Kamakura,et al.  Defection Detection: Measuring and Understanding the Predictive Accuracy of Customer Churn Models , 2006 .

[34]  Victor S. Y. Lo The true lift model: a novel data mining approach to response modeling in database marketing , 2002, SKDD.

[35]  P. Danaher,et al.  Implementing a customer relationship strategy: The asymmetric impact of poor versus excellent execution , 2000 .

[36]  Véronique Van Vlasselaer,et al.  Fraud Analytics : Using Descriptive, Predictive, and Social Network Techniques:A Guide to Data Science for Fraud Detection , 2015 .

[37]  J. Suykens,et al.  Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research , 2015, Eur. J. Oper. Res..

[38]  Bart Baesens,et al.  A Novel Profit Maximizing Metric for Measuring Classification Performance of Customer Churn Prediction Models , 2013, IEEE Transactions on Knowledge and Data Engineering.

[39]  Bart Baesens,et al.  Profit Driven Business Analytics: A Practitioner's Guide to Transforming Big Data into Added Value , 2017 .

[41]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[42]  Behram Hansotia,et al.  Incremental value modeling , 2002 .

[43]  Leo Guelman,et al.  Optimal personalized treatment rules for marketing interventions: A review of methods, a new proposal, and an insurance case study , 2014 .

[44]  Wouter Verbeke,et al.  A Literature Survey and Experimental Evaluation of the State-of-the-Art in Uplift Modeling: A Stepping Stone Toward the Development of Prescriptive Analytics , 2018, Big Data.

[45]  Dirk Van den Poel,et al.  Customer attrition analysis for financial services using proportional hazard models , 2004, Eur. J. Oper. Res..

[46]  Bart Baesens,et al.  Social network analysis for customer churn prediction , 2014, Appl. Soft Comput..