Dynamic Programming Models for Maximizing Customer Lifetime Value: An Overview

Customer lifetime value (CLV) is the most reliable indicator in direct marketing for measuring the profitability of the customers. This motivated the researchers to compete in building models to maximize CLV and consequently, enhancing the firm, and the customer relationship. This review paper analyzes the contributions of applying dynamic programming models in the area of direct marketing, to maximize CLV. It starts by reviewing the basic models that focused on calculating CLV, measuring it, simulating, optimizing it or -rarely- maximizing its value. Then highlighting the dynamic programming models including, Markov Decision Process (MDP), Approximate Dynamic Programming (ADP), also called Reinforcement Learning (RL), Deep RL and Double Deep RL. Although, MDP contributed significantly in the area of maximizing CLV, it has many limitations that encouraged researchers to utilize ADP (i.e. RL) and recently deep reinforcement learning (i.e. deep Q network). These algorithms overcame the limitations of MDP and were able to solve complex problems without suffering from the curse of dimensionality problem, however they still have some limitations including, overestimating the action values. This was the main motivation behind proposing double deep Q networks (DDQN). Meanwhile, neither DDQN nor the algorithms that outperformed it and overcame its limitations were applied in the area of direct marketing and this leaves a space for future research directions.

[1]  Krzysztof Kluza,et al.  Customer Simulation for Direct Marketing Experiments , 2016, 2016 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[2]  Bo An,et al.  Impression Allocation for Combating Fraud in E-commerce Via Deep Reinforcement Learning with Action Norm Penalty , 2018, IJCAI.

[3]  Javier García,et al.  A comprehensive survey on safe reinforcement learning , 2015, J. Mach. Learn. Res..

[4]  David Silver,et al.  Concurrent Reinforcement Learning from Customer Interactions , 2013, ICML.

[5]  Jose M. Vidal-Sanz,et al.  The Impact of Forward-Looking Metrics on Employee Decision-Making: The Case of Customer Lifetime Value , 2016 .

[6]  Duncan Simester,et al.  Field Experiments in Marketing , 2017 .

[7]  Carl F. Mela,et al.  Choice Models and Customer Relationship Management , 2005 .

[8]  Modeling Strategic Opportunities in Product-Mix Strategy: A Customer- Versus Product-Oriented Perspective , 2014 .

[9]  Abdulkadir Hiziroglu,et al.  Investigating Two Customer Lifetime Value Models from Segmentation Perspective , 2012 .

[10]  Mohamed A. Nour,et al.  An integrative framework for customer relationship management: towards a systems view , 2012, Int. J. Bus. Inf. Syst..

[11]  Mohammad Jafar Tarokh,et al.  A New Model to Speculate CLV Based on Markov Chain Model , 2017 .

[12]  Marco Wiering,et al.  Markov Decision Processes: Concepts and Algorithms , 2012 .

[13]  P. Seetharaman,et al.  Assessing lifetime profitability of customers with purchasing cycles , 2018 .

[14]  Robert C. Blattberg,et al.  Can We Predict Customer Lifetime Value? Can We Predict Customer Lifetime Value , 2004 .

[15]  S. Verma Effectiveness of social network sites for influencing consumer purchase decisions , 2013 .

[16]  Ana Fernández del Río,et al.  Customer Lifetime Value in Video Games Using Deep Learning and Parametric Models , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[17]  Bram van Ginneken,et al.  A survey on deep learning in medical image analysis , 2017, Medical Image Anal..

[18]  Renato Paes Leme,et al.  Dynamic Auctions with Bank Accounts , 2016, IJCAI.

[19]  M. S. Kahreh,et al.  Analyzing the Applications of Customer Lifetime Value (CLV) based on Benefit Segmentation for the Banking Sector , 2014 .

[20]  W. Reinartz,et al.  Balancing Acquisition and Retention Resources to Maximize Customer Profitability , 2005 .

[21]  P. Verhoef,et al.  CRM in Data-Rich Multichannel Retailing Environments: A Review and Future Research Directions , 2010 .

[22]  Kiyana Zolfaghar,et al.  Estimating customer lifetime value based on RFM analysis of customer purchase behavior: Case study , 2011, WCIT.

[23]  Kerstin Reimer,et al.  How Online Consumer Segments Differ in Long-term Marketing Effectiveness , 2014 .

[24]  Florian von Wangenheim,et al.  Instant Customer Base Analysis: Managerial Heuristics Often “Get it Right”: , 2008 .

[25]  John N. Tsitsiklis,et al.  Bias and Variance Approximation in Value Function Estimates , 2007, Manag. Sci..

[26]  Andreas M. Kaplan,et al.  A Model to Determine Customer Lifetime Value in a Retail Banking Context , 2007 .

[27]  J. Andrew Petersen,et al.  Driving Profitability by Encouraging Customer Referrals: Who, When, and How , 2010 .

[28]  M. Modarres,et al.  Developing an integrated revenue management and customer relationship management approach in the hotel industry , 2015 .

[29]  Warren B. Powell,et al.  Approximate dynamic programming: Lessons from the field , 2008, 2008 Winter Simulation Conference.

[30]  Tammo H. A. Bijmolt,et al.  Analytics for Customer Engagement , 2010 .

[31]  Warren B. Powell,et al.  An Approximate Dynamic Programming Algorithm for Monotone Value Functions , 2014, Oper. Res..

[32]  Sunil Gupta,et al.  Customer Metrics and Their Impact on Financial Performance , 2006 .

[33]  Hyunseok Hwang A Stochastic Approach for Valuing Customers: A Case Study , 2016 .

[34]  Philip S. Thomas,et al.  Ad Recommendation Systems for Life-Time Value Optimization , 2015, WWW.

[35]  Dominique M. Hanssens,et al.  Modeling Customer Lifetime Value , 2006 .

[36]  Kevin D. Glazebrook,et al.  Developing Effective Service Policies for Multiclass Queues with Abandonment: Asymptotic Optimality and Approximate Policy Improvement , 2016, INFORMS J. Comput..

[37]  R. Nasution,et al.  The Customer Experience Framework as Baseline for Strategy and Implementation in Services Marketing , 2014 .

[38]  V. Kumar,et al.  Customer lifetime value approaches and best practice applications , 2004 .

[39]  D. Jain,et al.  Customer lifetime value research in marketing: A review and future directions , 2002 .

[40]  Xi Chen,et al.  Quantitative models for direct marketing: A review from systems perspective , 2009, Eur. J. Oper. Res..

[41]  A. Gelman Objections to Bayesian statistics , 2008 .

[42]  V. Kumar,et al.  Do loyal customers really pay more for services? , 2017 .

[43]  Yeliz Ekinci,et al.  Using customer lifetime value to plan optimal promotions , 2014 .

[44]  Oded Netzer,et al.  Dynamic Targeted Pricing in B2B Relationships , 2014, Mark. Sci..

[45]  Scott A. Neslin,et al.  Overcoming the “recency trap” in customer relationship management , 2013 .

[46]  Takayoshi Tamura,et al.  New approximate dynamic programming algorithms for large-scale undiscounted Markov decision processes and their application to optimize a production and distribution system , 2016, Eur. J. Oper. Res..

[47]  Yan Xu,et al.  Optimizing Quantiles in Preference-Based Markov Decision Processes , 2016, AAAI.

[48]  Lu Hou,et al.  A nonhomogeneous hidden Markov model of response dynamics and mailing optimization in direct marketing , 2016, Eur. J. Oper. Res..

[49]  K.-K. Wong,et al.  Customer lifetime value: stochastic optimization approach , 2004, J. Oper. Res. Soc..

[50]  Sahar F. Sabbeh,et al.  Machine-Learning Techniques for Customer Retention: A Comparative Study , 2018 .

[51]  Warren B. Powell,et al.  What you should know about approximate dynamic programming , 2009, Naval Research Logistics (NRL).

[52]  Ming Ma,et al.  Phase-type distribution of customer relationship with Markovian response and marketing expenditure decision on the customer lifetime value , 2008, Eur. J. Oper. Res..

[53]  Chi-Bin Cheng,et al.  Customer lifetime value prediction by a Markov chain based data mining model: Application to an auto repair and maintenance company in Taiwan , 2012 .

[54]  Mathias Klier,et al.  Customer lifetime network value: customer valuation in the context of network effects , 2017, Electronic Markets.

[55]  Sergey Levine,et al.  QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation , 2018, CoRL.

[56]  Alexander S. Poznyak,et al.  Simple computing of the customer lifetime value: A fixed local-optimal policy approach , 2014, Journal of Systems Science and Systems Engineering.

[57]  Christian Bauckhage,et al.  Customer Lifetime Value Prediction in Non-Contractual Freemium Settings: Chasing High-Value Users Using Deep Neural Networks and SMOTE , 2018, HICSS.

[58]  Philip S. Thomas,et al.  Personalized Ad Recommendation Systems for Life-Time Value Optimization with Guarantees , 2015, IJCAI.

[59]  Yeliz Ekinci,et al.  Analysis of customer lifetime value and marketing expenditure decisions through a Markovian-based model , 2014, Eur. J. Oper. Res..

[60]  Andrew W. H. Ip,et al.  A dynamic decision support system to predict the value of customer for new product development , 2011, Decis. Support Syst..

[61]  Mercedes Esteban-Bravo,et al.  Valuing Customer Portfolios with Endogenous Mass and Direct Marketing Interventions Using a Stochastic Dynamic Programming Decomposition , 2014, Mark. Sci..

[62]  Ilaria Dalla Pozza,et al.  The antecedents of customer lifetime duration and discounted expected transactions: Discrete-time based transaction data analysis , 2014 .

[63]  John N. Tsitsiklis,et al.  Dynamic Catalog Mailing Policies , 2006, Manag. Sci..

[64]  M. H. Alavidoost,et al.  A CLV-Based Framework to Prioritize Promotion Marketing Strategies: A Case Study of Telecom Industry , 2018 .

[65]  Warren B. Powell,et al.  Optimal Hour-Ahead Bidding in the Real-Time Electricity Market with Battery Storage Using Approximate Dynamic Programming , 2014, INFORMS J. Comput..

[66]  V. Kumar,et al.  Unprofitable Cross-Buying: Evidence from Consumer and Business Markets , 2012 .

[67]  Peter S. Fader,et al.  Customer-base analysis using repeated cross-sectional summary (RCSS) data , 2016, Eur. J. Oper. Res..

[68]  Amir Albadvi,et al.  Customer Lifetime Value: Literature Scoping Map, and an Agenda for Future Research , 2014 .

[69]  Derong Liu,et al.  Adaptive Dynamic Programming for Optimal Tracking Control of Unknown Nonlinear Systems With Application to Coal Gasification , 2014, IEEE Transactions on Automation Science and Engineering.

[70]  Rajkumar Venkatesan,et al.  A Customer Lifetime Value Framework for Customer Selection and Resource Allocation Strategy , 2004 .

[71]  André Elisseeff,et al.  The 2005 ISMS Practice Prize Winner---Customer Equity and Lifetime Management (CELM) Finnair Case Study , 2007 .

[72]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[73]  Roland T. Rust,et al.  Will the frog change into a prince? Predicting future customer profitability , 2011 .

[74]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[75]  Jie Chen,et al.  Robust Model-Based Fault Diagnosis for Dynamic Systems , 1998, The International Series on Asian Studies in Computer and Information Science.

[76]  Robert Klein,et al.  Maximizing customer equity subject to capacity constraints , 2015 .

[77]  M. Haenlein,et al.  Managing Customer Relationships in the Social Media Era: Introducing the Social CRM House , 2013 .

[78]  Rajkumar Venkatesan,et al.  Optimal Customer Relationship Management Using Bayesian Decision Theory: An Application for Customer Selection , 2007 .

[79]  Dimitris Bertsimas,et al.  A Learning Approach for Interactive Marketing to a Customer Segment , 2007, Oper. Res..

[80]  Luigi Atzori,et al.  OTT-ISP joint service management: A Customer Lifetime Value based approach , 2017, 2017 IFIP/IEEE Symposium on Integrated Network and Service Management (IM).