Learning to Rank for Uplift Modeling

Uplift modeling has effectively been used in fields such as marketing and customer retention, to target those customers that are most likely to respond due to the campaign or treatment. Uplift models produce uplift scores which are then used to essentially create a ranking. We instead investigate to learn to rank directly by looking into the potential of learning-to-rank techniques in the context of uplift modeling. We propose a unified formalisation of different global uplift modeling measures in use today and explore how these can be integrated into the learning-to-rank framework. Additionally, we introduce a new metric for learning-to-rank that focusses on optimizing the area under the uplift curve called the promoted cumulative gain (PCG). We employ the learning-to-rank technique LambdaMART to optimize the ranking according to PCG and show improved results over standard learning-to-rank metrics and equal to improved results when compared with state-of-the-art uplift modeling. Finally, we show how learning-to-rank models can learn to optimize a certain targeting depth, however, these results do not generalize on the test set.

[1]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[2]  Tie-Yan Liu,et al.  Ranking Measures and Loss Functions in Learning to Rank , 2009, NIPS.

[3]  Szymon Jaroszewicz,et al.  Lp\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_p$$\end{document}-Support vector machines for uplift modeling , 2017, Knowledge and Information Systems.

[4]  Patrick D. Surry,et al.  Real-World Uplift Modelling with Significance-Based Uplift Trees , 2012 .

[5]  Gregory N. Hullender,et al.  Learning to rank using gradient descent , 2005, ICML.

[6]  Nicholas Radcliffe,et al.  Using control groups to target on predicted lift: Building and assessing uplift model , 2007 .

[7]  Leandro Axel Guelman Optimal personalized treatment learning models with insurance applications , 2015 .

[8]  Uri Shalit,et al.  Estimating individual treatment effect: generalization bounds and algorithms , 2016, ICML.

[9]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[10]  Kathleen Kane,et al.  Mining for the truly responsive customers and prospects using true-lift modeling: Comparison of new and existing methods , 2014 .

[11]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[12]  David Page,et al.  Support Vector Machines for Differential Prediction , 2014, ECML/PKDD.

[13]  Jude W. Shavlik,et al.  Uplift Modeling with ROC: An SRL Case Study , 2013, ILP.

[14]  Massih-Reza Amini,et al.  Uplift Prediction with Dependent Feature Representation in Imbalanced Treatment and Control Conditions , 2018, ICONIP.

[15]  Leo Guelman,et al.  Optimal personalized treatment rules for marketing interventions: A review of methods, a new proposal, and an insurance case study , 2014 .

[16]  Behram Hansotia,et al.  Incremental value modeling , 2002 .

[17]  Szymon Jaroszewicz,et al.  Decision trees for uplift modeling with single and multiple treatments , 2011, Knowledge and Information Systems.

[18]  Tie-Yan Liu,et al.  Learning to rank for information retrieval , 2009, SIGIR.

[19]  D. Rubin Causal Inference Using Potential Outcomes , 2005 .

[20]  G. V. Kass An Exploratory Technique for Investigating Large Quantities of Categorical Data , 1980 .

[21]  Gert R. G. Lanckriet,et al.  Metric Learning to Rank , 2010, ICML.

[22]  W. Bruce Croft,et al.  Search Engines - Information Retrieval in Practice , 2009 .

[23]  Szymon Jaroszewicz,et al.  Uplift Modeling in Direct Marketing , 2012 .

[24]  Christopher J. C. Burges,et al.  From RankNet to LambdaRank to LambdaMART: An Overview , 2010 .

[25]  Szymon Jaroszewicz,et al.  Ensemble methods for uplift modeling , 2014, Data Mining and Knowledge Discovery.

[26]  Szymon Jaroszewicz,et al.  Support Vector Machines for Uplift Modeling , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[27]  L. Lai INFLUENTIAL MARKETING: A NEW DIRECT MARKETING STRATEGY ADDRESSING THE EXISTENCE OF VOLUNTARY BUYERS , 2006 .

[28]  P. Holland Statistics and Causal Inference , 1985 .

[29]  Wouter Verbeke,et al.  A Literature Survey and Experimental Evaluation of the State-of-the-Art in Uplift Modeling: A Stepping Stone Toward the Development of Prescriptive Analytics , 2018, Big Data.

[30]  Pierre Gutierrez,et al.  Causal Inference and Uplift Modelling: A Review of the Literature , 2017, PAPIs.

[31]  S. Jaroszewicz,et al.  Uplift modeling for clinical trial data , 2012 .

[32]  Szymon Jaroszewicz,et al.  Boosting algorithms for uplift modeling , 2018, ArXiv.

[33]  Stefan Lessmann,et al.  Revenue Uplift Modeling , 2017, ICIS.

[34]  Szymon Jaroszewicz,et al.  Decision Trees for Uplift Modeling , 2010, 2010 IEEE International Conference on Data Mining.

[35]  Victor S. Y. Lo The true lift model: a novel data mining approach to response modeling in database marketing , 2002, SKDD.

[36]  David Simchi-Levi,et al.  Uplift Modeling with Multiple Treatments and General Response Types , 2017, SDM.