Prophit: Causal inverse classification for multiple continuously valued treatment policies

Inverse classification uses an induced classifier as a queryable oracle to guide test instances towards a preferred posterior class label. The result produced from the process is a set of instance-specific feature perturbations, or recommendations, that optimally improve the probability of the class label. In this work, we adopt a causal approach to inverse classification, eliciting treatment policies (i.e., feature perturbations) for models induced with causal properties. In so doing, we solve a long-standing problem of eliciting multiple, continuously valued treatment policies, using an updated framework and corresponding set of assumptions, which we term the inverse classification potential outcomes framework (ICPOF), along with a new measure, referred to as the individual future estimated effects ($i$FEE). We also develop the approximate propensity score (APS), based on Gaussian processes, to weight treatments, much like the inverse propensity score weighting used in past works. We demonstrate the viability of our methods on student performance.

[1]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[2]  Thorsten Joachims,et al.  Recommendations as Treatments: Debiasing Learning and Evaluation , 2016, ICML.

[3]  Michael V. Mannino,et al.  The cost-minimizing inverse classification problem: a genetic algorithm approach , 2000, Decis. Support Syst..

[4]  G. Imbens,et al.  The Propensity Score with Continuous Treatments , 2005 .

[5]  Andrew Forbes,et al.  Variance reduction in randomised trials by inverse probability weighting using the propensity score , 2013, Statistics in medicine.

[6]  Victor Chernozhukov,et al.  Inference on Treatment Effects after Selection Amongst High-Dimensional Controls , 2011 .

[7]  M. Baiocchi,et al.  Instrumental variable methods for causal inference , 2014, Statistics in medicine.

[8]  Chen Yang,et al.  10-year CVD risk prediction and minimization via InverseClassification , 2012, IHI '12.

[9]  Marie Davidian,et al.  Doubly robust estimation of causal effects. , 2011, American journal of epidemiology.

[10]  J. Lunceford,et al.  Strati cation and weighting via the propensity score in estimation of causal treatment e ects : a comparative study , 2004 .

[11]  Parag C. Pendharkar A potential use of data envelopment analysis for the inverse classification problem , 2002 .

[12]  Foster J. Provost,et al.  Measuring Causal Impact of Online Actions via Natural Experiments: Application to Display Advertising , 2015, KDD.

[13]  Thorsten Joachims,et al.  Counterfactual Risk Minimization: Learning from Logged Bandit Feedback , 2015, ICML.

[14]  William Nick Street,et al.  A Budget-Constrained Inverse Classification Framework for Smooth Classifiers , 2016, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[15]  J. Lunceford,et al.  Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study , 2004, Statistics in medicine.

[16]  Uri Shalit,et al.  Estimating individual treatment effect: generalization bounds and algorithms , 2016, ICML.

[17]  C. Chen,et al.  The Inverse Classiflcation Problem , 2010 .

[18]  Paul R. Rosenbaum,et al.  Optimal Matching for Observational Studies , 1989 .

[19]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[20]  Rajeev Dehejia,et al.  Propensity Score-Matching Methods for Nonexperimental Causal Studies , 2002, Review of Economics and Statistics.

[21]  David R. Musicant,et al.  Understanding Support Vector Machine Classifications via a Recommender System-Like Approach , 2009, DMIN.

[22]  J. Robins,et al.  Doubly Robust Estimation in Missing Data and Causal Inference Models , 2005, Biometrics.

[23]  Mihaela van der Schaar,et al.  Bayesian Inference of Individualized Treatment Effects using Multi-task Gaussian Processes , 2017, NIPS.

[24]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[25]  Uri Shalit,et al.  Learning Representations for Counterfactual Inference , 2016, ICML.

[26]  Angelina A. Tzacheva,et al.  Discovery of Action Rules at Lowest Cost in Spark , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[27]  Tong Wang,et al.  Causal Rule Sets for Identifying Subgroups with Enhanced Treatment Effect , 2017, INFORMS J. Comput..

[28]  Chih-Lin Chi,et al.  Individualized Patient-centered Lifestyle Recommendations: an Expert System for Communicating Patient Specific Cardiovascular Risk Information and Prioritizing Lifestyle Options , 2022 .

[29]  Stefan Wager,et al.  Estimation and Inference of Heterogeneous Treatment Effects using Random Forests , 2015, Journal of the American Statistical Association.

[30]  J. Robins,et al.  Instruments for Causal Inference: An Epidemiologist's Dream? , 2006, Epidemiology.