Support Vector Machines for Differential Prediction

Machine learning is continually being applied to a growing set of fields, including the social sciences, business, and medicine. Some fields present problems that are not easily addressed using standard machine learning approaches and, in particular, there is growing interest in differential prediction. In this type of task we are interested in producing a classifier that specifically characterizes a subgroup of interest by maximizing the difference in predictive performance for some outcome between subgroups in a population. We discuss adapting maximum margin classifiers for differential prediction. We first introduce multiple approaches that do not affect the key properties of maximum margin classifiers, but which also do not directly attempt to optimize a standard measure of differential prediction. We next propose a model that directly optimizes a standard measure in this field, the uplift measure. We evaluate our models on real data from two medical applications and show excellent results.

[1]  T. Cleary TEST BIAS: PREDICTION OF GRADES OF NEGRO AND WHITE STUDENTS IN INTEGRATED COLLEGES , 1968 .

[2]  R. Linn Single-group validity, differential validity, and differential prediction. , 1978 .

[3]  K. Flegal,et al.  Differential misclassification arising from nondifferential errors in exposure measurement. , 1991, American journal of epidemiology.

[4]  D. Schultz,et al.  The influence of young age on outcome in early stage breast cancer. , 1994, International journal of radiation oncology, biology, physics.

[5]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[6]  John W. Young,et al.  Differential Validity, Differential Prediction,and College Admission Testing: A Comprehensive Review and Analysis , 2001 .

[7]  Behram Hansotia,et al.  Incremental value modeling , 2002 .

[8]  Victor S. Y. Lo The true lift model: a novel data mining approach to response modeling in database marketing , 2002, SKDD.

[9]  J. Hornaday,et al.  Cancer Facts & Figures 2004 , 2004 .

[10]  Richard J. K. Taylor,et al.  Is age at diagnosis an independent prognostic factor for survival following breast cancer? , 2005, ANZ journal of surgery.

[11]  Thorsten Joachims,et al.  A support vector method for multivariate performance measures , 2005, ICML.

[12]  P. Chyou Patterns of bias due to differential misclassification by case–control status in a case–control study , 2007, European Journal of Epidemiology.

[13]  J. Emberson,et al.  Do selective cyclo-oxygenase-2 inhibitors and traditional non-steroidal anti-inflammatory drugs increase the risk of atherothrombosis? Meta-analysis of randomised trials , 2006, BMJ : British Medical Journal.

[14]  Nicholas Radcliffe,et al.  Using control groups to target on predicted lift: Building and assessing uplift model , 2007 .

[15]  James Bailey,et al.  Feature Weighted SVMs Using Receiver Operating Characteristics , 2009, SDM.

[16]  Peter A. Flach,et al.  Evaluation Measures for Multi-class Subgroup Discovery , 2009, ECML/PKDD.

[17]  Thorsten Joachims,et al.  Cutting-plane training of structural SVMs , 2009, Machine Learning.

[18]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[19]  Stphane Tuffry,et al.  Data Mining and Statistics for Decision Making , 2011 .

[20]  Stéphane Tufféry,et al.  Data Mining and Statistics for Decision Making: Tufféry/Data Mining and Statistics for Decision Making , 2011 .

[21]  Szymon Jaroszewicz,et al.  Decision trees for uplift modeling with single and multiple treatments , 2011, Knowledge and Information Systems.

[22]  David Page,et al.  Relational Differential Prediction , 2012, ECML/PKDD.

[23]  S. Jaroszewicz,et al.  Uplift modeling for clinical trial data , 2012 .

[24]  Patrick D. Surry,et al.  Real-World Uplift Modelling with Significance-Based Uplift Trees , 2012 .

[25]  Harikrishna Narasimhan,et al.  A Structural SVM Based Approach for Optimizing Partial AUC , 2013, ICML.

[26]  V. S. Costa,et al.  A Preliminary Investigation into Predictive Models for Adverse Drug Events , 2013 .

[27]  David Page,et al.  Score As You Lift (SAYL): A Statistical Relational Learning Approach to Uplift Modeling , 2013, ECML/PKDD.

[28]  Szymon Jaroszewicz,et al.  Support Vector Machines for Uplift Modeling , 2013, 2013 IEEE 13th International Conference on Data Mining Workshops.

[29]  Stochastic Solutions Limited Identifying who can be saved and who wil l be driven away by retention activity Using uplift modelling to reduce churn in mobile telephony , .