Relational Differential Prediction

A typical classification problem involves building a model to correctly segregate instances of two or more classes. Such a model exhibits differential prediction with respect to given data subsets when its performance is significantly different over these subsets. Driven by a mammography application, we aim at learning rules that predict breast cancer stage while maximizing differential prediction over age-stratified data. In this work, we present the first multi-relational differential prediction (aka uplift modeling) system, and propose three different approaches to learn differential predictive rules within the Inductive Logic Programming framework. We first test and validate our methods on synthetic data, then apply them on a mammography dataset for breast cancer stage differential prediction rule discovery. We mine a novel rule linking calcification to in situ breast cancer in older women.

[1]  R. Linn Single-group validity, differential validity, and differential prediction. , 1978 .

[2]  Xerardo Estévez,et al.  Santiago de Compostela, Spain , 2013 .

[3]  Behram Hansotia,et al.  Incremental value modeling , 2002 .

[4]  P. Porter,et al.  Breast density as a predictor of mammographic detection: comparison of interval- and screen-detected cancers. , 2000, Journal of the National Cancer Institute.

[5]  D. Eignor The standards for educational and psychological testing. , 2013 .

[6]  F. Vicini,et al.  Age at diagnosis and outcome for women with ductal carcinoma-in-situ of the breast: a critical review of the literature. , 2002, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[7]  Janez Demsar,et al.  Statistical Comparisons of Classifiers over Multiple Data Sets , 2006, J. Mach. Learn. Res..

[8]  Jesse Davis,et al.  View Learning for Statistical Relational Learning: With an Application to Mammography , 2005, IJCAI.

[9]  Hendrik Blockeel,et al.  Top-Down Induction of First Order Logical Decision Trees , 1998, AI Commun..

[10]  Éric Gaussier,et al.  A Probabilistic Interpretation of Precision, Recall and F-Score, with Implication for Evaluation , 2005, ECIR.

[11]  P. Chyou Patterns of bias due to differential misclassification by case–control status in a case–control study , 2007, European Journal of Epidemiology.

[12]  T. Cleary TEST BIAS: PREDICTION OF GRADES OF NEGRO AND WHITE STUDENTS IN INTEGRATED COLLEGES , 1968 .

[13]  Patrick D. Surry,et al.  Differential Response Analysis: Modeling True Responses by Isolating the Effect of a Single Action , 1999 .

[14]  Deutsche Version BREAST IMAGING REPORTING AND DATA SYSTEM (BI-RADS TM ) , 2001 .

[15]  K. Flegal,et al.  Differential misclassification arising from nondifferential errors in exposure measurement. , 1991, American journal of epidemiology.

[16]  V Shane Pankratz,et al.  Age-specific trends in mammographic density: the Minnesota Breast Cancer Family Study. , 2008, American journal of epidemiology.

[17]  J. Hornaday,et al.  Cancer Facts & Figures 2004 , 2004 .

[18]  Luc De Raedt,et al.  Logical and relational learning , 2008, Cognitive Technologies.

[19]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[20]  Luc De Raedt,et al.  Logical and Relational Learning: From ILP to MRDM (Cognitive Technologies) , 2008 .

[21]  David Page,et al.  Uncovering age-specific invasive and DCIS breast cancer rules using inductive logic programming , 2010, IHI.

[22]  V. Shane Pankratz,et al.  Age-specific Trends in Mammographic Density , 2008 .

[23]  P. Sackett,et al.  Differential prediction and the use of multiple predictors: the omitted variables problem. , 2003, The Journal of applied psychology.

[24]  Victor S. Y. Lo The true lift model: a novel data mining approach to response modeling in database marketing , 2002, SKDD.

[25]  John W. Young,et al.  Differential Validity, Differential Prediction,and College Admission Testing: A Comprehensive Review and Analysis , 2001 .

[26]  Vítor Santos Costa The Life of a Logic Programming System , 2008, ICLP.

[27]  M. Pike,et al.  National Institutes of Health State-of-the-Science Conference statement: Diagnosis and Management of Ductal Carcinoma In Situ September 22-24, 2009. , 2010, Journal of the National Cancer Institute.

[28]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[29]  Kefah Mokbel,et al.  Current management of DCIS: a review , 2008, Breast Cancer Research and Treatment.

[30]  Luc De Raedt,et al.  About Knowledge and Inference in Logical and Relational Learning , 2010, Advances in Machine Learning II.

[31]  Vítor Santos Costa,et al.  Inductive Logic Programming , 2013, Lecture Notes in Computer Science.

[32]  Ryszard S. Michalski,et al.  Inductive inference of VL decision rules , 1977, SGAR.

[33]  Susan T. Dumais,et al.  A Bayesian Approach to Filtering Junk E-Mail , 1998, AAAI 1998.

[34]  Nada Lavrac,et al.  Propositionalization-based relational subgroup discovery with RSD , 2006, Machine Learning.

[35]  I. Bleiweiss,et al.  Stage 0 to stage III breast cancer in young women. , 2000, Journal of the American College of Surgeons.

[36]  Patrick D. Surry,et al.  Real-World Uplift Modelling with Significance-Based Uplift Trees , 2012 .

[37]  Szymon Jaroszewicz,et al.  Decision Trees for Uplift Modeling , 2010, 2010 IEEE International Conference on Data Mining.