Logical Differential Prediction Bayes Net, improving breast cancer diagnosis for older women

Overdiagnosis is a phenomenon in which screening identifies cancer that may never go on to cause symptoms or death. Women over 65 who develop breast cancer bear the heaviest burden of overdiagnosis. This work introduces novel machine learning algorithms to improve the diagnostic accuracy of breast cancer in aging populations. At the same time, we aim to minimize unnecessary invasive procedures (thus decreasing false positives) and concomitantly address overdiagnosis. We develop a novel algorithm, Logical Differential Prediction Bayes Net (LDP-BN), that calculates the risk of breast disease based on mammography findings. LDP-BN uses Inductive Logic Programming (ILP) to learn relational rules, selects older-specific differentially predictive rules, and incorporates them into a Bayes Net, significantly improving its performance. In addition, LDP-BN offers valuable insight into the classification process, revealing novel older-specific rules that link mass presence to invasive cancer, and calcification presence with no detectable mass to ductal carcinoma in situ (DCIS).
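The rule-selection step can be illustrated with a minimal sketch. It is not the authors' implementation: the rules are hand-written stand-ins for ILP output, the score (rule precision on the older stratum minus precision on the younger stratum) is one plausible reading of "differentially predictive", and the records, the `Rule` class, the function names, and the 0.2 threshold are all hypothetical. The selected rules are added as binary features, which a Bayes Net learner could then consume.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Record = Dict[str, object]


@dataclass
class Rule:
    """A hand-written stand-in for a rule an ILP system might learn."""
    name: str
    covers: Callable[[Record], bool]


def precision(rule: Rule, records: List[Record], target: str = "invasive") -> float:
    """Fraction of records covered by the rule whose label equals target."""
    covered = [r for r in records if rule.covers(r)]
    if not covered:
        return 0.0
    return sum(r["label"] == target for r in covered) / len(covered)


def differential_score(rule: Rule, older: List[Record], younger: List[Record]) -> float:
    """Assumed score: precision on older women minus precision on younger women."""
    return precision(rule, older) - precision(rule, younger)


def select_rules(rules: List[Rule], older: List[Record], younger: List[Record],
                 threshold: float = 0.2) -> List[Rule]:
    """Keep rules that predict markedly better in the older stratum."""
    return [r for r in rules if differential_score(r, older, younger) >= threshold]


def add_rule_features(record: Record, rules: List[Rule]) -> Record:
    """Augment a record with one boolean feature per selected rule."""
    out = dict(record)
    for r in rules:
        out[f"rule_{r.name}"] = r.covers(record)
    return out


# Invented toy records, for illustration only (not data from the study).
older = [
    {"mass": True, "calc": False, "label": "invasive"},
    {"mass": True, "calc": False, "label": "invasive"},
    {"mass": False, "calc": True, "label": "dcis"},
    {"mass": True, "calc": True, "label": "dcis"},
]
younger = [
    {"mass": True, "calc": False, "label": "invasive"},
    {"mass": True, "calc": False, "label": "dcis"},
    {"mass": True, "calc": True, "label": "dcis"},
    {"mass": False, "calc": True, "label": "invasive"},
]

rules = [
    Rule("mass_present", lambda r: bool(r["mass"])),
    Rule("calc_present", lambda r: bool(r["calc"])),
]

selected = select_rules(rules, older, younger)
augmented_older = [add_rule_features(r, selected) for r in older]
```

In this sketch the augmented records would be passed to a Bayes Net learner as additional binary attributes. A real pipeline would replace the hand-written rules with clauses learned by an ILP system and would likely use a more careful differential score evaluated on held-out data.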
