Bayesian Networks for Clinical Decision Support in Lung Cancer Care

Survival prediction and treatment selection in lung cancer care are characterised by high levels of uncertainty. Bayesian Networks (BNs), which naturally reason with uncertain domain knowledge, can be applied to aid lung cancer experts by providing personalised survival estimates and treatment selection recommendations. Based on the English Lung Cancer Database (LUCADA), we evaluate the feasibility of BNs for these two tasks, while comparing the performances of various causal discovery approaches to uncover the most feasible network structure from expert knowledge and data. We show first that the BN structure elicited from clinicians achieves a disappointing area under the ROC curve of 0.75 (± 0.03), whereas a structure learned by the CAMML hybrid causal discovery algorithm, which adheres with the temporal restrictions, achieves 0.81 (± 0.03). Second, our causal intervention results reveal that BN treatment recommendations, based on prescribing the treatment plan that maximises survival, can only predict the recorded treatment plan 29% of the time. However, this percentage rises to 76% when partial matches are included.

[1]  M. Wegman International classification of diseases. , 1959, Pediatrics.

[2]  Steen Andreassen,et al.  MUNIN - A Causal Probabilistic Network for Interpretation of Electromyographic Findings , 1987, IJCAI.

[3]  David J. Spiegelhalter,et al.  Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[4]  Ross D. Shachter Probabilistic Inference and Influence Diagrams , 1988, Oper. Res..

[5]  C. Muir,et al.  International Classification of Diseases for Oncology , 1990 .

[6]  Steffen L. Lauritzen,et al.  Bayesian updating in causal probabilistic networks by local computations , 1990 .

[7]  Wray L. Buntine Theory Refinement on Bayesian Networks , 1991, UAI.

[8]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[9]  Judea Pearl,et al.  An Algorithm for Deciding if a Set of Observed Independencies Has a Causal Explanation , 1992, UAI.

[10]  D. Heckerman,et al.  Toward Normative Expert Systems: Part I The Pathfinder Project , 1992, Methods of Information in Medicine.

[11]  S. Cessie,et al.  Ridge Estimators in Logistic Regression , 1992 .

[12]  David J. Spiegelhalter,et al.  Bayesian analysis in expert systems , 1993 .

[13]  J. York,et al.  Bayesian Graphical Models for Discrete Data , 1995 .

[14]  J. Ross Quinlan,et al.  Improved Use of Continuous Attributes in C4.5 , 1996, J. Artif. Intell. Res..

[15]  Kevin B. Korb,et al.  Causal Discovery via MML , 1996, ICML.

[16]  Ross D. Shachter,et al.  Representation and Analysis of Medical Decision Problems with Influence Diagrams , 1997, Medical decision making : an international journal of the Society for Medical Decision Making.

[17]  David D. Lewis,et al.  Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval , 1998, ECML.

[18]  J. Zhang,et al.  What's the relative risk? A method of correcting the odds ratio in cohort studies of common outcomes. , 1998, JAMA.

[19]  Judea Pearl,et al.  Bayesian Networks , 1998, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..

[20]  P. Lucas,et al.  Computer-based Decision Support in the Management of Primary Gastric non-Hodgkin Lymphoma , 1998, Methods of Information in Medicine.

[21]  Nada Lavrac,et al.  Selected techniques for data mining in medicine , 1999, Artif. Intell. Medicine.

[22]  Steen Andreassen,et al.  Using probabilistic and decision-theoretic methods in treatment and prognosis modeling , 1999, Artif. Intell. Medicine.

[23]  Kevin Murphy,et al.  Bayes net toolbox for Matlab , 1999 .

[24]  David E. Booth,et al.  Analysis of Incomplete Multivariate Data , 2000, Technometrics.

[25]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .

[26]  Peter J. F. Lucas,et al.  A probabilistic and decision-theoretic approach to the management of infectious disease at the ICU , 2000, Artif. Intell. Medicine.

[27]  D. Ettinger,et al.  NCCN: Non-small cell lung cancer. , 2001, Cancer control : journal of the Moffitt Cancer Center.

[28]  Thomas J. Watson,et al.  An empirical study of the naive Bayes classifier , 2001 .

[29]  Igor Kononenko,et al.  Machine learning for medical diagnosis: history, state of the art and perspective , 2001, Artif. Intell. Medicine.

[30]  José Mira Mira,et al.  NasoNet, Joining Bayesian Networks and Time to Model Nasopharyngeal Cancer Spread , 2001, AIME.

[31]  Linda C. van der Gaag,et al.  Building Bayesian Networks through Ontologies , 2002, ECAI.

[32]  Silja Renooij,et al.  Probabilities for a probabilistic network: a case study in oesophageal cancer , 2002, Artif. Intell. Medicine.

[33]  Kevin B. Korb,et al.  Bayesian Artificial Intelligence , 2004, Computer science and data analysis series.

[34]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine Learning.

[35]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[36]  Peter J. F. Lucas,et al.  Bayesian networks in biomedicine and health-care , 2004, Artif. Intell. Medicine.

[37]  Tom. Mitchell GENERATIVE AND DISCRIMINATIVE CLASSIFIERS: NAIVE BAYES AND LOGISTIC REGRESSION Machine Learning , 2005 .

[38]  Kevin B. Korb,et al.  Knowledge Engineering Cardiovascular Bayesian Networks from the Literature , 2005 .

[39]  Daniel Zelterman,et al.  Bayesian Artificial Intelligence , 2005, Technometrics.

[40]  Ross D. Shachter,et al.  A Bayesian Network to Assist Mammography Interpretation , 2005 .

[41]  Susan Catt,et al.  Multidisciplinary teams in cancer care: are they effective in the UK? , 2006, The Lancet. Oncology.

[42]  Bart De Moor,et al.  Predicting the prognosis of breast cancer by integrating clinical and microarray data with Bayesian networks , 2006, ISMB.

[43]  Vic Hasselblad,et al.  Can one assess whether missing data are missing at random in medical studies? , 2006, Statistical methods in medical research.

[44]  B. Marcot,et al.  Using Bayesian belief networks in adaptive management , 2006 .

[45]  Dimitris Kanellopoulos,et al.  Data Preprocessing for Supervised Leaning , 2007 .

[46]  Remco R. Bouckaert,et al.  Bayesian Network Classifiers in Weka for Version 3-5-7 , 2007 .

[47]  David S. Wishart,et al.  Applications of Machine Learning in Cancer Prediction and Prognosis , 2006, Cancer informatics.

[48]  Nicandro Cruz-Ramírez,et al.  Diagnosis of breast cancer using Bayesian networks: A case study , 2007, Comput. Biol. Medicine.

[49]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1999, Innovations in Bayesian Networks.

[50]  Peter J. Haug,et al.  Exploiting missing clinical data in Bayesian network modeling for predicting medical problems , 2008, J. Biomed. Informatics.

[51]  Hai Hu,et al.  A Bayesian derived network of breast pathology co-occurrence , 2008, J. Biomed. Informatics.

[52]  Erol Sarigul,et al.  Exploring Bayesian networks for automated breast cancer detection , 2009, IEEE Southeastcon 2009.

[53]  Eugene R. Tracy,et al.  A Bayesian network approach to feature selection in mass spectrometry data , 2010, BMC Bioinformatics.

[54]  D. Gur,et al.  Development of a clinical decision model for thyroid nodules , 2009, BMC surgery.

[55]  Glenn Fung,et al.  Survival Prediction in Lung Cancer Treated with Radiotherapy: Bayesian Networks vs. Support Vector Machines in Handling Missing Data , 2009, 2009 International Conference on Machine Learning and Applications.

[56]  J. Graham,et al.  Missing data analysis: making it work in the real world. , 2009, Annual review of psychology.

[57]  D De Ruysscher,et al.  Comparison of Bayesian network and support vector machine models for two-year survival prediction in lung cancer patients treated with radiotherapy. , 2010, Medical physics.

[58]  E. Felip,et al.  Early stage and locally advanced (non-metastatic) non-small-cell lung cancer: ESMO Clinical Practice Guidelines for diagnosis, treatment and follow-up. , 2010, Annals of oncology : official journal of the European Society for Medical Oncology.

[59]  K. Kerr,et al.  Guidelines on the radical management of patients with lung cancer , 2010, Thorax.

[60]  Qingxia Chen,et al.  Missing covariate data in medical research: to impute is better than to ignore. , 2010, Journal of clinical epidemiology.

[61]  L. Holmberg,et al.  National comparisons of lung cancer survival in England, Norway and Sweden 2001–2004: differences occur early in follow-up , 2010, Thorax.

[62]  Kevin B. Korb,et al.  Bayesian Artificial Intelligence, Second Edition , 2010 .

[63]  Danica Kragic,et al.  Multivariate discretization for Bayesian Network structure learning in robot grasping , 2011, 2011 IEEE International Conference on Robotics and Automation.

[64]  Qiang Shen,et al.  Learning Bayesian networks: approaches and issues , 2011, The Knowledge Engineering Review.

[65]  Kevin B. Korb,et al.  Incorporating expert knowledge when learning Bayesian network structure: A medical case study , 2011, Artif. Intell. Medicine.

[66]  John H. Healey,et al.  Estimating Survival in Patients with Operable Skeletal Metastases: An Application of a Bayesian Belief Network , 2011, PloS one.

[67]  B Rachet,et al.  Cancer survival in Australia, Canada, Denmark, Norway, Sweden, and the UK, 1995–2007 (the International Cancer Benchmarking Partnership): an analysis of population-based cancer registry data , 2011, Lancet.

[68]  Joseph O Deasy,et al.  A Bayesian network approach for modeling local failure in lung cancer , 2011, Physics in medicine and biology.

[69]  Di Zhao,et al.  Combining PubMed knowledge and EHR data to develop a weighted bayesian network for pancreatic cancer prediction , 2011, J. Biomed. Informatics.

[70]  David Barber,et al.  Bayesian reasoning and machine learning , 2012 .

[71]  Henrik Møller,et al.  Variation in surgical resection for lung cancer in relation to survival: population-based study in England 2004-2006. , 2012, European journal of cancer.

[72]  S. Steele,et al.  Clinical Decision Support and Individualized Prediction of Survival in Colon Cancer: Bayesian Belief Network Model , 2012, Annals of Surgical Oncology.

[73]  M. Peake,et al.  Increasing the surgical resection rate for lung cancer in the UK: the debate , 2013 .

[74]  Liquid Biopsies Non-small-cell lung cancer , 2015, Nature Reviews Disease Primers.