Bayesian Networks for Risk Prediction Using Real-World Data: A Tool for Precision Medicine.

OBJECTIVE The fields of medicine and public health are undergoing a data revolution, and the increasing availability of data has brought a growing interest in machine-learning algorithms. Our objective is to introduce the reader to Bayesian networks (BNs), a knowledge representation and machine-learning tool for risk estimation in medical science.

STUDY DESIGN We review how BNs are compact and intuitive graphical representations of joint probability distributions (JPDs) that can be used for causal reasoning and risk estimation and that offer several advantages over regression-based methods. BNs take a different approach to risk estimation: the JPD is represented as a network whose nodes and arcs represent the model's random variables and the influences between them, respectively.

METHODS We explore some of the challenges associated with traditional risk prediction methods and then describe BNs, their construction, their application, and their advantages in risk prediction, using examples from cancer and heart disease.

RESULTS Risk modeling with BNs has advantages over regression-based approaches. We focus on three that are relevant to health outcomes research: (1) the generation of network structures in which relationships between variables can be easily communicated; (2) the ability to apply Bayes's theorem to conduct individual-level risk estimation; and (3) easy transformation into decision models.

CONCLUSIONS Bayesian networks are a powerful and flexible tool for the analysis of health economics and outcomes research data in the era of precision medicine.
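To make the idea of individual-level risk estimation concrete, the following is a minimal sketch of a three-node network (smoker → disease → test result), written in plain Python. It is not taken from the article: the variable names, the network structure, and every probability value are hypothetical and chosen only for illustration. The sketch shows how the graph factorizes the JPD and how Bayes's theorem, applied by enumeration, updates one individual's risk once evidence is observed.

```python
from itertools import product

# Hypothetical conditional probability tables (illustrative numbers only,
# not estimates from any study).
P_SMOKER = {True: 0.3, False: 0.7}                     # P(smoker)
P_DISEASE_GIVEN_SMOKER = {True: 0.10, False: 0.02}     # P(disease=True | smoker)
P_POSITIVE_GIVEN_DISEASE = {True: 0.90, False: 0.05}   # P(test=True | disease)


def bernoulli(p_true, value):
    """Probability of a binary value given the probability of True."""
    return p_true if value else 1.0 - p_true


def joint(smoker, disease, positive):
    """Joint probability from the factorization implied by the graph:
    P(smoker) * P(disease | smoker) * P(positive | disease)."""
    return (P_SMOKER[smoker]
            * bernoulli(P_DISEASE_GIVEN_SMOKER[smoker], disease)
            * bernoulli(P_POSITIVE_GIVEN_DISEASE[disease], positive))


def posterior_disease(**evidence):
    """P(disease=True | evidence), computed by summing the joint over
    every assignment that is consistent with the observed evidence."""
    numerator = 0.0
    denominator = 0.0
    for smoker, disease, positive in product([True, False], repeat=3):
        world = {"smoker": smoker, "disease": disease, "positive": positive}
        if any(world[name] != value for name, value in evidence.items()):
            continue  # assignment contradicts the evidence
        p = joint(smoker, disease, positive)
        denominator += p
        if disease:
            numerator += p
    return numerator / denominator


if __name__ == "__main__":
    print("Prior risk:", posterior_disease())
    print("Smoker, positive test:", posterior_disease(smoker=True, positive=True))
    print("Non-smoker, positive test:", posterior_disease(smoker=False, positive=True))
```

Under these assumed numbers, the same positive test result yields a different posterior risk for a smoker than for a non-smoker, which is the sense in which a BN supports individual-level estimation. Enumeration is practical only for very small networks; larger models would normally rely on a dedicated inference library.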
