A Mathematical Model for Interpretable Clinical Decision Support with Applications in Gynecology

Background Over time, methods for the development of clinical decision support (CDS) systems have evolved from interpretable and easy-to-use scoring systems to very complex and non-interpretable mathematical models. In order to accomplish effective decision support, CDS systems should provide information on how the model arrives at a certain decision. To address the issue of incompatibility between performance, interpretability and applicability of CDS systems, this paper proposes an innovative model structure, automatically leading to interpretable and easily applicable models. The resulting models can be used to guide clinicians when deciding upon the appropriate treatment, estimating patient-specific risks and to improve communication with patients. Methods and Findings We propose the interval coded scoring (ICS) system, which imposes that the effect of each variable on the estimated risk is constant within consecutive intervals. The number and position of the intervals are automatically obtained by solving an optimization problem, which additionally performs variable selection. The resulting model can be visualised by means of appealing scoring tables and color bars. ICS models can be used within software packages, in smartphone applications, or on paper, which is particularly useful for bedside medicine and home-monitoring. The ICS approach is illustrated on two gynecological problems: diagnosis of malignancy of ovarian tumors using a dataset containing 3,511 patients, and prediction of first trimester viability of pregnancies using a dataset of 1,435 women. Comparison of the performance of the ICS approach with a range of prediction models proposed in the literature illustrates the ability of ICS to combine optimal performance with the interpretability of simple scoring systems. Conclusions The ICS approach can improve patient-clinician communication and will provide additional insights in the importance and influence of available variables. Future challenges include extensions of the proposed methodology towards automated detection of interaction effects, multi-class decision support systems, prognosis and high-dimensional data.

[1]  Robert S. Ledley,et al.  Computers in Medical Data Processing , 1960 .

[2]  C. R. Deboor,et al.  A practical guide to splines , 1978 .

[3]  Anne Lohrli Chapman and Hall , 1985 .

[4]  D. Oram,et al.  A risk of malignancy index incorporating CA 125, ultrasound and menopausal status for the accurate preoperative diagnosis of ovarian cancer , 1991, British journal of obstetrics and gynaecology.

[5]  L. Rudin,et al.  Nonlinear total variation based noise removal algorithms , 1992 .

[6]  Randolph A. Miller,et al.  Review: Medical Diagnostic Decision Support Systems - Past, Present, And Future: A Threaded Bibliography and Brief Commentary , 1994, J. Am. Medical Informatics Assoc..

[7]  J. Wyatt,et al.  Commentary: Prognostic models: clinically useful or quickly forgotten? , 1995 .

[8]  G. Pilu,et al.  Sonography of pregnancies with first‐trimester bleeding and a viable embryo: a study of prognostic indicators by logistic regression analysis , 1996, Ultrasound in obstetrics & gynecology : the official journal of the International Society of Ultrasound in Obstetrics and Gynecology.

[9]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[10]  T. Bourne,et al.  Terms, definitions and measurements to describe the sonographic features of adnexal tumors: a consensus opinion from the International Ovarian Tumor Analysis (IOTA) group , 2000, Ultrasound in obstetrics & gynecology : the official journal of the International Society of Ultrasound in Obstetrics and Gynecology.

[11]  Kevin Regan,et al.  Nomogram for overall survival of patients with progressive metastatic prostate cancer after castration. , 2002, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[12]  P. Tang,et al.  Medical Computer Applications in Health Care and Biomedicine , 2002 .

[13]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machines , 2002 .

[14]  P. Bossuyt,et al.  The diagnostic odds ratio: a single indicator of test performance. , 2003, Journal of clinical epidemiology.

[15]  M. Kattan Comparison of Cox regression with other methods for determining prediction models and nomograms. , 2003, The Journal of urology.

[16]  Sunil J Rao,et al.  Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis , 2003 .

[17]  M. Kattan Nomograms are superior to staging and risk grouping systems for identifying high-risk patients: preoperative application in prostate cancer , 2003, Current opinion in urology.

[18]  Per Kragh Andersen,et al.  Regression Modeling Strategies with Applications to Linear Models, Logistic Regression and Survival Analysis. Frank E. Harrell, Jun, Springer‐Verlag, New York, 2001. No. of pages: 568. ISBN 0‐387‐95232‐2 , 2003 .

[19]  L. Rombauts,et al.  Ultrasound prediction of risk of spontaneous miscarriage in live embryos from assisted conceptions , 2003, Ultrasound in obstetrics & gynecology : the official journal of the International Society of Ultrasound in Obstetrics and Gynecology.

[20]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[21]  Hiram S. Cody,et al.  A Nomogram for Predicting the Likelihood of Additional Nodal Metastases in Breast Cancer Patients With a Positive Sentinel Node Biopsy , 2003, Annals of Surgical Oncology.

[22]  Ralph B D'Agostino,et al.  Presentation of multivariate data for clinical use: The Framingham Study risk score functions. , 2005, Statistics in medicine.

[23]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[24]  G. Purcell,et al.  What makes a good clinical decision support system , 2005, BMJ : British Medical Journal.

[25]  Michael W Kattan,et al.  When and how to use informatics tools in caring for urologic patients , 2005, Nature Clinical Practice Urology.

[26]  E. Balas,et al.  Improving clinical practice using clinical decision support systems: a systematic review of trials to identify features critical to success , 2005, BMJ : British Medical Journal.

[27]  T. Bourne,et al.  Logistic regression model to distinguish between the benign and malignant adnexal mass before surgery: a multicenter study by the International Ovarian Tumor Analysis Group. , 2005, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[28]  M. Kattan,et al.  Development and validation of a nomogram predicting the outcome of prostate biopsy based on patient age, digital rectal examination and serum prostate specific antigen. , 2005, The Journal of urology.

[29]  R. Wears,et al.  Computer technology and clinical work: still waiting for Godot. , 2005, JAMA.

[30]  M. Kattan,et al.  Predicting Nonsentinel Node Status After Positive Sentinel Lymph Biopsy for Breast Cancer: Clinicians Versus Nomogram , 2005, Annals of Surgical Oncology.

[31]  Johan A. K. Suykens,et al.  Primal-Dual Monotone Kernel Regression , 2005, Neural Processing Letters.

[32]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[33]  Stephen P. Boyd,et al.  Enhancing Sparsity by Reweighted ℓ1 Minimization , 2007, 0711.1612.

[34]  Hartwig Huland,et al.  A critical appraisal of logistic regression‐based nomograms, artificial neural networks, classification and regression‐tree models, look‐up tables and risk‐group stratification models for prostate cancer , 2007, BJU international.

[35]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[36]  Pierre I Karakiewicz,et al.  Initial biopsy outcome prediction--head-to-head comparison of a logistic regression-based nomogram versus artificial neural network. , 2007, European urology.

[37]  J. Suykens,et al.  Preoperative diagnosis of ovarian tumors using Bayesian kernel‐based methods , 2007, Ultrasound in obstetrics & gynecology : the official journal of the International Society of Ultrasound in Obstetrics and Gynecology.

[38]  E.J. Candes,et al.  An Introduction To Compressive Sampling , 2008, IEEE Signal Processing Magazine.

[39]  Shahrokh F. Shariat,et al.  Comparison of Nomograms With Other Methods for Predicting Outcomes in Prostate Cancer: A Critical Analysis of the Literature , 2008, Clinical Cancer Research.

[40]  Ofer Harel,et al.  MIDAS: A SAS Macro for Multiple Imputation Using Distance-Aided Selection of Donors , 2009 .

[41]  Bart De Moor,et al.  Development of a kernel function for clinical data , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[42]  Dean F Sittig,et al.  Implementation pearls from a new guidebook on improving medication use and outcomes with clinical decision support. Effective CDS is essential for addressing healthcare performance improvement imperatives. , 2009, Journal of healthcare information management : JHIM.

[43]  N. Obuchowski,et al.  Assessing the Performance of Prediction Models: A Framework for Traditional and Novel Measures , 2010, Epidemiology.

[44]  Andrew P. Bradley,et al.  Rule extraction from support vector machines: A review , 2010, Neurocomputing.

[45]  Sabine Van Huffel,et al.  Polytomous diagnosis of ovarian tumors as benign, borderline, primary invasive or metastatic: development and validation of standard and kernel-based risk prediction models , 2010, BMC medical research methodology.

[46]  S Van Huffel,et al.  Ovarian cancer prediction in adnexal masses using ultrasound‐based logistic regression models: a temporal and external validation study by the IOTA group , 2010, Ultrasound in obstetrics & gynecology : the official journal of the International Society of Ultrasound in Obstetrics and Gynecology.

[47]  Sabine Van Huffel,et al.  Support vector methods for survival analysis: a comparison between ranking and regression approaches , 2011, Artif. Intell. Medicine.

[48]  L. Hayden,et al.  Ten Commandments for Effective Clinical Decision Support: Making the Practice of Evidence-based Medicine a Reality , 2011 .

[49]  Sabine Van Huffel,et al.  Learning Transformation Models for Ranking and Survival Analysis , 2011, J. Mach. Learn. Res..

[50]  R. Tibshirani,et al.  Generalized Additive Models , 1986 .

[51]  R. Shah,et al.  Least Squares Support Vector Machines , 2022 .