Assessing the Clinical Impact of Risk Prediction Models With Decision Curves: Guidance for Correct Interpretation and Appropriate Use.

The decision curve is a graphical summary recently proposed for assessing the potential clinical impact of risk prediction biomarkers or risk models for recommending treatment or intervention. It was applied recently in an article in Journal of Clinical Oncology to measure the impact of using a genomic risk model for deciding on adjuvant radiation therapy for prostate cancer treated with radical prostatectomy. We illustrate the use of decision curves for evaluating clinical- and biomarker-based models for predicting a man's risk of prostate cancer, which could be used to guide the decision to biopsy. Decision curves are grounded in a decision-theoretical framework that accounts for both the benefits of intervention and the costs of intervention to a patient who cannot benefit. Decision curves are thus an improvement over purely mathematical measures of performance such as the area under the receiver operating characteristic curve. However, there are challenges in using and interpreting decision curves appropriately. We caution that decision curves cannot be used to identify the optimal risk threshold for recommending intervention. We discuss the use of decision curves for miscalibrated risk models. Finally, we emphasize that a decision curve shows the performance of a risk model in a population in which every patient has the same expected benefit and cost of intervention. If every patient has a personal benefit and cost, then the curves are not useful. If subpopulations have different benefits and costs, subpopulation-specific decision curves should be used. As a companion to this article, we released an R software package called DecisionCurve for making decision curves and related graphics.

[1]  C S Peirce,et al.  The numerical measure of the success of predictions. , 1884, Science.

[2]  E. Elkin,et al.  Decision Curve Analysis: A Novel Method for Evaluating Prediction Models , 2006, Medical decision making : an international journal of the Society for Medical Decision Making.

[3]  B. Guillonneau,et al.  Is it necessary to remove the seminal vesicles completely at radical prostatectomy? decision curve analysis of European Society of Urologic Oncology criteria. , 2009, The Journal of urology.

[4]  S. Baker Putting risk prediction in perspective: relative utility curves. , 2009, Journal of the National Cancer Institute.

[5]  Nancy R Cook,et al.  Using relative utility curves to evaluate risk prediction , 2009, Journal of the Royal Statistical Society. Series A,.

[6]  Andrew J Vickers,et al.  Traditional statistical methods for evaluating prediction models are uninformative as to clinical value: towards a decision analytic framework. , 2010, Seminars in oncology.

[7]  David I. Lee,et al.  Comparison of three different tools for prediction of seminal vesicle invasion at radical prostatectomy. , 2012, European urology.

[8]  S. Duffy,et al.  Predictive Accuracy of the Liverpool Lung Project Risk Model for Stratifying Patients for Computed Tomography Screening for Lung Cancer , 2012, Annals of Internal Medicine.

[9]  J. Reitsma,et al.  Beyond Diagnostic Accuracy: The Clinical Utility of Diagnostic Tests , 2012 .

[10]  Y. Vergouwe,et al.  Zinc protoporphyrin levels have added value in the prediction of low hemoglobin deferral in whole blood donors , 2013, Transfusion.

[11]  U. Capitanio,et al.  Head-to-head comparison of prostate health index and urinary PCA3 for predicting cancer at initial or repeat biopsy. , 2013, The Journal of urology.

[12]  Ksenija Slankamenac,et al.  Novel Prediction Score Including Pre- and Intraoperative Parameters Best Predicts Acute Kidney Injury after Liver Surgery , 2013, World Journal of Surgery.

[13]  W. Lau,et al.  A nomogram predicting pulmonary metastasis of hepatocellular carcinoma following partial hepatectomy , 2014, British Journal of Cancer.

[14]  Hua Yang,et al.  Urinary cell mRNA profiles and differential diagnosis of acute kidney graft dysfunction. , 2014, Journal of the American Society of Nephrology : JASN.

[15]  P. Chow,et al.  Correction: The Singapore Liver Cancer Recurrence (SLICER) Score for Relapse Prediction in Patients with Surgically Resected Hepatocellular Carcinoma , 2015, PloS one.

[16]  A. Weil,et al.  Predicting success of endoscopic third ventriculostomy: validation of the ETV Success Score in a mixed population of adult and pediatric patients. , 2015, Journal of neurosurgery.

[17]  B. Kramer,et al.  Evaluating Prognostic Markers Using Relative Utility Curves and Test Tradeoffs. , 2015, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[18]  P. Scardino,et al.  Predicting High-Grade Cancer at Ten-Core Prostate Biopsy Using Four Kallikrein Markers Measured in Blood in the ProtecT Study , 2015, Journal of the National Cancer Institute.

[19]  F. Feng,et al.  Genomic classifier identifies men with adverse pathology after radical prostatectomy who benefit from adjuvant radiation therapy. , 2015, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[20]  B. van Calster,et al.  Calibration of Risk Prediction Models , 2015, Medical decision making : an international journal of the Society for Medical Decision Making.

[21]  Hester F. Lingsma,et al.  Measurement of the glial fibrillary acidic protein and its breakdown products GFAP-BDP biomarker for the detection of traumatic brain injury compared to computed tomography and magnetic resonance imaging. , 2015, Journal of neurotrauma.

[22]  Jing Fan,et al.  The Net Reclassification Index (NRI): A Misleading Measure of Prediction Improvement Even with Independent Test Data Sets , 2015, Statistics in biosciences.