Explicit inclusion of treatment in prognostic modeling was recommended in observational and randomized settings.

OBJECTIVES To compare different methods to handle treatment when developing a prognostic model that aims to produce accurate probabilities of the outcome of individuals if left untreated. STUDY DESIGN AND SETTING Simulations were performed based on two normally distributed predictors, a binary outcome, and a binary treatment, mimicking a randomized trial or an observational study. Comparison was made between simply ignoring treatment (SIT), restricting the analytical data set to untreated individuals (AUT), inverse probability weighting (IPW), and explicit modeling of treatment (MT). Methods were compared in terms of predictive performance of the model and the proportion of incorrect treatment decisions. RESULTS Omitting a genuine predictor of the outcome from the prognostic model decreased model performance, in both an observational study and a randomized trial. In randomized trials, the proportion of incorrect treatment decisions was smaller when applying AUT or MT, compared to SIT and IPW. In observational studies, MT was superior to all other methods regarding the proportion of incorrect treatment decisions. CONCLUSION If a prognostic model aims to produce correct probabilities of the outcome in the absence of treatment, ignoring treatments that affect that outcome can lead to suboptimal model performance and incorrect treatment decisions. Explicitly, modeling treatment is recommended.

[1]  G. Collins,et al.  Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): The TRIPOD Statement , 2015, Annals of Internal Medicine.

[2]  Richard D Riley,et al.  Prognosis research strategy (PROGRESS) 1: A framework for researching clinical outcomes , 2013, BMJ : British Medical Journal.

[3]  T. Alonzo Clinical Prediction Models: A Practical Approach to Development, Validation, and Updating By Ewout W. Steyerberg , 2009 .

[4]  Shah Ebrahim,et al.  European Guidelines on Cardiovascular Disease Prevention in Clinical Practice (Version 2012) , 2012, International Journal of Behavioral Medicine.

[5]  Ewout W. Steyerberg,et al.  Validation of Prediction Models , 2019, Statistics for Biology and Health.

[6]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[7]  P. Royston,et al.  Prognosis and prognostic research: application and impact of prognostic models in clinical practice , 2009, BMJ : British Medical Journal.

[8]  Yoav Ben-Shlomo,et al.  A longitudinal model for disease progression was developed and applied to multiple sclerosis , 2015, Journal of Clinical Epidemiology.

[9]  N. Cook,et al.  Further insight into the cardiovascular risk calculator: the roles of statins, revascularizations, and underascertainment in the Women's Health Study. , 2014, JAMA internal medicine.

[10]  F. Harrell,et al.  Regression modelling strategies for improved prognostic prediction. , 1984, Statistics in medicine.

[11]  J. Broderick,et al.  Predicting prognosis after stroke , 2000, Neurology.

[12]  Andrea Ganna,et al.  Prediction impact curve is a new measure integrating intervention effects in the evaluation of risk models. , 2016, Journal of clinical epidemiology.

[13]  D. Lawlor,et al.  Clustered Environments and Randomized Genes: A Fundamental Distinction between Conventional and Genetic Epidemiology , 2007, PLoS medicine.

[14]  G. Brier VERIFICATION OF FORECASTS EXPRESSED IN TERMS OF PROBABILITY , 1950 .

[15]  P Glasziou,et al.  Cardiovascular risk scores do not account for the effect of treatment: a review , 2011, Heart.

[16]  Hester F. Lingsma,et al.  Prognostic Value of Major Extracranial Injury in Traumatic Brain Injury: An Individual Patient Data Meta-analysis in 39 274 Patients , 2012, Neurosurgery.