Aggregating published prediction models with individual participant data: a comparison of different approaches

During the recent decades, interest in prediction models has substantially increased, but approaches to synthesize evidence from previously developed models have failed to keep pace. This causes researchers to ignore potentially useful past evidence when developing a novel prediction model with individual participant data (IPD) from their population of interest. We aimed to evaluate approaches to aggregate previously published prediction models with new data. We consider the situation that models are reported in the literature with predictors similar to those available in an IPD dataset. We adopt a two-stage method and explore three approaches to calculate a synthesis model, hereby relying on the principles of multivariate meta-analysis. The former approach employs a naive pooling strategy, whereas the latter accounts for within-study and between-study covariance. These approaches are applied to a collection of 15 datasets of patients with traumatic brain injury, and to five previously published models for predicting deep venous thrombosis. Here, we illustrated how the generally unrealistic assumption of consistency in the availability of evidence across included studies can be relaxed. Results from the case studies demonstrate that aggregation yields prediction models with an improved discrimination and calibration in a vast majority of scenarios, and result in equivalent performance (compared with the standard approach) in a small minority of situations. The proposed aggregation approaches are particularly useful when few participant data are at hand. Assessing the degree of heterogeneity between IPD and literature findings remains crucial to determine the optimal approach in aggregating previous evidence into new prediction models.

[1]  Ewout W Steyerberg,et al.  Validation and updating of predictive logistic regression models: a study on sample size and shrinkage , 2004, Statistics in medicine.

[2]  Katalin Balázs,et al.  Detecting Heterogeneity in Logistic Regression Models , 2006 .

[3]  A. Hoes,et al.  Diagnostic classification in patients with suspected deep venous thrombosis: physicians' judgement or a decision rule? , 2010, The British journal of general practice : the journal of the Royal College of General Practitioners.

[4]  P. Royston,et al.  Prognosis and prognostic research: application and impact of prognostic models in clinical practice , 2009, BMJ : British Medical Journal.

[5]  E W Steyerberg,et al.  Stepwise selection in small data sets: a simulation study of bias in logistic regression analysis. , 1999, Journal of clinical epidemiology.

[6]  M. Pencina,et al.  Evaluating the added predictive ability of a new marker: From area under the ROC curve to reclassification and beyond , 2008, Statistics in medicine.

[7]  H C Van Houwelingen,et al.  Construction, validation and updating of a prognostic model for kidney graft survival. , 1995, Statistics in medicine.

[8]  Johannes B Reitsma,et al.  Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. , 2005, Journal of clinical epidemiology.

[9]  G. Kovacs,et al.  Evaluation of D-Dimer in the diagnosis of suspected deep-vein thrombosis , 2004 .

[10]  N. Laird,et al.  Meta-analysis in clinical trials. , 1986, Controlled clinical trials.

[11]  A. Abu-Hanna,et al.  Prognostic Models in Medicine , 2001, Methods of Information in Medicine.

[12]  D E Grobbee,et al.  External validation is necessary in prediction research: a clinical example. , 2003, Journal of clinical epidemiology.

[13]  J. Sleigh,et al.  Diagnosis of lower limb deep venous thrombosis in emergency department patients: performance of Hamilton and modified Wells scores. , 2006, Annals of emergency medicine.

[14]  Y. Vergouwe,et al.  Validation, updating and impact of clinical prediction rules: a review. , 2008, Journal of clinical epidemiology.

[15]  Dan Jackson,et al.  Multivariate meta-analysis: Potential and promise , 2011, Statistics in medicine.

[16]  Betsy Jane Becker,et al.  The Synthesis of Regression Slopes in Meta-Analysis. , 2007, 0801.4442.

[17]  Ewout W Steyerberg,et al.  A systematic review finds methodological improvements necessary for prognostic models in determining traumatic brain injury outcomes. , 2008, Journal of clinical epidemiology.

[18]  E W Steyerberg,et al.  See Blockindiscussions, Blockinstats, Blockinand Blockinauthor Blockinprofiles Blockinfor Blockinthis Blockinpublication Prognostic Blockinmodels Blockinbased Blockinon Blockinliterature Blockinand Individual Blockinpatient Blockindata Blockinin Blockinlogistic Blockinregression Analysis Article Blo , 2022 .

[19]  Juan Lu,et al.  IMPACT database of traumatic brain injury: design and description. , 2007, Journal of neurotrauma.

[20]  A J Sutton,et al.  Meta‐analysis of individual‐ and aggregate‐level data , 2008, Statistics in medicine.

[21]  D. Altman,et al.  Statistical heterogeneity in systematic reviews of clinical trials: a critical appraisal of guidelines and practice , 2002, Journal of health services research & policy.

[22]  W. Levack,et al.  Experience of recovery and outcome following traumatic brain injury: a metasynthesis of qualitative research , 2010, Disability and rehabilitation.

[23]  G. Bedogni,et al.  Clinical Prediction Models—a Practical Approach to Development, Validation and Updating , 2009 .

[24]  Karel Moons,et al.  Safely Ruling Out Deep Venous Thrombosis in Primary Care , 2009, Annals of Internal Medicine.

[25]  A. Skene,et al.  A trial of the effect of nimodipine on outcome after head injury , 2005, Acta Neurochirurgica.

[26]  Theo Stijnen,et al.  Advanced methods in meta‐analysis: multivariate approach and meta‐regression , 2002, Statistics in medicine.

[27]  Richard D. Riley,et al.  A systematic review of breast cancer incidence risk prediction models with meta-analysis of their performance , 2012, Breast Cancer Research and Treatment.

[28]  Y Vergouwe,et al.  A new diagnostic rule for deep vein thrombosis: safety and efficiency in clinically relevant subgroups. , 2007, Family practice.

[29]  R. Ferguson,et al.  LYMPHOID CELLS IN JEJUNAL MUCOSA , 1975, The Lancet.

[30]  B Jennett,et al.  PREDICTING OUTCOME IN INDIVIDUAL PATIENTS AFTER SEVERE HEAD INJURY , 1976, The Lancet.

[31]  Richard D Riley,et al.  Bivariate random-effects meta-analysis and the estimation of between-study correlation , 2007, BMC Medical Research Methodology.

[32]  D. Altman,et al.  Measuring inconsistency in meta-analyses , 2003, BMJ : British Medical Journal.

[33]  Dan Jackson,et al.  Extending DerSimonian and Laird's methodology to perform multivariate random effects meta‐analyses , 2009, Statistics in medicine.

[34]  K. Covinsky,et al.  Assessing the Generalizability of Prognostic Information , 1999, Annals of Internal Medicine.

[35]  Gordon D. Murray,et al.  A multicenter trial of the efficacy of nimodipine on outcome after severe head injury. The European Study Group on Nimodipine in Severe Head Injury. , 1994, Journal of neurosurgery.

[36]  D. Mottier,et al.  Réalisation d’un score clinique de prédiction de thrombose veineuse profonde des membres inférieurs spécifique à la médecine générale , 2009 .

[37]  Richard D Riley,et al.  Meta‐analysis of continuous outcomes combining individual patient data and aggregate data , 2008, Statistics in medicine.

[38]  Yvonne Vergouwe,et al.  A simple method to adjust clinical prediction models to local circumstances , 2009, Canadian journal of anaesthesia = Journal canadien d'anesthesie.

[39]  N. Cook Use and Misuse of the Receiver Operating Characteristic Curve in Risk Prediction , 2007, Circulation.

[40]  Christiaan de Leeuw,et al.  Augmenting Data With Published Results in Bayesian Linear Regression , 2012, Multivariate behavioral research.

[41]  Lawrence Joseph,et al.  Impact of approximating or ignoring within‐study covariances in multivariate meta‐analyses , 2008, Statistics in medicine.

[42]  Ian Roberts,et al.  Systematic review of prognostic models in traumatic brain injury , 2006, BMC Medical Informatics Decis. Mak..

[43]  A. Hyder,et al.  The impact of traumatic brain injuries: a global perspective. , 2007, NeuroRehabilitation.

[44]  Janis Bormanis,et al.  Value of assessment of pretest probability of deep-vein thrombosis in clinical management , 1997, The Lancet.

[45]  Juan Lu,et al.  Predicting Outcome after Traumatic Brain Injury: Development and International Validation of Prognostic Scores Based on Admission Characteristics , 2008, PLoS medicine.

[46]  Ewout W Steyerberg,et al.  Internal and external validation of predictive models: a simulation study of bias and precision in small samples. , 2003, Journal of clinical epidemiology.

[47]  G. Brier VERIFICATION OF FORECASTS EXPRESSED IN TERMS OF PROBABILITY , 1950 .

[48]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[49]  Karel G M Moons,et al.  Ruling out deep venous thrombosis in primary care , 2005, Thrombosis and Haemostasis.

[50]  Richard D Riley,et al.  Evidence synthesis combining individual patient data and aggregate data: a systematic review identified current practice and possible methods. , 2007, Journal of clinical epidemiology.

[51]  Douglas G Altman,et al.  Prognostic Models: A Methodological Framework and Review of Models for Breast Cancer , 2009, Cancer investigation.

[52]  Richard D Riley,et al.  Ten steps towards improving prognosis research , 2009, BMJ : British Medical Journal.