A comparison of existing methods for multiple imputation in individual participant data meta‐analysis

Multiple imputation is a popular method for addressing missing data, but its implementation is difficult when data have a multilevel structure and one or more variables are systematically missing. This systematic missing data pattern may commonly occur in meta-analysis of individual participant data, where some variables are never observed in some studies, but are present in other hierarchical data settings. In these cases, valid imputation must account for both relationships between variables and correlation within studies. Proposed methods for multilevel imputation include specifying a full joint model and multiple imputation with chained equations (MICE). While MICE is attractive for its ease of implementation, there is little existing work describing conditions under which this is a valid alternative to specifying the full joint model. We present results showing that for multilevel normal models, MICE is rarely exactly equivalent to joint model imputation. Through a simulation study and an example using data from a traumatic brain injury study, we found that in spite of theoretical differences, MICE imputations often produce results similar to those obtained using the joint model. We also assess the influence of prior distributions in MICE imputation methods and find that when missingness is high, prior choices in MICE models tend to affect estimation of across-study variability more than compatibility of conditional likelihoods. Copyright © 2017 John Wiley & Sons, Ltd.

[1]  David Kline,et al.  Comparing multiple imputation methods for systematically missing subject‐level data , 2017, Research synthesis methods.

[2]  Patrick Royston,et al.  Multiple imputation using chained equations: Issues and guidance for practice , 2011, Statistics in medicine.

[3]  S. Thompson,et al.  How should meta‐regression analyses be undertaken and interpreted? , 2002, Statistics in medicine.

[4]  Eberechukwu Onukwugha,et al.  Concordance between administrative claims and registry data for identifying metastasis to the bone: an exploratory analysis in prostate cancer , 2014, BMC Medical Research Methodology.

[5]  Evangelos Kontopantelis,et al.  A comparison of one‐stage vs two‐stage individual patient data meta‐analysis methods: A simulation study , 2018, Research synthesis methods.

[6]  D. Rubin,et al.  Fully conditional specification in multivariate imputation , 2006 .

[7]  Rebecca R Andridge,et al.  Quantifying the impact of fixed effects modeling of clusters in multiple imputation for cluster randomized trials , 2011, Biometrical journal. Biometrische Zeitschrift.

[8]  James R Carpenter,et al.  Multiple imputation of covariates by fully conditional specification: Accommodating the substantive model , 2012, Statistical methods in medical research.

[9]  Craig K Enders,et al.  Multilevel multiple imputation: A review and evaluation of joint modeling and chained equations imputation. , 2016, Psychological methods.

[10]  J. Schafer,et al.  Computational Strategies for Multivariate Linear Mixed-Effects Models With Missing Values , 2002 .

[11]  Recai M. Yucel,et al.  Multiple imputation inference for multivariate multilevel continuous data with ignorable non-response , 2008, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[12]  J. Tawn,et al.  Efficient inference for spatial extreme value processes associated to log-Gaussian random functions , 2014 .

[13]  H Tunstall-Pedoe,et al.  Systematically missing confounders in individual participant data meta-analysis of observational cohort studies , 2009, Statistics in medicine.

[14]  H. Taylor,et al.  Clinically significant behavior problems during the initial 18 months following early childhood traumatic brain injury. , 2010, Rehabilitation psychology.

[15]  Michael G. Kenward,et al.  Multiple Imputation and its Application: Carpenter/Multiple Imputation and its Application , 2013 .

[16]  Karel G M Moons,et al.  Imputation of systematically missing predictors in an individual participant data meta‐analysis: a generalized approach using MICE , 2015, Statistics in medicine.

[17]  B. Arnold,et al.  Conditionally Specified Distributions: An Introduction (with comments and a rejoinder by the authors) , 2001 .

[18]  Matthieu Resche-Rigon,et al.  Multiple imputation for handling systematically missing confounders in meta‐analysis of individual participant data , 2013, Statistics in medicine.

[19]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[20]  D. Drotar,et al.  Predicting Longitudinal Patterns of Functional Deficits in Children with Traumatic Brain Injury Many Studies Have Found Evidence of Deterioration in Behavioral Adjust- Ment across Time and Persistent Postinjury Behavior Problems In , 2022 .

[21]  Robert D Gibbons,et al.  Multiple imputation for harmonizing longitudinal non‐commensurate measures in individual participant data meta‐analysis , 2015, Statistics in medicine.

[22]  B. Arnold,et al.  Compatible Conditional Distributions , 1989 .

[23]  L. Derogatis,et al.  The Brief Symptom Inventory: an introductory report , 1983, Psychological Medicine.