Multiple imputation by chained equations: what is it and how does it work?

Multivariate imputation by chained equations (MICE) has emerged as a principled method of dealing with missing data. Despite properties that make MICE particularly useful for large imputation procedures and advances in software development that now make it accessible to many researchers, many psychiatric researchers have not been trained in these methods and few practical resources exist to guide researchers in the implementation of this technique. This paper provides an introduction to the MICE method with a focus on practical aspects and challenges in using this method. A brief review of software programs available to implement MICE and then analyze multiply imputed data is also provided. Copyright © 2011 John Wiley & Sons, Ltd.

[1]  Geert Molenberghs,et al.  Incomplete hierarchical data , 2007, Statistical methods in medical research.

[2]  Donald Hedeker,et al.  An imputation strategy for incomplete longitudinal ordinal data , 2008, Statistics in medicine.

[3]  J. Graham,et al.  How Many Imputations are Really Needed? Some Practical Clarifications of Multiple Imputation Theory , 2007, Prevention Science.

[4]  Brigitte Manteuffel,et al.  Overview of the National Evaluation of the Comprehensive Community Mental Health Services for Children and Their Families Program and Summary of Current Findings , 2002 .

[5]  Patrick Royston,et al.  Multiple Imputation of Missing Values: Update , 2005 .

[6]  Robert M. Friedman,et al.  Overview of the National Evaluation of the Comprehensive Community Mental Health Services for Children and Their Families Program , 2001 .

[7]  Xiao-Hua Zhou,et al.  Multiple imputation: review of theory, implementation and software , 2007, Statistics in medicine.

[8]  S. van Buuren Multiple imputation of discrete and continuous data by fully conditional specification , 2007, Statistical methods in medical research.

[9]  Elizabeth A Stuart,et al.  American Journal of Epidemiology Practice of Epidemiology Multiple Imputation with Large Data Sets: a Case Study of the Children's Mental Health Initiative , 2022 .

[10]  StataCorp Stata multiple-imputation reference manual , 2011 .

[11]  T. Raghunathan,et al.  Multiple Imputation of Missing Income Data in the National Health Interview Survey , 2006 .

[12]  Recai M. Yucel,et al.  Multiple imputation inference for multivariate multilevel continuous data with ignorable non-response , 2008, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[13]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[14]  J. Schafer,et al.  A comparison of inclusive and restrictive strategies in modern missing data procedures. , 2001, Psychological methods.

[15]  Devan V Mehrotra,et al.  Analysis of incomplete longitudinal binary data using multiple imputation , 2006, Statistics in medicine.

[16]  J. Graham Adding Missing-Data-Relevant Variables to FIML-Based Structural Equation Models , 2003 .

[17]  Trevillore E. Raghunathan,et al.  IVEware: Imputation and Variance Estimation Software User Guide , 2002 .

[18]  Theo Stijnen,et al.  Using the outcome for imputation of missing predictor values was preferred. , 2006, Journal of clinical epidemiology.

[19]  A. Gelman,et al.  Multiple Imputation with Diagnostics (mi) in R: Opening Windows into the Black Box , 2011 .

[20]  Jaakko Nevalainen,et al.  Missing values in longitudinal dietary data: A multiple imputation approach based on a fully conditional specification , 2009, Statistics in medicine.

[21]  Oliver Rivero-Arias,et al.  Evaluation of software for multiple imputation of semi-continuous data , 2007, Statistical methods in medical research.

[22]  Andrew Gelman,et al.  Diagnostics for multivariate imputations , 2007 .

[23]  J. Schafer Multiple imputation: a primer , 1999, Statistical methods in medical research.

[24]  J. Schafer Multiple Imputation in Multivariate Problems When the Imputation and Analysis Models Differ , 2003 .

[25]  Patrick Royston,et al.  Multiple Imputation of Missing Values: New Features for Mim , 2009 .

[26]  John Van Hoewyk,et al.  A multivariate technique for multiply imputing missing values using a sequence of regression models , 2001 .

[27]  John B Carlin,et al.  American Journal of Epidemiology Practice of Epidemiology Multiple Imputation for Missing Data: Fully Conditional Specification versus Multivariate Normal Imputation , 2022 .

[28]  J.P.L. Brand,et al.  Development, Implementation and Evaluation of Multiple Imputation Strategies for the Statistical Analysis of Incomplete Data Sets , 1999 .

[29]  J. Graham,et al.  Missing data analysis: making it work in the real world. , 2009, Annual review of psychology.

[30]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[31]  A. Zaslavsky,et al.  Multiple imputation in a large-scale complex survey: a practical guide , 2010, Statistical methods in medical research.

[32]  S Greenland,et al.  A critical look at methods for handling missing covariates in epidemiologic regression analyses. , 1995, American journal of epidemiology.