A primer on the use of modern missing-data methods in psychosomatic medicine research.

This paper summarizes recent methodological advances related to missing data and provides an overview of two "modern" analytic options: direct maximum likelihood (DML) estimation and multiple imputation (MI). The paper begins with an overview of missing-data theory, as explicated by Rubin. Traditional missing-data techniques are described briefly, and DML and MI are outlined in greater detail; special attention is given to an "inclusive" analytic strategy that incorporates auxiliary variables into the analytic model. The paper concludes with an illustrative analysis of an artificial quality-of-life data set. Computer code for all DML and MI analyses is provided, and the inclusion of auxiliary variables is illustrated.
