Inference from Multiple Imputation for Missing Data Using Mixtures of Normals.

We consider two difficulties with standard multiple imputation methods for missing data based on Rubin's t method for confidence intervals: their often excessive width, and their instability. These problems are present most often when the number of copies is small, as is often the case when a data collection organization is making multiple completed datasets available for analysis. We suggest using mixtures of normals as an alternative to Rubin's t. We also examine the performance of improper imputation methods as an alternative to generating copies from the true posterior distribution for the missing observations. We report the results of simulation studies and analyses of data on health-related quality of life in which the methods suggested here gave narrower confidence intervals and more stable inferences, especially with small numbers of copies or non-normal posterior distributions of parameter estimates. A free R software package called MImix that implements our methods is available from CRAN.

[1]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[2]  T. Ferguson BAYESIAN DENSITY ESTIMATION BY MIXTURES OF NORMAL DISTRIBUTIONS , 1983 .

[3]  J. Ware,et al.  International quality of life assessment (IQOLA) project , 1992, Quality of Life Research.

[4]  H. Chernoff,et al.  Recent advances in statistics : papers in honor of Herman Chernoff on his sixtieth birthday , 1983 .

[5]  H. Carabin,et al.  Comparison of methods to analyse imprecise faecal coliform count data from environmental samples , 2001, Epidemiology and Infection.

[6]  R. Fay Alternative Paradigms for the Analysis of Imputed Survey Data , 1996 .

[7]  L. Wasserman,et al.  Practical Bayesian Density Estimation Using Mixtures of Normals , 1997 .

[8]  M. West Approximating posterior distributions by mixtures , 1993 .

[9]  P M Bentler,et al.  Use of structural equation modeling to test the construct validity of the SF-36 Health Survey in ten countries: results from the IQOLA Project. International Quality of Life Assessment. , 1998, Journal of clinical epidemiology.

[10]  Joseph L Schafer,et al.  Inference with Imputed Conditional Means , 2000 .

[11]  J. Geweke,et al.  Measuring the pricing error of the arbitrage pricing theory , 1996 .

[12]  Adrian E. Raftery,et al.  Model-Based Clustering, Discriminant Analysis, and Density Estimation , 2002 .

[13]  D. Rubin,et al.  MULTIPLE IMPUTATIONS IN SAMPLE SURVEYS-A PHENOMENOLOGICAL BAYESIAN APPROACH TO NONRESPONSE , 2002 .

[14]  J E Ware,et al.  Overview of the SF-36 Health Survey and the International Quality of Life Assessment (IQOLA) Project. , 1998, Journal of clinical epidemiology.

[15]  M Sullivan,et al.  The factor structure of the SF-36 Health Survey in 10 countries: results from the IQOLA Project. International Quality of Life Assessment. , 1998, Journal of clinical epidemiology.

[16]  D. Rubin,et al.  Small-sample degrees of freedom with multiple imputation , 1999 .

[17]  Michael A. West,et al.  BAYESIAN MODEL ASSESSMENT IN FACTOR ANALYSIS , 2004 .

[18]  Julie Jomeen,et al.  The factor structure of the SF-36 in early pregnancy. , 2005, Journal of psychosomatic research.

[19]  M. Escobar,et al.  Bayesian Density Estimation and Inference Using Mixtures , 1995 .

[20]  J. N. K. Rao,et al.  On Balanced Half-Sample Variance Estimation in Stratified Random Sampling , 1996 .

[21]  Joseph L Schafer,et al.  Analysis of Incomplete Multivariate Data , 1997 .

[22]  G. C. Wei,et al.  A Monte Carlo Implementation of the EM Algorithm and the Poor Man's Data Augmentation Algorithms , 1990 .

[23]  D. Rubin,et al.  Statistical Analysis with Missing Data , 1988 .

[24]  Xiao-Li Meng,et al.  Missing Data: Dial M for ??? , 2000 .

[25]  D. Rubin Multiple Imputation After 18+ Years , 1996 .

[26]  Roger A. Sugden,et al.  Multiple Imputation for Nonresponse in Surveys , 1988 .

[27]  Chia-huei Wu,et al.  Examining the hierarchical factor structure of the SF-36 Taiwan version by exploratory and confirmatory factor analysis. , 2007, Journal of evaluation in clinical practice.

[28]  J. Robins,et al.  Inference for imputation estimators , 2000 .