A Comparison of Maximum-Likelihood and Asymptotically Distribution-Free Methods of Treating Incomplete Nonnormal Data

This article describes a Monte Carlo study of 2 methods for treating incomplete nonnormal data. Skewed, kurtotic data sets conforming to a single structured model, but varying in sample size, percentage of data missing, and missing-data mechanism, were produced. An asymptotically distribution-free available-case (ADFAC) method and structured-model expectation-maximization (EM) with nonnormality corrections were applied to these data sets, and the 2 methods were then compared in terms of bias in parameter estimates, bias in standard-error estimates, efficiency of parameter estimates, and model chi-squares. The results favored the nonnormality corrected EM over the ADFAC method in almost all respects, the only important exceptions involving (a) bias in standard-error estimates with large samples and (b) mixed results with respect to the efficiency of parameter estimates.

[1]  Herbert W. Marsh,et al.  Pairwise Deletion for Missing Data in Structural Equation Models: Nonpositive Definite Matrices, Parameter Estimates, Goodness of Fit, and Adjusted Sample Sizes. , 1998 .

[2]  S. West,et al.  The robustness of test statistics to nonnormality and specification error in confirmatory factor analysis. , 1996 .

[3]  Sik-Yum Lee,et al.  Estimation for structural equation models with missing data , 1986 .

[4]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[5]  Peter M. Bentler,et al.  EQS : structural equations program manual , 1989 .

[6]  Roger L. Brown Efficacy of the indirect approach for estimating structural equation models with missing data: A comparison of five methods , 1994 .

[7]  Peter M. Bentler,et al.  Treatments of Missing Data: A Monte Carlo Comparison of RBHDI, Iterative Stochastic Regression Imputation, and Expectation-Maximization , 2000 .

[8]  D P MacKinnon,et al.  Maximizing the Usefulness of Data Obtained with Planned Missing Value Patterns: An Application of Maximum Likelihood Procedures. , 1996, Multivariate behavioral research.

[9]  R. Carroll,et al.  A Note on the Efficiency of Sandwich Covariance Matrix Estimation , 2001 .

[10]  Clifford C. Clogg,et al.  Handbook of Statistical Modeling for the Social and Behavioral Sciences. , 1995 .

[11]  T. Dijkstra,et al.  Least-squares theory based on general distributional assumptions with an application to the incomplete observations problem , 1985 .

[12]  John L.P. Thompson,et al.  Missing data , 2004, Amyotrophic lateral sclerosis and other motor neuron disorders : official publication of the World Federation of Neurology, Research Group on Motor Neuron Diseases.

[13]  Donald B. Rubin,et al.  EM and beyond , 1991 .

[14]  Craig K. Enders,et al.  The impact of nonnormality on full information maximum-likelihood estimation for structural equation models with missing data. , 2001, Psychological methods.

[15]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[16]  K. Yuan,et al.  5. Three Likelihood-Based Methods for Mean and Covariance Structure Analysis with Nonnormal Missing Data , 2000 .

[17]  James L. Arbuckle,et al.  Full Information Estimation in the Presence of Incomplete Data , 1996 .

[18]  J. Schafer,et al.  Missing data: our view of the state of the art. , 2002, Psychological methods.

[19]  Carl T. Finkbeiner Estimation for the multiple factor model when data are missing , 1979 .

[20]  David E. Booth,et al.  Analysis of Incomplete Multivariate Data , 2000, Technometrics.

[21]  Y Kano,et al.  Can test statistics in covariance structure analysis be trusted? , 1992, Psychological bulletin.

[22]  Craig K. Enders,et al.  The Relative Performance of Full Information Maximum Likelihood Estimation for Missing Data in Structural Equation Models , 2001 .

[23]  Michael E. Sobel,et al.  Pseudo-Maximum Likelihood Estimation of Mean and Covariance Structures with Missing Data , 1990 .

[24]  Roderick J. A. Little,et al.  The Analysis of Social Science Data with Missing Values , 1989 .

[25]  Jürgen Baumert,et al.  Modeling longitudinal and multilevel data: Practical issues, applied approaches, and specific examples. , 2000 .

[26]  P. Bentler,et al.  ML Estimation of Mean and Covariance Structures with Missing Data Using Complete Data Routines , 1999 .

[27]  W. Wothke Longitudinal and multigroup modeling with missing data. , 2000 .

[28]  M. Browne Asymptotically distribution-free methods for the analysis of covariance structures. , 1984, The British journal of mathematical and statistical psychology.

[29]  A. Satorra,et al.  Corrections to test statistics and standard errors in covariance structure analysis. , 1994 .

[30]  Richard Staelin,et al.  A proposal for handling missing data , 1975 .