A Statistically Justified Pairwise ML Method for Incomplete Nonnormal Data: A Comparison With Direct ML and Pairwise ADF

This article proposes a new approach to the statistical analysis of pairwise-present covariance structure data. The estimator is based on maximizing the complete data likelihood function, and the associated test statistic and standard errors are corrected for misspecification using Satorra-Bentler corrections. A Monte Carlo study was conducted to compare the proposed method (pairwise maximum likelihood [ML]) to 2 other methods for dealing with incomplete nonnormal data: direct ML estimation with the Yuan-Bentler corrections for nonnormality (direct ML) and the asymptotically distribution free (ADF) method applied to available cases (pairwise ADF). Data were generated from a 4-factor model with 4 indicators per factor; sample size varied from 200 to 5,000; data were either missing completely at random (MCAR) or missing at random (MAR); and the proportion of missingness was either 15% or 30%. Measures of relative performance included model fit, relative accuracy in parameter estimates and their standard errors, and efficiency of parameter estimates. The results generally favored direct ML over either of the pairwise methods, except at N = 5,000, when ADF outperformed both ML methods with MAR data. The inferior performance of the 2 pairwise methods was primarily due to inflated test statistics. Among the unexpected findings was that ADF did better at estimating factor covariances in all conditions, and that MCAR data presented more problems for all methods than did MAR data, in terms of convergence, performance of test statistics, and relative accuracy of parameter estimates.

[1]  M. Browne Asymptotically distribution-free methods for the analysis of covariance structures. , 1984, The British journal of mathematical and statistical psychology.

[2]  Craig K. Enders,et al.  Using an EM Covariance Matrix to Estimate Structural Equation Models With Missing Data: Choosing an Adjusted Sample Size to Improve the Accuracy of Inferences , 2004 .

[3]  Roderick J. A. Little,et al.  Statistical Analysis with Missing Data: Little/Statistical Analysis with Missing Data , 2002 .

[4]  Peter M. Bentler,et al.  A Comparison of Maximum-Likelihood and Asymptotically Distribution-Free Methods of Treating Incomplete Nonnormal Data , 2003 .

[5]  Tron Foss,et al.  The Performance of ML, GLS, and WLS Estimation in Structural Equation Modeling Under Conditions of Misspecification and Nonnormality , 2000 .

[6]  J. Magnus,et al.  Matrix Differential Calculus with Applications in Statistics and Econometrics , 2019, Wiley Series in Probability and Statistics.

[7]  Peter C. M. Molenaar,et al.  A comparison of four methods of calculating standard errors of maximum likelihood estimates in the analysis of covariance structure. , 1991 .

[8]  Craig K. Enders,et al.  The Relative Performance of Full Information Maximum Likelihood Estimation for Missing Data in Structural Equation Models , 2001 .

[9]  T. Micceri The unicorn, the normal curve, and other improbable creatures. , 1989 .

[10]  C. D. Vale,et al.  Simulating multivariate nonnormal distributions , 1983 .

[11]  Craig K. Enders,et al.  The impact of nonnormality on full information maximum-likelihood estimation for structural equation models with missing data. , 2001, Psychological methods.

[12]  Y Kano,et al.  Can test statistics in covariance structure analysis be trusted? , 1992, Psychological bulletin.

[13]  Albert Satorra,et al.  Model Conditions for Asymptotic Robustness in the Analysis of Linear Relations , 1990 .

[14]  J. Magnus,et al.  Matrix Differential Calculus with Applications in Statistics and Econometrics (Revised Edition) , 1999 .

[15]  A. Shapiro,et al.  Robustness of normal theory methods in the analysis of linear latent variate models. , 1988 .

[16]  T. Dijkstra,et al.  Least-squares theory based on general distributional assumptions with an application to the incomplete observations problem , 1985 .

[17]  A. Satorra,et al.  Corrections to test statistics and standard errors in covariance structure analysis. , 1994 .

[18]  Peter M. Bentler,et al.  Tests of homogeneity of means and covariance matrices for multivariate incomplete data , 2002 .

[19]  D P MacKinnon,et al.  Maximizing the Usefulness of Data Obtained with Planned Missing Value Patterns: An Application of Maximum Likelihood Procedures. , 1996, Multivariate behavioral research.

[20]  S. West,et al.  The robustness of test statistics to nonnormality and specification error in confirmatory factor analysis. , 1996 .

[21]  Bengt Muthén,et al.  On structural equation modeling with data that are not missing completely at random , 1987 .

[22]  Allen I. Fleishman A method for simulating non-normal distributions , 1978 .

[23]  P. Bentler,et al.  ML Estimation of Mean and Covariance Structures with Missing Data Using Complete Data Routines , 1999 .

[24]  V. Willson,et al.  Effects of Nonnormal Data on Parameter Estimates and Fit Indices for a Model with Latent and Manifest Variables: An Empirical Study. , 1996 .

[25]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[26]  Herbert W. Marsh,et al.  Pairwise Deletion for Missing Data in Structural Equation Models: Nonpositive Definite Matrices, Parameter Estimates, Goodness of Fit, and Adjusted Sample Sizes. , 1998 .

[27]  Roger L. Brown Efficacy of the indirect approach for estimating structural equation models with missing data: A comparison of five methods , 1994 .

[28]  Peter M. Bentler,et al.  Treatments of Missing Data: A Monte Carlo Comparison of RBHDI, Iterative Stochastic Regression Imputation, and Expectation-Maximization , 2000 .

[29]  K. Yuan,et al.  5. Three Likelihood-Based Methods for Mean and Covariance Structure Analysis with Nonnormal Missing Data , 2000 .

[30]  Kenneth A. Bollen,et al.  Monte Carlo Experiments: Design and Implementation , 2001 .

[31]  James L. Arbuckle,et al.  Full Information Estimation in the Presence of Incomplete Data , 1996 .

[32]  T. W. Anderson,et al.  The asymptotic normal distribution of estimators in factor analysis under general conditions , 1988 .

[33]  A. Satorra,et al.  Scaled test statistics and robust standard errors for non-normal data in covariance structure analysis: a Monte Carlo study. , 1991, The British journal of mathematical and statistical psychology.