Full Maximum Likelihood Estimation of Polychoric and Polyserial Correlations With Missing Data

This article develops a full maximum likelihood method for obtaining joint estimates of variances and correlations among continuous and polytomous variables with incomplete data which are missing at random with an ignorable missing mechanism. The approach for obtaining the maximum likelihood estimate of the covariance matrix is via a simple confirmatory analysis model with a fixed identity loading matrix and a fixed diagonal matrix with small of unique variances. A Monte Carlo Expectation-Maximization (MCEM) algorithm is constructed to obtain the solution, in which the E-step is approximated by observations simulated by the Gibbs sampler. Results from a simulation study and a real example are provided to illustrate the methodology.

[1]  G. C. Wei,et al.  A Monte Carlo Implementation of the EM Algorithm and the Poor Man's Data Augmentation Algorithms , 1990 .

[2]  Wai-Yin Poon,et al.  Maximum likelihood estimation of multivariate polyserial and polychoric correlation coefficients , 1988 .

[3]  L. Ryan,et al.  Latent Variable Models for Mixed Discrete and Continuous Outcomes , 1997 .

[4]  Xin-Yuan Song,et al.  Bayesian Estimation and Model Selection of Multivariate Linear Model with Polytomous Variables , 2002, Multivariate behavioral research.

[5]  Xiao-Li Meng,et al.  SIMULATING RATIOS OF NORMALIZING CONSTANTS VIA A SIMPLE IDENTITY: A THEORETICAL EXPLORATION , 1996 .

[6]  Xiao-Li Meng,et al.  Fitting Full-Information Item Factor Models and an Empirical Investigation of Bridge Sampling , 1996 .

[7]  S. Lee,et al.  Estimation of multivariate polychoric and polyserial correlations with missing observations. , 1992, The British journal of mathematical and statistical psychology.

[8]  John Geweke,et al.  Efficient Simulation from the Multivariate Normal and Student-t Distributions Subject to Linear Constraints and the Evaluation of Constraint Probabilities , 1991 .

[9]  Accuracy of maximum-likelihood estimates of correlation for a biserial model , 1966 .

[10]  S Y Lee,et al.  Latent variable models with mixed continuous and polytomous data , 2001, Biometrics.

[11]  Ulf Olsson,et al.  Maximum likelihood estimation of the polychoric correlation coefficient , 1979 .

[12]  S Y Lee,et al.  Bayesian estimation and test for factor analysis model with continuous and polytomous data in several populations. , 2001, The British journal of mathematical and statistical psychology.

[13]  P M Bentler,et al.  A two-stage estimation of structural equation models with continuous and polytomous variables. , 1995, The British journal of mathematical and statistical psychology.

[14]  Nicole A. Lazar,et al.  Statistical Analysis With Missing Data , 2003, Technometrics.

[15]  S Y Lee,et al.  Statistical analysis of nonlinear structural equation models with continuous and polytomous data. , 2000, The British journal of mathematical and statistical psychology.

[16]  A L Gross Bayesian Interval Estimation of Multiple Correlations with Missing Data: A Gibbs Sampling Approach , 2000, Multivariate behavioral research.

[17]  R. Scheines,et al.  Bayesian estimation and testing of structural equation models , 1999 .

[18]  T. Louis Finding the Observed Information Matrix When Using the EM Algorithm , 1982 .

[19]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  S Y Lee,et al.  Hypothesis Testing and Model Comparison in Two-level Structural Equation Models , 2001, Multivariate behavioral research.

[21]  Anja Vogler,et al.  An Introduction to Multivariate Statistical Analysis , 2004 .

[22]  N. Cox Estimation of the correlation between a continuous and a discrete variable. , 1974, Biometrics.

[23]  Stan Lipovetsky,et al.  Latent Variable Models and Factor Analysis , 2001, Technometrics.

[24]  J A Stein,et al.  The effects of establishment practices, knowledge and attitudes on condom use among Filipina sex workers. , 1998, AIDS care.

[25]  T. W. Anderson An Introduction to Multivariate Statistical Analysis , 1959 .

[26]  Bradley Efron,et al.  Missing Data, Imputation, and the Bootstrap , 1994 .

[27]  Irini Moustaki,et al.  A Latent Variable Model for Ordinal Variables , 2000 .

[28]  Peter J. Cameron,et al.  Rank three permutation groups with rank three subconstituents , 1985, J. Comb. Theory, Ser. B.

[29]  David E. Booth,et al.  Analysis of Incomplete Multivariate Data , 2000, Technometrics.

[30]  C. Robert Simulation of truncated normal variables , 2009, 0907.4010.

[31]  Wai-Yin Poon,et al.  Maximum likelihood estimation of polyserial correlations , 1986 .

[32]  U. Olsson On The Robustness Of Factor Analysis Against Crude Classification Of The Observations. , 1979, Multivariate behavioral research.

[33]  Jian Qing Shi,et al.  Maximum Likelihood Estimation of Two‐Level Latent Variable Models with Mixed Continuous and Polytomous Data , 2001 .