Generating Non-normal Data for Simulation of Structural Equation Models Using Mattson's Method

SEM researchers use Monte-Carlo simulations to ascertain the robustness of statistical estimators and the performance of various fit indices under varying conditions of non-normality. The efficacy of these Monte-Carlo simulations is closely related to the generation of non-normal data. Traditionally, SEM researchers have used approaches proposed by Fleishman (1978) and Vale and Maurelli (1983) to generate multivariate non-normal random numbers. However, both approaches do not provide a method to determine univariate skewness and kurtosis of the observed variables when a non-normal distribution is specified for some or all of the latent variables. Mattson (1997) proposed a method for generating data on the latent variables with controlled skewness and kurtosis of the observed variables. We empirically test the applicability of Mattson's theoretical method in a Monte-Carlo simulation. Specifically, we assess the impact of data generation, selection of transformation method, sample size and degree of skewness/kurtosis on the performance of the method. Our results suggest that Mattson's method seems to be a good approach to generate data with defined levels of skewness and kurtosis. In addition, based on the results of our analysis, we provide practicing researchers recommendations regarding their empirical implementation schemes.

[1]  A. Satorra,et al.  Scaled test statistics and robust standard errors for non-normal data in covariance structure analysis: a Monte Carlo study. , 1991, The British journal of mathematical and statistical psychology.

[2]  C. D. Vale,et al.  Simulating multivariate nonnormal distributions , 1983 .

[3]  Pandu R. Tadikamalla,et al.  A Probability Distribution and its Uses in Fitting Data , 1979 .

[4]  G. S. Fishman,et al.  A Statistical Evaluation of Multiplicative Congruential Generators with Modulus 2 Super 31-1. , 1978 .

[5]  Linus Schrage,et al.  A More Portable Fortran Random Number Generator , 1979, TOMS.

[6]  George Marsaglia,et al.  Toward a universal random number generator , 1987 .

[7]  S. Mattson,et al.  How to Generate Non-normal Data for Simulation of Structural Equation Models. , 1997, Multivariate behavioral research.

[8]  Irving W. Burr,et al.  Pameters for a general system of distributions to match a grid of α3 and α4 , 1973 .

[9]  G. Marsaglia,et al.  A New Class of Random Number Generators , 1991 .

[10]  Jeffrey Johannes Hoogland The robustness of estimation methods for covariance structure analysis. , 1999 .

[11]  W. R. Dillon,et al.  Some Results on the Behavior of Alternate Covariance Structure Estimation Procedures in the Presence of Non-Normal Data , 1989 .

[12]  Bruce D. McCullough,et al.  Assessing the Reliability of Statistical Software: Part I , 1998 .

[13]  Allen I. Fleishman A method for simulating non-normal distributions , 1978 .

[14]  James C. Anderson,et al.  The Effects of Sampling Error and Model Characteristics on Parameter Estimation for Maximum Likelihood Confirmatory Factor Analysis. , 1985, Multivariate behavioral research.

[15]  H. Cramér Mathematical methods of statistics , 1947 .

[16]  G. S. Fishman,et al.  A Statistical Evaluation of Multiplicative Congruential Random Number Generators with Modulus 231 — 1 , 1982 .

[17]  Randall E. Schumacker,et al.  Random-Number Generator Validity in Simulation Studies: An Investigation of Normality , 1998 .

[18]  P. A. W. Lewis,et al.  A Pseudo-Random Number Generator for the System/360 , 1969, IBM Syst. J..

[19]  Rex B. Kline,et al.  Principles and Practice of Structural Equation Modeling , 1998 .

[20]  M. Browne Asymptotically distribution-free methods for the analysis of covariance structures. , 1984, The British journal of mathematical and statistical psychology.

[21]  Pandu R. Tadikamalla,et al.  On simulating non-normal distributions , 1980 .