Evaluating Univariate, Bivariate, and Multivariate Normality Using Graphical Procedures.

This paper reviews graphical and non-graphical procedures for evaluating multivariate normality by guiding the reader through univariate and bivariate procedures that are necessary, but insufficient, indications of a multivariate normal distribution. A data set utilizing three dependent variables for two groups provided by George and Mallery (1999) is used to analyze histograms, stem-and-leaf plots, box-and-whisker plots, kurtosis and skewness coefficients, Q-Q plots, the Shapiro-Wilk or Kolmogorov-Smirnov statistic, and bivariate scatterplots. A procedure programmed by Thompson (1990, 1997) is used to explore multivariate normality by plotting Mahalanobis distances against derived chi-square values in a scatterplot. Evaluating Normality 3 Evaluating Univariate, Bivariate, and Multivariate Normality Using Graphical Procedures Reality is complex. Over time, researchers in the social sciences have become increasingly aware that simple univariate methods comparing an experimental group with a control group on a single dependent variable are inadequate to meet the needs of the complex phenomena that dominate educational and psychological research. In the majority of social science research, two or more dependent variables are necessary, because nearly every effect has multiple causes and nearly every cause has multiple effects. Even when studying a single construct, such as self-concept, it is often helpful to use multiple tools to measure elusive constructs (called "multi-operationalizing"). In a methodological shift that increasingly emphasizes honoring the complexity of reality, Grimm and Yarnold (1995) reported that the use of multivariate statistics in research has accelerated in the last 20 years and that it is difficult to find empirically based research articles that do not employ one or more multivariate analyses. In a comparison of the 1976 and 1992 volumes of the Journal of Consulting and Clinical Psychology (JCCP) and the Journal of Personality and Social Psychology (JPSP), Grimm and Yamold found that the use of multivariate statistics in JCCP increased from 9% to 67% in that 16 year period. For JPSP, the use of multivariate statistics increased from 16% to 57% in the same time frame. In an exhaustive study of statistical methods used in studies published in three major social science journals during a 16-year period, Emmons, Stallings, and Lane (1990) found increases for MANOVA, MANCOVA, multiple regression, and multiple correlation methods. For example, the use of MANOVA increased from 44% of the studies published from 1977 to 1981 to 54% of the studies published from 1982 to 1987. Emmons, Stallings, and Lane (1990, p. 14) concluded that "the

[1]  Leland Wilkinson,et al.  Statistical Methods in Psychology Journals Guidelines and Explanations , 2005 .

[2]  G. Box Some Theorems on Quadratic Forms Applied in the Study of Analysis of Variance Problems, I. Effect of Inequality of Variance in the One-Way Classification , 1954 .

[3]  John Cotton,et al.  Statistical analysis in psychology and education. 4th ed. , 1976 .

[4]  B. Thompson Problems with Multivariate Normality: Can the Multivariate Bootstrap Help?. , 1996 .

[5]  Marsha P. Johnson Statistical Methods for Health Care Research , 1996 .

[6]  G. Box,et al.  A general distribution theory for a class of likelihood criteria. , 1949, Biometrika.

[7]  T. Gerig Multivariate Analysis: Techniques for Educational and Psychological Research , 1975 .

[8]  J. Stevens Applied Multivariate Statistics for the Social Sciences , 1986 .

[9]  Richard A. Johnson,et al.  Applied Multivariate Statistical Analysis , 1983 .

[10]  S. Shapiro,et al.  A Comparative Study of Various Tests for Normality , 1968 .

[11]  D. George,et al.  SPSS for Windows step by step , 1998 .

[12]  J. Stevens,et al.  Applied multivariate statistics for the social sciences, 4th ed. , 2002 .

[13]  D. A. Wolf Recent advances in descriptive multivariate analysis , 1996 .

[14]  W. J. Krzanowski,et al.  Recent Advances in Descriptive Multivariate Analysis. , 1996 .

[15]  L. Fish Why Multivariate Methods are Usually Vital , 1988 .

[16]  Frederick Mosteller,et al.  Data Analysis and Regression , 1978 .

[17]  Alyce S. Ashcraft Ways To Evaluate the Assumption of Multivariate Normality. , 1998 .

[18]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[19]  Paul R. Yarnold,et al.  Reading and Understanding Multivariate Statistics , 1995 .

[20]  J. Gower,et al.  Methods for statistical data analysis of multivariate observations , 1977, A Wiley publication in applied statistics.

[21]  Stephen G. West,et al.  Structural equation models with non-normal variables: Problems and remedies , 1995 .

[22]  Wren Bump The Normal Curve Takes Many Forms: A Review of Skewness and Kurtosis. , 1991 .

[23]  B. Tabachnick,et al.  Using Multivariate Statistics , 1983 .

[24]  James V. Bradley,et al.  The insidious L-shaped distribution. , 1982 .

[25]  Randall MacIniosh A Program Implementing Mardia's Multivariate Normality Test for Use in Structural Equation Modeling with Latent Variables , 1997 .

[26]  Samuel T. Mayo,et al.  Statistical methods in education and psychology , 1979 .

[27]  Bruce Thompson,et al.  Multinor: A Fortran Program that Assists in Evaluating Multivariate Normality , 1990 .