Multivariate Statistical Analysis

Classical multivariate statistical methods concern models, distributions and inference based on the Gaussian distribution. These are the topics in the first textbook for mathematical statisticians by T.W. Anderson that was published in 1958 and that appeared as a slightly expanded 3rd edition in 2003. Matrix theory and notation is used there extensively to efficiently derive properties of the multivariate Gaussian or the Wishart distribution, of principal components, of canonical correlation and discriminant analysis and of the general multivariate linear model in which a Gaussian response vector variable Ya has linear least-squares regression on all components of an explanatory vector variable Yb. In contrast, many methods for analysing sets of observed variables have been developed first within special substantive fields and some or all of the models in a given class were justified in terms of probabilistic and statistical theory much later. Among them are factor analysis, path analysis, structural equation models, and models for which partial-least squares estimation have been proposed. Other multivariate techniques such as cluster analysis and multidimensional scaling have been often used, but the result of such an analysis cannot be formulated as a hypothesis to be tested in a new study and satisfactory theoretical justifications are still lacking. Factor analysis was proposed by psychologist C. Spearman (1904), (1926) and, at the time, thought of as a tool for measuring human intelligence. Such a model has one or several latent variables. These are hidden or unobserved and are to explain the observed correlations among a set of observed variables, called items in that context. The difficult task is to decide how many and which of a possibly large set of items to include into a model. But, given a set of latent variables, a classical factor analysis model specifies for a joint Gaussian distribution mutual independence of the observed variables given the latent variables. This can be recognized to be one special type of a graphical Markov model; see Cox and Wermuth (1996), Edwards (2000), Lauritzen (1996), Whittaker (1990).

[1]  A. Goldberger STRUCTURAL EQUATION METHODS IN THE SOCIAL SCIENCES , 1972 .

[2]  Nanny Wermuth,et al.  On association models defined over independence graphs , 1998 .

[3]  N. Wermuth,et al.  Matrix representations and independencies in directed acyclic graphs , 2009, 0904.0333.

[4]  S. Lauritzen,et al.  The TM algorithm for maximising a conditional likelihood function , 2001 .

[5]  O. D. Duncan Path Analysis: Sociological Examples , 1966, American Journal of Sociology.

[6]  J. Darroch,et al.  Generalized Iterative Scaling for Log-Linear Models , 1972 .

[7]  Eric R. Ziegel,et al.  Generalized Linear Models , 2002, Technometrics.

[8]  S. Wright,et al.  The Theory of Path Coefficients a Reply to Niles's Criticism. , 1923, Genetics.

[9]  M. Drton Discrete chain graph models , 2009, 0909.0843.

[10]  M. Frydenberg,et al.  A modified iterative proportional scaling algorithm for estimation for in regular exponential families , 1989 .

[11]  John A. Nelder,et al.  Generalized linear models. 2nd ed. , 1993 .

[12]  S. Johansen Cointegration: Overview and Development , 2009 .

[13]  T. Richardson,et al.  Multimodality of the likelihood in the bivariate seemingly unrelated regressions model , 2004 .

[14]  K. Jöreskog Analysis of Covariance Structures , 1988 .

[15]  Rex B. Kline,et al.  Principles and Practice of Structural Equation Modeling , 1998 .

[16]  Michael I. Jordan Graphical Models , 1998 .

[17]  H. Wold Causality and Econometrics , 1954 .

[18]  C. Spearman,et al.  "THE ABILITIES OF MAN". , 1928, Science.

[19]  M. Frydenberg,et al.  Decomposition of maximum likelihood in mixed graphical interaction models , 1989 .

[20]  K. Jöreskog A General Method for Estimating a Linear Structural Equation System. , 1970 .

[21]  Graham J. Wills,et al.  Introduction to graphical modelling , 1995 .

[22]  Nanny Wermuth,et al.  An approximation to maximum likelihood estimates in reduced models , 1990 .

[23]  G. M. Marchetti,et al.  Chain graph models of multivariate regression type for categorical data , 2009, 0906.2098.

[24]  Herman Wold,et al.  Model Construction and Evaluation When Theoretical Knowledge Is Scarce , 1980 .

[25]  Flat and Multimodal Likelihoods and Model Lack of Fit in Curved Exponential Families , 2010 .

[26]  N. Wermuth,et al.  On Substantive Research Hypotheses, Conditional Independence Graphs and Graphical Chain Models , 1990 .

[27]  N. Wermuth,et al.  Linear Dependencies Represented by Chain Graphs , 1993 .

[28]  P. Spirtes,et al.  Ancestral graph Markov models , 2002 .

[29]  S. Haberman,et al.  The analysis of frequency data , 1974 .

[30]  Nanny Wermuth,et al.  Multivariate Dependencies: Models, Analysis and Interpretation , 1996 .

[31]  A. Zellner An Efficient Method of Estimating Seemingly Unrelated Regressions and Tests for Aggregation Bias , 1962 .

[32]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[33]  Dan Geiger,et al.  Identifying independence in bayesian networks , 1990, Networks.

[34]  C. Spearman General intelligence Objectively Determined and Measured , 1904 .

[35]  Michael Eichler,et al.  Computing Maximum Likelihood Estimates in Recursive Linear Models with Correlated Errors , 2009, J. Mach. Learn. Res..

[36]  N. Wermuth,et al.  Graphical Models for Associations between Variables, some of which are Qualitative and some Quantitative , 1989 .

[37]  Econometric Modeling: A Likelihood Approach , 2007 .

[38]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[39]  N. Wermuth,et al.  Triangular systems for symmetric binary variables , 2009 .

[40]  P. McCullagh,et al.  Multivariate Logistic Models , 1995 .

[41]  Stephen E. Fienberg,et al.  Discrete Multivariate Analysis: Theory and Practice , 1976 .

[42]  Leo A. Goodman,et al.  Analyzing Qualitative Categorical Data. , 1979 .

[43]  Rolf Sundberg The convergence rate of the TM algorithm of Edwards and Lauritzen by , 2002 .

[44]  N. Wermuth PROBABILITY DISTRIBUTIONS WITH SUMMARY GRAPH STRUCTURE , 2010, 1003.3259.

[45]  T. Haavelmo The Statistical Implications of a System of Simultaneous Equations , 1943 .

[46]  S. Lauritzen The EM algorithm for graphical association models with missing data , 1995 .

[47]  Clive W. J. Granger,et al.  Long-Run Economic Relationships: Readings in Cointegration , 1991 .

[48]  T. Lancaster Introduction to Structural Equation Models , 1976 .

[49]  A. N. Tikhonov,et al.  The regularization of ill-posed problems , 1963 .

[50]  N. Wermuth,et al.  Joint response graphs and separation induced by triangular systems , 2004 .

[51]  H. Kiiveri An incomplete data approach to the analysis of covariance structures , 1987 .

[52]  Anders Björkström,et al.  A Generalized View on Continuum Regression , 1999 .

[53]  S. Johansen STATISTICAL ANALYSIS OF COINTEGRATION VECTORS , 1988 .

[54]  Chenlei Leng,et al.  Unified LASSO Estimation by Least Squares Approximation , 2007 .

[55]  Arthur S. Goldberger,et al.  Econometrics and psychometrics: A survey of communalities , 1971 .