Learning Factor Patterns in Exploratory Factor Analysis Using the Genetic Algorithm and Information Complexity as the Fitness Function

This paper presents a novel data-adaptive expert approach to determining the best factor pattern structure in exploratory factor analysis (EFA) models using a clever genetic algorithm (GA) hy- bridized with information theoretic complexity (ICOMP) criterion as the fitness function. These factor pattern structures from EFA model could then be utilized for various inductive inferences for example to study substantive hypotheses of researchers in the light of the data or could be used as data-adaptive prior information in a Bayesian Factor Analysis (BFA) model among other uses. Numerical applications are shown on a large scale Monte Carlo study and on a real benchmark data set to demonstrate the versatility of ICOMP, Akaike's Information Criterion (AIC) as the fitness functions in GA, in two types of modeling problems: (i) Choosing the number of factors in EFA, and (ii) determining the best factor pattern structure in EFA models in one expert system.

[1]  H. Akaike Factor analysis and AIC , 1987 .

[2]  R. Cudeck,et al.  Applications of standard error estimates in unrestricted factor analysis: significance tests for factor loadings and correlations. , 1994, Psychological bulletin.

[3]  D. Child,et al.  The essentials of factor analysis, 2nd ed. , 1990 .

[4]  Herman Rubin,et al.  Statistical Inference in Factor Analysis , 1956 .

[5]  H. Akaike,et al.  Information Theory and an Extension of the Maximum Likelihood Principle , 1973 .

[6]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[7]  B. Everitt,et al.  Multivariate Exploratory Data Analysis: A Perspective on Exploratory Factor Analysis. , 1988 .

[8]  Glenn Gamst,et al.  Applied Multivariate Research: Design and Interpretation , 2005 .

[9]  Michael A. West,et al.  BAYESIAN MODEL ASSESSMENT IN FACTOR ANALYSIS , 2004 .

[10]  Donald E. Ramirez,et al.  An Expert Model Selection Approach to Determine the “Best” Pattern Structure in Factor Analysis Models , 1987 .

[11]  Kenneth A. Bollen,et al.  Structural Equations with Latent Variables , 1989 .

[12]  R. Cattell The Scree Test For The Number Of Factors. , 1966, Multivariate behavioral research.

[13]  B. Tabachnick,et al.  Using Multivariate Statistics , 1983 .

[14]  H. Kaiser A NOTE ON GUTTMAN'S LOWER BOUND FOR THE NUMBER OF COMMON FACTORS1 , 1961 .

[15]  Hamparsum Bozdogan,et al.  Multivariate Regressions, Genetic Algorithms, and Information Complexity: A Three Way Hybrid , 2002 .

[16]  G. Thomson,et al.  Methods of Estimating Mental Factors , 1938, Nature.

[17]  J. Stevens,et al.  Applied multivariate statistics for the social sciences, 4th ed. , 2002 .

[18]  van M.H. Emden,et al.  An analysis of complexity , 1971 .

[19]  Hamparsum Bozdogan,et al.  Intelligent Statistical Data Mining with Information Complexity and Genetic Algorithms , 2004 .

[20]  Stanley A. Mulaik,et al.  Linear Causal Modeling with Structural Equations , 2009 .

[21]  Fionn Murtagh,et al.  Multivariate methods for data analysis , 1993 .

[22]  H. Bozdogan,et al.  Bayesian Factor Analysis Model and Choosing the Number of Factors Using a New Informational Complexity Criterion , 1998 .

[23]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[24]  J. Stevens Applied Multivariate Statistics for the Social Sciences , 1986 .

[25]  J. Horn A rationale and test for the number of factors in factor analysis , 1965, Psychometrika.

[26]  H. Bozdogan Model selection and Akaike's Information Criterion (AIC): The general theory and its analytical extensions , 1987 .

[27]  Margaret A. Nemeth,et al.  Applied Multivariate Methods for Data Analysis , 1998, Technometrics.

[28]  H. Kaiser The Application of Electronic Computers to Factor Analysis , 1960 .

[29]  Howard B. Lee,et al.  A First Course in Factor Analysis 2nd Ed , 1973 .

[30]  H. Bozdogan,et al.  Akaike's Information Criterion and Recent Developments in Information Complexity. , 2000, Journal of mathematical psychology.