Stochastic search item selection for factor analytic models.

In this paper we implement a Markov chain Monte Carlo algorithm based on the stochastic search variable selection method of George and McCulloch (1993) for identifying promising subsets of manifest variables (items) for factor analysis models. The suggested algorithm is constructed by embedding in the usual factor analysis model a normal mixture prior for the model loadings with latent indicators used to identify not only which manifest variables should be included in the model but also how each manifest variable is associated with each factor. We further extend the suggested algorithm to allow for factor selection. We also develop a detailed procedure for the specification of the prior parameters values based on the practical significance of factor loadings using ideas from the original work of George and McCulloch (1993). A straightforward Gibbs sampler is used to simulate from the joint posterior distribution of all unknown parameters and the subset of variables with the highest posterior probability is selected. The proposed method is illustrated using real and simulated data sets.

[1]  Andrew Thomas,et al.  The BUGS project: Evolution, critique and future directions , 2009, Statistics in medicine.

[2]  K. Holzinger,et al.  A study in factor analysis : the stability of a bi-factor solution , 1939 .

[3]  Nengjun Yi,et al.  Stochastic search variable selection for identifying multiple quantitative trait loci. , 2003, Genetics.

[4]  P. Dellaportas,et al.  Stochastic search variable selection for log-linear models , 2000 .

[5]  A. Raftery Bayesian Model Selection in Social Research , 1995 .

[6]  James G. Scott,et al.  Bayes and empirical-Bayes multiplicity adjustment in the variable-selection problem , 2010, 1011.2333.

[7]  Alan J. Miller Subset Selection in Regression , 1992 .

[8]  Petros Dellaportas,et al.  Joint Specification of Model Space and Parameter Space Prior Distributions , 2012, 1207.5651.

[9]  Alan J. Miller Sélection of subsets of regression variables , 1984 .

[10]  Tomohiro Ando,et al.  Bayesian factor analysis with fat-tailed factors and its exact marginal likelihood , 2009, J. Multivar. Anal..

[11]  M. Clyde,et al.  Mixtures of g Priors for Bayesian Variable Selection , 2008 .

[12]  J. Geweke,et al.  Measuring the pricing error of the arbitrage pricing theory , 1996 .

[13]  M. West,et al.  Bayesian Dynamic Factor Models and Portfolio Allocation , 2000 .

[14]  On Bayesian estimation in unrestricted factor analysis , 1978 .

[15]  David B. Dunson,et al.  Efficient Bayesian Model Averaging in Factor Analysis , 2006 .

[16]  Michael A. West,et al.  BAYESIAN MODEL ASSESSMENT IN FACTOR ANALYSIS , 2004 .

[17]  Akira Harada,et al.  Stepwise variable selection in factor analysis , 2000 .

[18]  R. P. McDonald,et al.  Bayesian estimation in unrestricted factor analysis: A treatment for heywood cases , 1975 .

[19]  Edward I. George,et al.  Empirical Bayes vs. Fully Bayes Variable Selection , 2008 .

[20]  Sik-Yum Lee,et al.  Bayesian Selection on the Number of Factors in a Factor Analysis Model , 2002 .

[21]  J. Berger,et al.  Optimal predictive model selection , 2004, math/0406464.

[22]  Jelte M. Wicherts,et al.  Testing Measurement Invariance in the Target Rotated Multigroup Exploratory Factor Model , 2009 .

[23]  Adrian E. Raftery,et al.  Bayesian Model Selection in Structural Equation Models , 1992 .

[24]  E. Fokoue Stochastic Determination of the Intrinsic Structure in Bayesian Factor Analysis , 2004 .

[25]  E. George,et al.  Journal of the American Statistical Association is currently published by American Statistical Association. , 2007 .

[26]  Y. Kano Selection of Manifest Variables , 2007 .