Sparse principal component analysis subject to prespecified cardinality of loadings

Most of the existing procedures for sparse principal component analysis (PCA) use a penalty function to obtain a sparse matrix of weights by which a data matrix is post-multiplied to produce PC scores. In this paper, we propose a new sparse PCA procedure which differs from the existing ones in two ways. First, the new procedure does not sparsify the weight matrix. Instead, the so-called loadings matrix is sparsified by which the score matrix is post-multiplied to approximate the data matrix. Second, the cardinality of the loading matrix i.e., the total number of nonzero loadings, is pre-specified to be an integer without using penalty functions. The procedure is called unpenalized sparse loading PCA (USLPCA). A desirable property of USLPCA is that the indices for the percentages of explained variances can be defined in the same form as in the standard PCA. We develop an alternate least squares algorithm for USLPCA which uses the fact that the PCA loss function can be decomposed as a sum of a term irrelevant to the loadings, and another one being easily minimized under cardinality constraints. A procedure is also presented for selecting the best cardinality using information criteria. The procedures are assessed in a simulation study and illustrated with real data examples.

[1]  H. Akaike A new look at the statistical model identification , 1974 .

[2]  R. Tibshirani,et al.  Sparse Principal Component Analysis , 2006 .

[3]  Jianhua Z. Huang,et al.  Sparse principal component analysis via regularized low rank matrix approximation , 2008 .

[4]  George A. F. Seber,et al.  A matrix handbook for statisticians , 2007 .

[5]  Nickolay T. Trendafilov,et al.  From simple structure to sparse components: a review , 2014, Comput. Stat..

[6]  Tom F. Wilderjans,et al.  A flexible framework for sparse simultaneous component based data integration , 2011, BMC Bioinformatics.

[7]  I. Jolliffe,et al.  A Modified Principal Component Technique Based on the LASSO , 2003 .

[8]  Nickolay T. Trendafilov,et al.  Sparse principal components by semi-partition clustering , 2012, Comput. Stat..

[9]  Ka Yee Yeung,et al.  Principal component analysis for clustering gene expression data , 2001, Bioinform..

[10]  I. Jolliffe Principal Component Analysis , 2002 .

[11]  R. Tibshirani,et al.  A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. , 2009, Biostatistics.

[12]  Alan Julian Izenman,et al.  Modern Multivariate Statistical Techniques , 2008 .

[13]  Yoshio Takane Constrained Principal Component Analysis and Related Techniques , 2013 .

[14]  R. Maronna Alan Julian Izenman (2008): Modern Multivariate Statistical Techniques: Regression, Classification and Manifold Learning , 2011 .

[15]  Alexandre d'Aspremont,et al.  Optimal Solutions for Sparse Principal Component Analysis , 2007, J. Mach. Learn. Res..

[16]  Yurii Nesterov,et al.  Generalized Power Method for Sparse Principal Component Analysis , 2008, J. Mach. Learn. Res..

[17]  N. Trendafilov,et al.  Sparse Versus Simple Structure Loadings , 2015, Psychometrika.

[18]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[19]  Alan Julian Izenman,et al.  Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning , 2008 .

[20]  C. Eckart,et al.  The approximation of one matrix by another of lower rank , 1936 .

[21]  J. N. R. Jeffers,et al.  Two Case Studies in the Application of Principal Component Analysis , 1967 .