Low-Rank Principal Eigenmatrix Analysis

Sparse PCA is a widely used technique for high-dimensional data analysis. In this paper, we propose a new method called low-rank principal eigenmatrix analysis. Unlike sparse PCA, the dominant eigenvectors are allowed to be dense but are assumed to have a low-rank structure when matricized appropriately. Such structure arises naturally in practice: for example, the top eigenvector of a circulant matrix, when matricized appropriately, is a rank-1 matrix. We develop a matricized rank-truncated power method that can be implemented efficiently, and we establish its computational and statistical properties. Extensive experiments on synthetic data sets demonstrate the competitive empirical performance of our method.
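To make the two central objects concrete, the following minimal NumPy sketch first verifies the circulant fact stated above, and then implements one plausible reading of a matricized rank-truncated power method: a standard power step whose iterate is matricized, projected onto the set of rank-r matrices via a truncated SVD, and renormalized. The function name, the spiked test matrix, and all algorithmic details beyond this description are illustrative assumptions, not the paper's exact procedure.

```python
import numpy as np
from scipy.linalg import circulant

rng = np.random.default_rng(0)
n1, n2 = 6, 8
n = n1 * n2

# The circulant fact: the DFT vectors are the eigenvectors of every
# circulant matrix, and each one matricizes to a rank-1 matrix.
C = circulant(rng.standard_normal(n))            # C[i, k] = c[(i - k) % n]
lam = np.fft.fft(C[:, 0])                        # eigenvalues of C
freq = int(np.argmax(np.abs(lam)))               # index of the dominant eigenvalue
v = np.exp(2j * np.pi * freq * np.arange(n) / n) / np.sqrt(n)
assert np.allclose(C @ v, lam[freq] * v)         # v is the top eigenvector
print(np.linalg.matrix_rank(v.reshape(n1, n2)))  # -> 1

# Hypothetical sketch (not necessarily the paper's exact algorithm):
# power iteration with a rank-r truncation of the matricized iterate.
def rank_truncated_power_method(A, n1, n2, r, iters=200, seed=1):
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(n1 * n2)
    x /= np.linalg.norm(x)
    for _ in range(iters):
        y = A @ x                                # power step
        U, s, Vt = np.linalg.svd(y.reshape(n1, n2), full_matrices=False)
        Y = (U[:, :r] * s[:r]) @ Vt[:r]          # best rank-r approximation
        x = Y.ravel()
        x /= np.linalg.norm(x)
    return x

# Spiked test matrix whose top eigenvector matricizes to a rank-1 matrix.
a, b = rng.standard_normal(n1), rng.standard_normal(n2)
u = np.kron(a, b)                 # u.reshape(n1, n2) equals np.outer(a, b)
u /= np.linalg.norm(u)
E = rng.standard_normal((n, n))
A = 10.0 * np.outer(u, u) + 0.1 * (E + E.T)      # symmetric spiked matrix
x = rank_truncated_power_method(A, n1, n2, r=1)
print(abs(x @ u))                 # close to 1: the spike is recovered
```

The SVD truncation here plays the role that hard thresholding plays in the truncated power method for sparse eigenvalue problems: each iterate is projected back onto the structured set, which is low-rank matricizations rather than sparse vectors.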
