On determining the number of spikes in a high-dimensional spiked population model

In a spiked population model, the population covariance matrix has all its eigenvalues equal to units except for a few fixed eigenvalues (spikes). Determining the number of spikes is a fundamental problem which appears in many scientific fields, including signal processing (linear mixture model) or economics (factor model). Several recent papers studied the asymptotic behavior of the eigenvalues of the sample covariance matrix (sample eigenvalues) when the dimension of the observations and the sample size both grow to infinity so that their ratio converges to a positive constant. Using these results, we propose a new estimator based on the difference between two consecutive sample eigenvalues.

[1]  Eric R. Ziegel,et al.  Tsukuba Meeting: Largest Attendance Ever , 2004, Technometrics.

[2]  D. Paul ASYMPTOTICS OF SAMPLE EIGENSTRUCTURE FOR A LARGE DIMENSIONAL SPIKED COVARIANCE MODEL , 2007 .

[3]  J. W. Silverstein,et al.  Eigenvalues of large sample covariance matrices of spiked population models , 2004, math/0408165.

[4]  A. Guionnet,et al.  Fluctuations of the Extreme Eigenvalues of Finite Rank Deformations of Random Matrices , 2010, 1009.0145.

[5]  V. Marčenko,et al.  DISTRIBUTION OF EIGENVALUES FOR SOME SETS OF RANDOM MATRICES , 1967 .

[6]  S. Ross The arbitrage theory of capital asset pricing , 1976 .

[7]  Z. Bai,et al.  Central limit theorems for eigenvalues in a spiked population model , 2008, 0806.2503.

[8]  A. Onatski TESTING HYPOTHESES ABOUT THE NUMBER OF FACTORS IN LARGE FACTOR MODELS , 2009 .

[9]  B. Nadler,et al.  Determining the number of components in a factor model from limited noisy data , 2008 .

[10]  J. Bai,et al.  Determining the Number of Factors in Approximate Factor Models , 2000 .

[11]  Jianfeng Yao,et al.  On sample eigenvalues in a generalized spiked population model , 2008, J. Multivar. Anal..

[12]  F. Dias,et al.  Determining the number of factors in approximate factor models with global and group-specific factors , 2008 .

[13]  Z. Bai,et al.  CLT for linear spectral statistics of large dimensional sample covariance matrices with dependent data , 2017, Statistical Papers.

[14]  I. Johnstone On the distribution of the largest eigenvalue in principal components analysis , 2001 .