Asymptotics of the principal components estimator of large factor models with weakly influential factors

This paper introduces a drifting-parameter asymptotic framework to derive accurate approximations to the finite sample distribution of the principal components (PC) estimator in situations when the factors’ explanatory power does not strongly dominate the explanatory power of the cross-sectionally and temporally correlated idiosyncratic terms. Under our asymptotics, the PC estimator is inconsistent. We find explicit formulae for the amount of the inconsistency, and propose an estimator of the number of factors for which the PC estimator works reasonably well. For the special case when the idiosyncratic terms are cross-sectionally but not temporally correlated (or vice versa), we show that the coefficients in the OLS regressions of the PC estimates of factors (loadings) on the true factors (true loadings) are asymptotically normal, and find explicit formulae for the corresponding asymptotic covariance matrix. We explain how to estimate the parameters of the derived asymptotic distributions. Our Monte Carlo analysis suggests that our asymptotic formulae and estimators work well even for relatively small n and T. We apply our theoretical results to test a hypothesis about the factor content of the US stock return data.

[1]  Semiparametric Estimation of aCharacteristic-based Factor Model ofCommon Stock Returns , 2006 .

[2]  J. Bai,et al.  Large Dimensional Factor Analysis , 2008 .

[3]  Mark W. Watson,et al.  Advances in Economics and Econometrics: Macroeconomic Forecasting Using Many Predictors , 2003 .

[4]  Song-xi Chen,et al.  Tests for High-Dimensional Covariance Matrices , 2010, Random Matrices: Theory and Applications.

[5]  I. Johnstone,et al.  Sparse Principal Components Analysis , 2009, 0901.4392.

[6]  M. Rothschild,et al.  Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets , 1983 .

[7]  J. W. Silverstein,et al.  Spectral Analysis of Large Dimensional Random Matrices , 2009 .

[8]  Jean Boivin,et al.  Macroeconomic Dynamics in the Euro Area , 2008 .

[9]  Mario Forni,et al.  Aggregation of linear dynamic microeconomic models , 1999 .

[10]  Z. Bai,et al.  On the limit of the largest eigenvalue of the large dimensional sample covariance matrix , 1988 .

[11]  Catherine Doz,et al.  A Quasi–Maximum Likelihood Approach for Large, Approximate Dynamic Factor Models , 2006, Review of Economics and Statistics.

[12]  U. Grenander,et al.  Toeplitz Forms And Their Applications , 1958 .

[13]  C. De Mol,et al.  Forecasting Using a Large Number of Predictors: Is Bayesian Regression a Valid Alternative to Principal Components? , 2006, SSRN Electronic Journal.

[14]  Mark W. Watson,et al.  Chapter 10 Forecasting with Many Predictors , 2006 .

[15]  A. Onatski Determining the Number of Factors from Empirical Distribution of Eigenvalues , 2010, The Review of Economics and Statistics.

[16]  I. Johnstone,et al.  On Consistency and Sparsity for Principal Components Analysis in High Dimensions , 2009, Journal of the American Statistical Association.

[17]  V. Marčenko,et al.  DISTRIBUTION OF EIGENVALUES FOR SOME SETS OF RANDOM MATRICES , 1967 .

[18]  J. MacKinnon,et al.  Econometric Theory and Methods , 2003 .

[19]  R. Bass,et al.  Review: P. Billingsley, Convergence of probability measures , 1971 .

[20]  Mark M. Carhart On Persistence in Mutual Fund Performance , 1997 .

[21]  Douglas A. McManus Who Invented Local Power Analysis? , 1991, Econometric Theory.

[22]  D. McLeish Dependent Central Limit Theorems and Invariance Principles , 1974 .

[23]  J. Stock,et al.  Macroeconomic Forecasting Using Diffusion Indexes , 2002 .

[24]  Jean Boivin,et al.  How Has the Euro Changed the Monetary Transmission Mechanism? , 2008, NBER Macroeconomics Annual.

[25]  E. Fama,et al.  Common risk factors in the returns on stocks and bonds , 1993 .

[26]  R. Cattell The Scree Test For The Number Of Factors. , 1966, Multivariate behavioral research.

[27]  F. Dias,et al.  Determining the number of factors in approximate factor models with global and group-specific factors , 2008 .

[28]  C. Loan,et al.  Approximation with Kronecker Products , 1992 .

[29]  Marcelo J. Moreira,et al.  Asymptotic power of sphericity tests for high-dimensional data , 2013, 1306.4867.

[30]  J. W. Silverstein,et al.  No eigenvalues outside the support of the limiting spectral distribution of large-dimensional sample covariance matrices , 1998 .

[31]  Z. Bai,et al.  CLT for linear spectral statistics of large dimensional sample covariance matrices with dependent data , 2017, Statistical Papers.

[32]  P. Hall,et al.  Martingale Limit Theory and Its Application , 1980 .

[33]  Y. Yin Limiting spectral distribution for a class of random matrices , 1986 .

[34]  J. W. Silverstein,et al.  Analysis of the limiting spectral distribution of large dimensional random matrices , 1995 .

[35]  J. Bai,et al.  Inferential Theory for Factor Models of Large Dimensions , 2003 .

[36]  Gregory Connor,et al.  Semiparametric Estimation of a Characteristic-Based Factor Model of Stock Returns , 2000 .

[37]  J. Stock,et al.  Instrumental Variables Regression with Weak Instruments , 1994 .

[38]  Paul A. Bekker,et al.  ALTERNATIVE APPROXIMATIONS TO THE DISTRIBUTIONS OF INSTRUMENTAL VARIABLE ESTIMATORS , 1994 .

[39]  Marco Lippi,et al.  THE GENERALIZED DYNAMIC FACTOR MODEL: REPRESENTATION THEORY , 2001, Econometric Theory.

[40]  John C. W. Rayner,et al.  Smooth test of goodness of fit , 1989 .

[41]  N. Levenberg,et al.  Function theory in several complex variables , 2001 .

[42]  J. Bai,et al.  Determining the Number of Factors in Approximate Factor Models , 2000 .

[43]  J. Stock,et al.  Instrumental Variables Regression with Weak Instruments , 1994 .

[44]  Lucrezia Reichlin,et al.  Factor Models in Large Cross-Sections of Time Series , 2002 .

[45]  T. W. Anderson An Introduction to Multivariate Statistical Analysis, 2nd Edition. , 1985 .

[46]  M. Hallin,et al.  The Generalized Dynamic-Factor Model: Identification and Estimation , 2000, Review of Economics and Statistics.

[47]  D. Paul ASYMPTOTICS OF SAMPLE EIGENSTRUCTURE FOR A LARGE DIMENSIONAL SPIKED COVARIANCE MODEL , 2007 .

[48]  J. W. Silverstein Strong convergence of the empirical distribution of eigenvalues of large dimensional random matrices , 1995 .

[49]  J. Neyman »Smooth test» for goodness of fit , 1937 .

[50]  Jean Boivin,et al.  How Has the Euro Changed the Monetary Transmission? , 2008 .

[51]  Serena Ng,et al.  Are More Data Always Better for Factor Analysis? , 2003 .