Factor modeling for high-dimensional time series: inference for the number of factors

This paper deals with the factor modeling for high-dimensional time series based on a dimension-reduction viewpoint. Under stationary settings, the inference is simple in the sense that both the number of factors and the factor loadings are estimated in terms of an eigenanalysis for a nonnegative definite matrix, and is therefore applicable when the dimension of time series is on the order of a few thousands. Asymptotic properties of the proposed method are investigated under two settings: (i) the sample size goes to infinity while the dimension of time series is fixed; and (ii) both the sample size and the dimension of time series go to infinity together. In particular, our estimators for zero-eigenvalues enjoy faster convergence (or slower divergence) rates, hence making the estimation for the number of factors easier. In particular, when the sample size and the dimension of time series go to infinity together, the estimators for the eigenvalues are no longer consistent. However, our estimator for the number of the factors, which is based on the ratios of the estimated eigenvalues, still works fine. Furthermore, this estimation shows the so-called “blessing of dimensionality” property in the sense that the performance of the estimation may improve when the dimension of time series increases. A two-step procedure is investigated when the factors are of different degrees of strength. Numerical illustration with both simulated and real data is also reported.

[1]  T. W. Anderson,et al.  The use of factor analysis in the statistical analysis of multiple time series , 1963 .

[2]  E. J. Hannan,et al.  Multiple time series , 1970 .

[3]  G. Stewart Error and Perturbation Bounds for Subspaces Associated with Certain Eigenvalue Problems , 1973 .

[4]  H. Tong,et al.  Applications of principal component analysis and factor analysis in the identification of multivariable systems , 1974 .

[5]  S. Ross The arbitrage theory of capital asset pricing , 1976 .

[6]  Thomas J. Sargent,et al.  Business cycle modeling without pretending to have too much a priori economic theory , 1976 .

[7]  Gary Chamberlain,et al.  FUNDS, FACTORS, AND DIVERSIFICATION IN ARBITRAGE PRICING MODELS , 1983 .

[8]  D. B. Preston Spectral Analysis and Time Series , 1983 .

[9]  M. Rothschild,et al.  Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets , 1983 .

[10]  George E. P. Box,et al.  Identifying a Simplifying Structure in Time Series , 1987 .

[11]  G. C. Tiao,et al.  Model Specification in Multivariate Time Series , 1989 .

[12]  Charles R. Johnson,et al.  Topics in Matrix Analysis , 1991 .

[13]  Helmut Lütkepohl,et al.  Introduction to multiple time series analysis , 1991 .

[14]  Ian T. Jolliffe,et al.  Introduction to Multiple Time Series Analysis , 1993 .

[15]  Danny Quah,et al.  A Dynamic Index Model for Large Cross Sections , 1993 .

[16]  L. Dümbgen A simple proof and refinement of Wielandt's eigenvalue inequality , 1995 .

[17]  G. Reinsel Elements of Multivariate Time Series Analysis , 1995 .

[18]  G. Reinsel Elements of Multivariate Time Series Analysis, 2nd Edition , 1998 .

[19]  M. Hallin,et al.  The Generalized Dynamic-Factor Model: Identification and Estimation , 2000, Review of Economics and Statistics.

[20]  J. Bai,et al.  Determining the Number of Factors in Approximate Factor Models , 2000 .

[21]  Marco Lippi,et al.  The Generalized Dynamic Factor Model , 2002 .

[22]  J. Bai,et al.  Inferential Theory for Factor Models of Large Dimensions , 2003 .

[23]  Marco Lippi,et al.  The generalized dynamic factor model: consistency and rates , 2004 .

[24]  N. Wermuth,et al.  Nonlinear Time Series: Nonparametric and Parametric Methods , 2005 .

[25]  Daniel Peña,et al.  Nonstationary dynamic factor analysis , 2006 .

[26]  Serena Ng,et al.  Determining the Number of Primitive Shocks in Factor Models , 2007 .

[27]  M. Hallin,et al.  Determining the Number of Factors in the General Dynamic Factor Model , 2007 .

[28]  S. Péché Universality results for largest eigenvalues of some sample covariance matrix ensembles , 2007, 0705.1701.

[29]  Bakhodir A Ergashev,et al.  Analysis of Multi-Factor Affine Yield Curve Models , 2008 .

[30]  Q. Yao,et al.  Modelling multiple time series via common factors , 2008 .

[31]  M. Pesaran,et al.  Infinite Dimensional VARs and Factor Models , 2009, SSRN Electronic Journal.

[32]  Manfred Deistler,et al.  Generalized linear dynamic factor models - An approach via singular autoregressions , 2009, 2009 European Control Conference (ECC).

[33]  Brian D. O. Anderson,et al.  Generalized linear dynamic factor models - An approach via singular autoregressions , 2009, ECC.

[34]  Hansheng Wang Factor Profiling for Ultra High Dimensional Variable Selection , 2010 .

[35]  Qiwei Yao,et al.  Identifying the finite dimensionality of curve time series , 2010, 1211.2522.

[36]  Clifford Lam,et al.  Estimation of latent factors for high-dimensional time series , 2011 .

[37]  Qiwei Yao,et al.  Large Volatility Matrix Inference via Combining Low-Frequency and High-Frequency Approaches , 2011 .