Estimation of latent factors for high-dimensional time series

This paper deals with the dimension reduction of high-dimensional time series based on a lower-dimensional factor process. In particular, we allow the dimension N of the time series to be as large as, or even larger than, the length T of the observed time series. The estimation of the factor loading matrix and the factor process itself is carried out via an eigenanalysis of an N×N non-negative definite matrix. We show that when all the factors are strong, in the sense that the norm of each column in the factor loading matrix is of the order N^{1/2}, the estimator of the factor loading matrix is weakly consistent in L2-norm with a convergence rate independent of N. Thus the curse is cancelled out by the blessing of dimensionality. We also establish the asymptotic properties of the estimators when the factors are not strong. The proposed method and its asymptotic properties are illustrated in a simulation study. An application to an implied volatility data set, with a trading strategy derived from the fitted factor model, is also reported. Copyright 2011, Oxford University Press.
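
The eigenanalysis step can be illustrated with a minimal sketch. The abstract does not say how the N×N non-negative definite matrix is built. The construction assumed here sums the products S(k) S(k)^T of lagged sample autocovariance matrices over lags k = 1, ..., k0. The function name `estimate_factors`, the parameter `k0`, and the assumption that the number of factors `r` is known are all hypothetical choices made for illustration.

```python
import numpy as np


def estimate_factors(y, r, k0=1):
    """Sketch of factor estimation by eigenanalysis (illustrative only).

    y  : (T, N) array, one N-dimensional observation per row.
    r  : number of factors, assumed known here.
    k0 : largest lag used to build the N x N matrix (an assumed tuning choice).
    """
    T, N = y.shape
    yc = y - y.mean(axis=0)  # centre each component series

    # Assumed construction: M = sum_k S(k) S(k)^T, where S(k) is the
    # lag-k sample autocovariance matrix. M is non-negative definite.
    M = np.zeros((N, N))
    for k in range(1, k0 + 1):
        S = yc[k:].T @ yc[:-k] / (T - k)
        M += S @ S.T

    # eigh returns eigenvalues in ascending order, so reverse the columns
    # and keep the eigenvectors of the r largest eigenvalues.
    _, eigvecs = np.linalg.eigh(M)
    A_hat = eigvecs[:, ::-1][:, :r]  # estimated loadings, orthonormal columns

    x_hat = yc @ A_hat  # estimated factor process, shape (T, r)
    return A_hat, x_hat
```

The loading matrix is identifiable only up to rotation, so `A_hat` should be read as an estimate of the column space of the true loading matrix rather than of the matrix itself.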
