Factor models in high-dimensional time series—A time-domain approach

High-dimensional time series may well be the most common type of dataset in the so-called “big data” revolution, and have entered current practice in many areas, including meteorology, genomics, chemometrics, connectomics, complex physics simulations, biological and environmental research, finance, and econometrics. The analysis of such datasets poses significant challenges, from both a statistical and a numerical point of view. The most successful procedures so far have been based on dimension reduction techniques and, more particularly, on high-dimensional factor models. Those models have been developed, essentially, within time series econometrics, and deserve to be better known in other areas. In this paper, we provide an original time-domain presentation of the methodological foundations of those models (dynamic factor models are usually described via a spectral approach), contrasting such concepts as commonality and idiosyncrasy, factors and common shocks, dynamic and static principal components. That time-domain approach emphasizes the fact that, contrary to the static factor models favored by practitioners, the so-called general dynamic factor model essentially does not impose any constraints on the data-generating process, but follows from a general representation result.
