Network-Assisted Estimation for Large-dimensional Factor Model with Guaranteed Convergence Rate Improvement.

Network structures are increasingly popular for capturing the intrinsic relationships among large-scale variables. In this paper we propose to improve the estimation accuracy of large-dimensional factor models when a network structure among individuals is observed. To fully exploit the prior network information, we construct two different penalties that regularize the factor loadings and shrink the idiosyncratic errors. Closed-form solutions are provided for the penalized optimization problems. Theoretical results demonstrate that the modified estimators achieve faster convergence rates and lower asymptotic mean squared errors when the underlying network structure among individuals is correct. An interesting finding is that even if the prior network is totally misleading, the proposed estimators perform no worse than conventional state-of-the-art methods. Furthermore, to facilitate practical application, we propose a computationally efficient, data-driven approach to select the tuning parameters. We also provide an empirical criterion to determine the number of common factors. Simulation studies and an application to the S&P100 weekly return dataset illustrate the superiority and adaptivity of the new approach.
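
To give a concrete sense of how a network penalty on the loadings can admit a closed-form solution, the following is a minimal sketch under stated assumptions. It is not the estimator proposed in the paper, whose exact penalties are not given in this abstract. It assumes a generic graph-Laplacian smoothness penalty, alpha * tr(B' L B), applied to the loadings B with the principal-component factors F held fixed and normalized so that F'F/T = I. The function names `pca_factors` and `laplacian_smoothed_loadings`, the adjacency matrix `A`, and the tuning parameter `alpha` are all hypothetical.

```python
import numpy as np


def pca_factors(X, r):
    """Principal-component estimates for X = F B' + E, with X of shape (T, p).

    Returns F (T x r), normalized so that F'F / T = I_r, and B (p x r).
    """
    T, p = X.shape
    # Eigen-decomposition of the symmetric T x T matrix X X' / (T p).
    eigvals, eigvecs = np.linalg.eigh(X @ X.T / (T * p))
    top = eigvecs[:, np.argsort(eigvals)[::-1][:r]]
    F = np.sqrt(T) * top
    B = X.T @ F / T
    return F, B


def laplacian_smoothed_loadings(X, F, A, alpha):
    """Assumed penalty (illustration only, not the paper's):

        minimize over B:  ||X - F B'||_F^2 / T + alpha * tr(B' L B),

    where L = D - A is the Laplacian of the observed network A.
    With F'F / T = I, the first-order condition gives the closed form
        B = (I + alpha * L)^{-1} X' F / T.
    """
    T, p = X.shape
    L = np.diag(A.sum(axis=1)) - A
    return np.linalg.solve(np.eye(p) + alpha * L, X.T @ F / T)
```

Setting `alpha = 0` recovers the ordinary principal-component loadings, while larger values pull the loadings of linked individuals toward each other. In this sketch `alpha` must be supplied by the user; the data-driven tuning procedure mentioned in the abstract is not reproduced here.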
