Clustering Huge Number of Financial Time Series: A Panel Data Approach With High-Dimensional Predictors and Factor Structures

Abstract This article introduces a new procedure for clustering a large number of financial time series based on high-dimensional panel data with grouped factor structures. The proposed method attempts to capture the level of similarity of each of the time series based on sensitivity to observable factors as well as to the unobservable factor structure. The proposed method allows for correlations between observable and unobservable factors and also allows for cross-sectional and serial dependence and heteroscedasticities in the error structure, which are common in financial markets. In addition, theoretical properties are established for the procedure. We apply the method to analyze the returns for over 6000 international stocks from over 100 financial markets. The empirical analysis quantifies the extent to which the U.S. subprime crisis spilled over to the global financial markets. Furthermore, we find that nominal classifications based on either listed market, industry, country or region are insufficient to characterize the heterogeneity of the global financial markets. Supplementary materials for this article are available online.

[1]  M. Rothschild,et al.  Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets , 1982 .

[2]  M. Hallin,et al.  The Generalized Dynamic-Factor Model: Identification and Estimation , 2000, Review of Economics and Statistics.

[3]  Cun-Hui Zhang Nearly unbiased variable selection under minimax concave penalty , 2010, 1002.4734.

[4]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[5]  E. Fama,et al.  Size, Value, and Momentum in International Stock Returns , 2011 .

[6]  Thomas J. Sargent,et al.  Business cycle modeling without pretending to have too much a priori economic theory , 1976 .

[7]  John M. Griffin,et al.  Are the Fama and French Factors Global or Country-Specific? , 2002 .

[8]  Richard Roll,et al.  Industrial Structure and the Comparative Behavior of International Stock Market Indices , 1992 .

[9]  Mark W. Watson,et al.  Consistent Estimation of the Number of Dynamic Factors in a Large N and T Panel , 2007 .

[10]  Yixiao Sun Estimation and Inference in Panel Structure Models , 2005 .

[11]  Terence Tao,et al.  The Dantzig selector: Statistical estimation when P is much larger than n , 2005, math/0506081.

[12]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[13]  G. Karolyi,et al.  Another Look at the Role of the Industrial Structure of Markets for International Diversification Strategies , 1996 .

[14]  T. Ando,et al.  Stock return predictability: A factor-augmented predictive regression system with shrinkage method , 2018 .

[15]  E. Fama,et al.  Risk, Return, and Equilibrium: Empirical Tests , 1973, Journal of Political Economy.

[16]  Serena Ng,et al.  Estimation of Panel Data Models with Parameter Heterogeneity when Group Membership is Unknown , 2007 .

[17]  C. Whiteman,et al.  Understanding the Evolution of World Business Cycles , 2005, SSRN Electronic Journal.

[18]  Jian Huang,et al.  COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION. , 2011, The annals of applied statistics.

[19]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[20]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[21]  P. Phillips,et al.  Identifying Latent Structures in Panel Data , 2014 .

[22]  S. Beckers,et al.  National versus Global Influences on Equity Returns , 1996 .

[23]  Jianqing Fan,et al.  Large covariance estimation by thresholding principal orthogonal complements , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[24]  Michael J. Aked,et al.  The Increasing Importance of Industry Factors , 2000 .

[25]  Gregory Connor,et al.  Performance Measurement with the Arbitrage Pricing Theory: A New Framework for Analysis , 1985 .

[26]  Kewei Hou,et al.  What Factors Drive Global Stock Returns? , 2011 .

[27]  Tomohiro Ando,et al.  Panel Data Models with Grouped Factor Structure Under Unknown Group Membership , 2013 .

[28]  G. Casella,et al.  The Bayesian Lasso , 2008 .

[29]  F. Diebold,et al.  Global Yield Curve Dynamics and Interactions: A Dynamic Nelson-Siegel Approach , 2007 .

[30]  E. Forgy,et al.  Cluster analysis of multivariate data : efficiency versus interpretability of classifications , 1965 .

[31]  J. Bai,et al.  Determining the Number of Factors in Approximate Factor Models , 2000 .

[32]  Matteo Barigozzi,et al.  Improved penalization for determining the number of factors in approximate factor models , 2010 .

[33]  R. Rigobón,et al.  No Contagion, Only Interdependence: Measuring Stock Market Comovements , 2002 .

[34]  Clifford Lam,et al.  PROFILE-KERNEL LIKELIHOOD INFERENCE WITH DIVERGING NUMBER OF PARAMETERS. , 2008, Annals of statistics.

[35]  Jianqing Fan,et al.  Nonconcave penalized likelihood with a diverging number of parameters , 2004, math/0406466.

[36]  J. Bai,et al.  Panel Data Models With Interactive Fixed Effects , 2009 .

[37]  F. Longstaff The subprime credit crisis and contagion in financial markets , 2010 .

[38]  F. Diebold,et al.  UNIVERSITY OF SOUTHERN CALIFORNIA Center for Applied Financial Economics (CAFE) On the Network Topology of Variance Decompositions: Measuring the Connectedness of Financial Firms , 2011 .

[39]  S. Satchell,et al.  Global equity styles and industry effects: the pre-eminence of value relative to size , 2001 .

[40]  F. T. Magiera Are the Fama and French Factors Global or Country Specific , 2002 .

[41]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[42]  Elena Manresa,et al.  Grouped Patterns of Heterogeneity in Panel Data , 2015 .

[43]  Marc,et al.  The Generalized Dynamic Factor Model determining the number of factors ∗ , 2005 .

[44]  M. Pesaran Estimation and Inference in Large Heterogeneous Panels with a Multifactor Error Structure , 2004, SSRN Electronic Journal.

[45]  Simon M. Potter,et al.  Dynamic Hierarchical Factor Models , 2011, Review of Economics and Statistics.

[46]  J. Bai,et al.  Selecting the regularization parameters in high-dimensional panel data models: Consistency and efficiency , 2018 .

[47]  Sean P. Baca,et al.  The Rise of Sector Effects in Major Equity Markets , 2000 .