High-Dimensional Estimation, Basis Assets, and the Adaptive Multi-Factor Model

The paper proposes a new algorithm for the high-dimensional financial data — the Groupwise Interpretable Basis Selection (GIBS) algorithm, to estimate a new Adaptive Multi-Factor (AMF) asset pricing model, implied by the recently developed Generalized Arbitrage Pricing Theory, which relaxes the convention that the number of risk-factors is small. We first obtain an adaptive collection of basis assets and then simultaneously test which basis assets correspond to which securities, using high-dimensional methods. The AMF model, along with the GIBS algorithm, is shown to have a significantly better fitting and prediction power than the Fama–French 5-factor model.

[1]  Ang Li,et al.  QuClassi: A Hybrid Deep Neural Network Architecture based on Quantum State Fidelity , 2021, MLSys.

[2]  Dingwen Tao,et al.  Speculative Container Scheduling for Deep Learning Applications in a Kubernetes Cluster , 2020, IEEE Systems Journal.

[3]  Damek Davis,et al.  Clustering a Mixture of Gaussians with Unknown Covariance , 2021, ArXiv.

[4]  Jingwei Li Two ways towards combining Sequential Neural Network and Statistical Methods to Improve the Prediction of Time Series , 2021, ArXiv.

[5]  Liao Zhu The Adaptive Multi-Factor Model and the Financial Market , 2021, ArXiv.

[6]  Martin T. Wells,et al.  A News-based Machine Learning Model for Adaptive Asset Pricing , 2021, ArXiv.

[7]  Stephanie C. TerMaath,et al.  A Randomized Subspace-based Approach for Dimensionality Reduction and Important Variable Selection , 2021, J. Mach. Learn. Res..

[8]  Sicheng Wang,et al.  Staying at Home Is a Privilege: Evidence from Fine-Grained Mobile Phone Location Data in the United States during the COVID-19 Pandemic , 2021, Annals of the American Association of Geographers.

[9]  Almantas Galvanauskas,et al.  Improved Machine Learning Algorithms for Optimizing Coherent Pulse Stacking Amplification , 2021, 2021 Conference on Lasers and Electro-Optics (CLEO).

[10]  Yanci Zhang,et al.  Form 10-Q Itemization , 2021, CIKM.

[11]  Qiang Guan,et al.  A Hybrid System for Learning Classical Data in Quantum States , 2020, 2021 IEEE International Performance, Computing, and Communications Conference (IPCCC).

[12]  M. Wells,et al.  Time-Invariance Coefficients Tests with the Adaptive Multi-Factor Model , 2020, SSRN Electronic Journal.

[13]  Samuel A. Stein,et al.  QuGAN: A Quantum State Fidelity based Generative Adversarial Network , 2020, 2021 IEEE International Conference on Quantum Computing and Engineering (QCE).

[14]  Kuangyan Song,et al.  FrequentNet : A Novel Interpretable Deep Learning Model for Image Classification , 2020, SSRN Electronic Journal.

[15]  Ali Shojaie,et al.  In Defense of the Indefensible: A Very Naïve Approach to High-Dimensional Inference. , 2017, Statistical science : a review journal of the Institute of Mathematical Statistics.

[16]  Ming Chen,et al.  Differentiate Quality of Experience Scheduling for Deep Learning Applications with Docker Containers in the Cloud , 2020, ArXiv.

[17]  Xiao Huang,et al.  Time-series clustering for home dwell time during COVID-19: what can we learn from it? , 2020, medRxiv.

[18]  Tianming Du,et al.  Multiple Slice k-space Deep Learning for Magnetic Resonance Imaging Reconstruction , 2020, 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC).

[19]  Madeleine Udell,et al.  Matrix Completion with Quantified Uncertainty through Low Rank Gaussian Copula , 2020, NeurIPS.

[20]  M. Wells,et al.  The Low-volatility Anomaly and the Adaptive Multi-Factor Model , 2021, SSRN Electronic Journal.

[21]  Kaizheng Wang,et al.  Efficient Clustering for Stretched Mixtures: Landscape and Optimality , 2020, NeurIPS.

[22]  Madeleine Udell,et al.  Missing Value Imputation for Mixed Data via Gaussian Copula , 2019, KDD.

[23]  Serhiy Kozak,et al.  Shrinking the Cross Section , 2017, Journal of Financial Economics.

[24]  Dacheng Xiu,et al.  Taming the Factor Zoo: A Test of New Factors , 2017, The Journal of Finance.

[25]  M. Wells,et al.  Clustering Structure of Microstructure Measures , 2019, SSRN Electronic Journal.

[26]  Cheng Jie,et al.  Examining multiplicity and dynamics of publics’ crisis narratives with large-scale Twitter data , 2018, Public Relations Review.

[27]  Csaba Szepesvári,et al.  Stochastic Optimization in a Cumulative Prospect Theory Framework , 2018, IEEE Transactions on Automatic Control.

[28]  Tarun Chordia,et al.  p-Hacking: Evidence from Two Million Trading Strategies , 2017 .

[29]  Serhiy Kozak,et al.  Interpreting Factor Models , 2017 .

[30]  Michael C. Fu,et al.  Cumulative Prospect Theory Meets Reinforcement Learning: Prediction and Control , 2015, ICML.

[31]  Campbell R. Harvey,et al.  Editor's Choice … and the Cross-Section of Expected Returns , 2016 .

[32]  Robert Tibshirani,et al.  A General Framework for Estimation and Inference From Clusters of Features , 2015, 1511.07839.

[33]  Robert A. Jarrow,et al.  Bubbles and Multiple-Factor Asset Pricing Models , 2015 .

[34]  Philip Protter,et al.  Positive alphas and a generalized multiple-factor asset pricing model , 2015 .

[35]  Daniel Ferreira,et al.  Spurious Factors in Linear Asset Pricing Models , 2015 .

[36]  Campbell R. Harvey,et al.  . . . And the Cross-Section of Expected Returns , 2014 .

[37]  E. Fama,et al.  A Five-Factor Asset Pricing Model , 2014 .

[38]  R. Tibshirani,et al.  Exact Post-Selection Inference for Sequential Regression Procedures , 2014, 1401.3889.

[39]  Adel Javanmard,et al.  Confidence intervals and hypothesis testing for high-dimensional regression , 2013, J. Mach. Learn. Res..

[40]  S. Geer,et al.  On asymptotically optimal confidence regions and tests for high-dimensional models , 2013, 1303.0518.

[41]  Mohamed Hebiri,et al.  How Correlations Influence Lasso Prediction , 2012, IEEE Transactions on Information Theory.

[42]  S. Geer,et al.  The Lasso, correlated design, and improved oracle inequalities , 2011, 1107.0189.

[43]  R. Tibshirani,et al.  Strong rules for discarding predictors in lasso‐type problems , 2010, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[44]  Cun-Hui Zhang,et al.  Confidence intervals for low dimensional parameters in high dimensional linear models , 2011, 1110.2563.

[45]  Robert Tibshirani,et al.  Hierarchical Clustering With Prototypes via Minimax Linkage , 2011, Journal of the American Statistical Association.

[46]  Martin Larsson,et al.  THE MEANING OF MARKET EFFICIENCY , 2011 .

[47]  A. Belloni,et al.  Pivotal estimation via square-root Lasso in nonparametric regression , 2011, 1105.1475.

[48]  Trevor Hastie,et al.  Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent. , 2011, Journal of statistical software.

[49]  Ravi Jagannathan,et al.  Cross-Sectional Asset Pricing Tests , 2010 .

[50]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[51]  S. B. Thompson,et al.  Predicting Excess Stock Returns Out of Sample: Can Anything Beat the Historical Average? , 2008 .

[52]  R. Tibshirani,et al.  Sparse Principal Component Analysis , 2006 .

[53]  Michael Wolf,et al.  Control of generalized error rates in multiple testing , 2007, 0710.2258.

[54]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[55]  John D. Storey A direct approach to false discovery rates , 2002 .

[56]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[57]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[58]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[59]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[60]  E. Fama,et al.  Common risk factors in the returns on stocks and bonds , 1993 .

[61]  E. Fama,et al.  The Cross‐Section of Expected Stock Returns , 1992 .

[62]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[63]  Stephen A. Ross,et al.  A Test of the Efficiency of a Given Portfolio , 1989 .

[64]  S. Ross The arbitrage theory of capital asset pricing , 1976 .

[65]  R. C. Merton,et al.  AN INTERTEMPORAL CAPITAL ASSET PRICING MODEL , 1973 .

[66]  J. Lintner THE VALUATION OF RISK ASSETS AND THE SELECTION OF RISKY INVESTMENTS IN STOCK PORTFOLIOS AND CAPITAL BUDGETS , 1965 .

[67]  W. Sharpe CAPITAL ASSET PRICES: A THEORY OF MARKET EQUILIBRIUM UNDER CONDITIONS OF RISK* , 1964 .

[68]  H. Theil,et al.  Economic Forecasts and Policy. , 1959 .