The Promises and Pitfalls of Machine Learning for Predicting Stock Returns

Recent research suggests that machine learning models dominate traditional linear models in predicting cross-sectional stock returns. The authors confirm this finding when predicting one-month-forward-looking returns based on a set of common stock characteristics, including predictors such as short-term reversal. Despite the statistical advantage of machine learning model predictions, the authors demonstrate that the economic gains tend to be more limited and critically dependent on the ability to take risk and implement trades efficiently. Unlike traditional models, machine learning models have been somewhat more effective over the past decade at discerning valuable predictions from cross-sectional equity characteristics.

TOPICS: Security analysis and valuation, big data/machine learning

Key Findings
▪ The authors compare a nonlinear machine learning model called gradient boosting machine (GBM) with traditional linear models in predicting cross-sectional stock returns based on well-known equity characteristics.
▪ They demonstrate how to rationalize the mechanics and outcome of GBM to alleviate its black-box characteristics.
▪ The extent to which the statistical advantage of GBM’s performance over that of linear models can be translated into economic gains depends critically on one’s ability to take risk and implement trades efficiently.
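To make the GBM-versus-linear comparison concrete, the following is a minimal sketch rather than the authors' implementation. It fits a one-variable least-squares line and a small gradient boosting machine built from regression stumps to synthetic data, then compares out-of-sample mean squared error. Everything in it is an assumption made for illustration: the single hypothetical characteristic `x`, the quadratic signal, the noise level, the hyperparameters, and all function names. Real stock returns have a far weaker signal than this toy data.

```python
import random

def fit_stump(x, r):
    """Find the single split on x that best fits residuals r under squared error."""
    order = sorted(range(len(x)), key=lambda i: x[i])
    xs = [x[i] for i in order]
    rs = [r[i] for i in order]
    n, total = len(xs), sum(rs)
    best, left_sum = None, 0.0
    for k in range(1, n):
        left_sum += rs[k - 1]
        if xs[k] == xs[k - 1]:
            continue  # cannot split between identical feature values
        right_sum = total - left_sum
        # Maximising this score is equivalent to minimising the split's squared error.
        score = left_sum ** 2 / k + right_sum ** 2 / (n - k)
        if best is None or score > best[0]:
            threshold = 0.5 * (xs[k - 1] + xs[k])
            best = (score, threshold, left_sum / k, right_sum / (n - k))
    return best[1:]  # (threshold, left leaf mean, right leaf mean)

def fit_gbm(x, y, n_rounds=200, learning_rate=0.1):
    """Gradient boosting for squared loss: each stump is fitted to current residuals."""
    base = sum(y) / len(y)
    pred = [base] * len(y)
    stumps = []
    for _ in range(n_rounds):
        resid = [yi - pi for yi, pi in zip(y, pred)]
        thr, left, right = fit_stump(x, resid)
        stumps.append((thr, left, right))
        pred = [p + learning_rate * (left if xi <= thr else right)
                for xi, p in zip(x, pred)]
    return base, stumps, learning_rate

def predict_gbm(model, x):
    base, stumps, lr = model
    return [base + lr * sum(l if xi <= t else r for t, l, r in stumps) for xi in x]

def fit_ols(x, y):
    """Closed-form least squares for a single predictor; returns (intercept, slope)."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    slope = (sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
             / sum((xi - mx) ** 2 for xi in x))
    return my - slope * mx, slope

def mse(y, p):
    return sum((a - b) ** 2 for a, b in zip(y, p)) / len(y)

def make_data(rng, n):
    # Hypothetical characteristic with a purely nonlinear (quadratic) payoff.
    x = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    y = [xi ** 2 + rng.gauss(0.0, 0.1) for xi in x]
    return x, y

rng = random.Random(0)
x_train, y_train = make_data(rng, 400)
x_test, y_test = make_data(rng, 400)

intercept, slope = fit_ols(x_train, y_train)
ols_mse = mse(y_test, [intercept + slope * xi for xi in x_test])
gbm_mse = mse(y_test, predict_gbm(fit_gbm(x_train, y_train), x_test))

print(f"linear out-of-sample MSE: {ols_mse:.4f}")
print(f"GBM out-of-sample MSE:    {gbm_mse:.4f}")
```

Because the simulated payoff is symmetric in `x`, a straight line captures almost none of it, while the boosted stumps approximate the curve piece by piece. The sketch speaks only to the statistical side of the comparison; it says nothing about the economic gains after risk limits and trading costs, which the abstract describes as more limited.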
