Forecasting Financial and Macroeconomic Variables Using Data Reduction Methods: New Empirical Evidence

In this paper, we empirically assess the predictive accuracy of a large group of models based on the use of principle components and other shrinkage methods, including Bayesian model averaging and various bagging, boosting, LASSO and related methods. Our results suggest that model averaging does not dominate other well designed prediction model specification methods, and that using a combination of factor and other shrinkage methods often yields superior predictions. For example, when using recursive estimation windows, which dominate other “windowing” approaches in our experiments, prediction models constructed using pure principal component type models combined with shrinkage methods yield mean square forecast error “best” models around 70% of the time, when used to predict 11 key macroeconomic indicators at various forecast horizons. Baseline linear models (which “win” around 5% of the time) and model averaging methods (which win around 25% of the time) fare substantially worse than our sophisticated nonlinear models. Ancillary findings based on our forecasting experiments underscore the advantages of using recursive estimation strategies, and provide new evidence of the usefulness of yield and yield-spread variables in nonlinear prediction specification.

[1]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[2]  Mark W. Watson,et al.  Generalized Shrinkage Methods for Forecasting Using Many Predictors , 2012 .

[3]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[4]  Seeing Inside the Black Box: Using Diffusion Index Methodology to Construct Factor Proxies in Large Scale Macroeconomic Time Series Environments , 2010 .

[5]  Jonathan H. Wright,et al.  Forecasting In ation , 2011 .

[6]  J. Bai,et al.  Determining the Number of Factors in Approximate Factor Models , 2000 .

[7]  J. Stock,et al.  Forecasting Using Principal Components From a Large Number of Predictors , 2002 .

[8]  Marco Lippi,et al.  The Generalized Dynamic Factor Model , 2002 .

[9]  Michael P. Clements,et al.  Intercept Corrections and Structural Change , 1996 .

[10]  Serena Ng,et al.  Understanding and Comparing Factor-Based Forecasts , 2005 .

[11]  Lutz Kilian,et al.  How Useful is Bagging in Forecasting Economic Time Series? A Case Study of Us CPI Inflation , 2005 .

[12]  Boosting With the L 2 Loss : Regression and Classi cation , 2008 .

[13]  J. Bai,et al.  Confidence Intervals for Diffusion Index Forecasts and Inference for Factor-Augmented Regressions , 2006 .

[14]  Ming Yuan,et al.  Nonnegative Garrote Component Selection in Functional ANOVA models , 2007, AISTATS.

[15]  Christophe Croux,et al.  Least angle regression for time series forecasting with many predictors , 2008 .

[16]  Michael P. Clements,et al.  Macro-economic Forecasting and Modelling , 1995 .

[17]  P. Bühlmann,et al.  Analyzing Bagging , 2001 .

[18]  Gregory Connor,et al.  Risk and Return in an Equilibrium Apt: Application of a New Test Methodology , 1988 .

[19]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[20]  M. Steel,et al.  Benchmark Priors for Bayesian Model Averaging , 2001 .

[21]  J. Stock,et al.  Forecasting with Many Predictors , 2006 .

[22]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[23]  Gary Koop,et al.  Forecasting in Dynamic Factor Models Using Bayesian Model Averaging , 2004 .

[24]  Edward I. George,et al.  The Practical Implementation of Bayesian Model Selection , 2001 .

[25]  A. Adam Ding,et al.  Prediction Intervals, Factor Analysis Models, and High-Dimensional Empirical Linear Prediction , 1999 .

[26]  R. Clemen Combining forecasts: A review and annotated bibliography , 1989 .

[27]  M. Hashem Pesaran,et al.  Selection of estimation window in the presence of breaks , 2007 .

[28]  Gregory Connor,et al.  Performance Measurement with the Arbitrage Pricing Theory: A New Framework for Analysis , 1985 .

[29]  A. Timmermann Forecast Combinations , 2005 .

[30]  Jonathan H. Wright,et al.  Forecasting U.S. Inflation by Bayesian Model Averaging , 2003 .

[31]  Michael P. Clements,et al.  Forecast Combination and Encompassing , 2009 .

[32]  F. Diebold,et al.  Forecast Evaluation and Combination , 1996 .

[33]  A. P. Dawid,et al.  Bayesian Model Averaging and Model Search Strategies , 2007 .

[34]  Norman R. Swanson,et al.  Nonparametric Bootstrap Procedures for Predictive Inference Based on Recursive Estimation Schemes , 2005 .

[35]  Serena Ng,et al.  Are more data always better for factor analysis , 2006 .

[36]  M. Hashem Pesaran,et al.  Variable Selection, Estimation and Inference for Multi-Period Forecasting Problems , 2010 .

[37]  Serena Ng,et al.  Evaluating latent and observed factors in macroeconomics and finance , 2006 .

[38]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[39]  H. White,et al.  A Reality Check for Data Snooping , 2000 .

[40]  J. Stock,et al.  Combination forecasts of output growth in a seven-country data set , 2004 .

[41]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[42]  L. Breiman Better subset regression using the nonnegative garrote , 1995 .

[43]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[44]  Gregory Connor,et al.  A Test for the Number of Factors in an Approximate Factor Model , 1993 .

[45]  M. Steel,et al.  Model uncertainty in cross-country growth regressions , 2001 .

[46]  Durga L. Shrestha,et al.  Experiments with AdaBoost.RT, an Improved Boosting Scheme for Regression , 2006, Neural Computation.

[47]  J. Bai,et al.  Forecasting economic time series using targeted predictors , 2008 .

[48]  M. Yuan,et al.  On the non‐negative garrotte estimator , 2007 .

[49]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[50]  Mark W. Watson,et al.  AN EMPIRICAL COMPARISON OF METHODS FOR FORECASTING USING MANY PREDICTORS , 2005 .

[51]  Massimiliano Marcellino,et al.  Factor Forecasts for the UK , 2005 .

[52]  G. Chow,et al.  Best Linear Unbiased Interpolation, Distribution, and Extrapolation of Time Series by Related Series , 1971 .

[53]  F. Diebold,et al.  Comparing Predictive Accuracy , 1994, Business Cycles.

[54]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[55]  Thomas Richardson,et al.  Boosting methodology for regression problems , 1999, AISTATS.