Ensemble Methods for Causal Effects in Panel Data Settings

In many prediction problems researchers have found that combinations of prediction methods (“ensembles”) perform better than individual methods. A simple example is random forests, which combines predictions from many regression trees. A striking, and substantially more complex, example is the Netflix Prize competition where the winning entry combined predictions using a wide variety of conceptually very different models. In macro-economic forecasting researchers have often found that averaging predictions from different models leads to more accurate forecasts. In this paper we apply these ideas to synthetic control type problems in panel data setting. In this setting a number of conceptually quite different methods have been developed, with some assuming correlations between units that are stable over time, others assuming stable time series patterns common to all units, and others using factor models. With data on state level GDP for 270 quarters, we focus on three basic approaches to predicting missing values, one from each of these strands of the literature. Rather than try to test the different models against each other and find a true model, we focus on combining predictions based on each of the separate models using ensemble methods. For the ensemble predictor we focus on a weighted average of the three individual methods, with non-negative weights determined through out-of-sample cross-validation.

[1]  Adrian E. Raftery,et al.  Bayesian model averaging: a tutorial (with comments by M. Clyde, David Draper and E. I. George, and a rejoinder by the authors , 1999 .

[2]  Yehuda Koren,et al.  All Together Now: A Perspective on the Netflix Prize , 2010 .

[3]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: A General Method for Estimating Sampling Variances for Standard Estimators for Average Causal Effects , 2015 .

[4]  Michael Vitale,et al.  The Wisdom of Crowds , 2015, Cell.

[5]  Jens Hainmueller,et al.  Comparative Politics and the Synthetic Control Method , 2014 .

[6]  L. Breiman Stacked Regressions , 1996, Machine Learning.

[7]  M. J. van der Laan,et al.  Statistical Applications in Genetics and Molecular Biology Super Learner , 2010 .

[8]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[9]  William A. Roberts All Together Now , 2013 .

[10]  Q. Vuong Likelihood Ratio Tests for Model Selection and Non-Nested Hypotheses , 1989 .

[11]  G. Imbens,et al.  Matrix Completion Methods for Causal Panel Data Models , 2017, Journal of the American Statistical Association.

[12]  Hal R. Varian,et al.  Big Data: New Tricks for Econometrics , 2014 .

[13]  G. Imbens,et al.  Balancing, Regression, Difference-in-Differences and Synthetic Control Methods: A Synthesis , 2016, 1610.07748.

[14]  M. Clyde,et al.  Bayesian model averaging: A tutorial - Comment , 1999 .

[15]  J. Stock,et al.  Forecasting with Many Predictors , 2006 .

[16]  Serena Ng,et al.  Determining the Number of Primitive Shocks in Factor Models , 2007 .