Optimal Estimation when Researcher and Social Preferences are Misaligned

Econometric analysis typically focuses on the statistical properties of fixed estimators and ignores researcher choices. In this article, I approach the analysis of experimental data as a mechanism-design problem that acknowledges that researchers choose between estimators, sometimes based on the data and often according to their own preferences. Specifically, I focus on covariate adjustments, which can increase the precision of a treatment-effect estimate, but open the door to bias when researchers engage in specification searches. First, I establish that unbiasedness is a requirement on the estimation of the average treatment effect that aligns researchers’ preferences with the minimization of the mean-squared error relative to the truth, and that fixing the bias can yield an optimal restriction in a minimax sense. Second, I provide a constructive characterization of all treatment-effect estimators with fixed bias as samplesplitting procedures. Third, I show how these results imply flexible pre-analysis plans for randomized experiments that include beneficial specification searches and offer an opportunity to leverage machine learning. Jann Spiess, Microsoft Research New England, jspiess@stanford.edu. For their guidance I am indebted to Sendhil Mullainathan, Alberto Abadie, Elie Tamer, and Gary Chamberlain. For valuable comments or discussions I also thank Laura Blattner, Avi Feller, Edward Glaeser, Nathaniel Hendren, Simon Jäger, Maximilian Kasy, Lawrence Katz, Scott Kominers, Eben Lazarus, Shengwu Li, Mikkel Plagborg-Møller, Ashesh Rambachan, Jonathan Roth, Elizabeth Santorella, Klaus M. Schmidt, and seminar audiences at Harvard/MIT, MSR, Nortwestern, Princeton, Yale, NYU, Columbia, Chicago, Penn, Penn State, UCL, LSE, Stanford, Queen Mary, CERGE-EI, Copenhagen, UT Austin, and Toronto.

[1]  Gordon Tullock Publication Decisions and Tests of Significance—A Comment , 1959 .

[2]  Edward L. Glaeser,et al.  Researcher Incentives and Empirical Methods , 2006 .

[3]  T. Shakespeare,et al.  Observational Studies , 2003 .

[4]  J. Hahn On the Role of the Propensity Score in Efficient Semiparametric Estimation of Average Treatment Effects , 1998 .

[5]  D. Rubin [On the Application of Probability Theory to Agricultural Experiments. Essay on Principles. Section 9.] Comment: Neyman (1923) and Causal Inference in Experiments and Observational Studies , 1990 .

[6]  Edward E. Leamer,et al.  False Models and Post-Data Model Construction , 1974 .

[7]  C. Stein,et al.  Estimation with Quadratic Loss , 1992 .

[8]  T. Sterling Publication Decisions and their Possible Effects on Inferences Drawn from Tests of Significance—or Vice Versa , 1959 .

[9]  A. Sahn,et al.  Achieving Statistical Significance with Covariates , 2016 .

[10]  Jann Spiess Unbiased Shrinkage Estimation , 2017, 1708.06436.

[11]  Timothy B. Armstrong,et al.  Unbiased Instrumental Variables Estimation Under Known First-Stage Sign , 2015, 1501.06630.

[12]  Joel A. Middleton,et al.  A Class of Unbiased Estimators of the Average Treatment Effect in Randomized Experiments , 2013 .

[13]  C. Glymour,et al.  STATISTICS AND CAUSAL INFERENCE , 1985 .

[14]  B. Holmström,et al.  On incentives and control in organizations , 1977 .

[15]  P. Sørensen,et al.  Persuasion Bias in Science: Can Economics Help? , 2016 .

[16]  Bengt Holmstrom,et al.  On The Theory of Delegation , 1980 .

[17]  Francis X. Diebold,et al.  Comparing Predictive Accuracy, Twenty Years Later: A Personal Perspective on the Use and Abuse of Diebold–Mariano Tests , 2012 .

[18]  J. Robins,et al.  Semiparametric Efficiency in Multivariate Regression Models with Missing Data , 1995 .

[19]  David Lindley,et al.  Statistical Decision Functions , 1951, Nature.

[20]  Jaroslav Hájek,et al.  ASYMPTOTICALLY MOST POWERFUL RANK-ORDER TESTS' , 1962 .

[21]  D. Horvitz,et al.  A Generalization of Sampling Without Replacement from a Finite Universe , 1952 .

[22]  Gabriel D. Carroll Robustness and Linear Contracts , 2015 .

[23]  Stefan Wager,et al.  High-dimensional regression adjustments in randomized experiments , 2016, Proceedings of the National Academy of Sciences.

[24]  F. Dahl,et al.  Data splitting as a countermeasure against hypothesis fishing: with a case study of predictors for low back pain , 2008, European Journal of Epidemiology.

[25]  Gerald S. Rogers,et al.  Mathematical Statistics: A Decision Theoretic Approach , 1967 .

[26]  Leonid Hurwicz,et al.  Incentive Structures Maximizing Residual Gain Under Incomplete Information , 1978 .

[27]  E. Miguel,et al.  Transparency, Reproducibility, and the Credibility of Economics Research , 2016, Journal of Economic Literature.

[28]  Bias Reduction in Instrumental Variable Estimation through First-Stage Shrinkage , 2017, 1708.06443.

[29]  Abhijit Banerjee,et al.  Decision Theoretic Approaches to Experiment Design and External Validity , 2016 .

[30]  J. Angrist,et al.  Jackknife Instrumental Variables Estimation , 1995 .

[31]  Charles A. Ingene,et al.  Specification Searches: Ad Hoc Inference with Nonexperimental Data , 1980 .

[32]  Alexander Frankel,et al.  Aligned Delegation , 2010 .

[33]  Muriel Niederle,et al.  Pre-analysis Plans Have Limited Upside, Especially Where Replications Are Feasible , 2015 .

[34]  Michael L. Anderson,et al.  Split-Sample Strategies for Avoiding False Discoveries , 2017 .

[35]  Frank Schorfheide,et al.  On the Use of Holdout Samples for Model Selection , 2012 .

[36]  Johann A. Gagnon-Bartsch,et al.  The LOOP Estimator: Adjusting for Covariates in Randomized Experiments , 2017, Evaluation review.

[37]  Stefan Wager,et al.  Estimation and Inference of Heterogeneous Treatment Effects using Random Forests , 2015, Journal of the American Statistical Association.

[38]  A. Belloni,et al.  Least Squares After Model Selection in High-Dimensional Sparse Models , 2009, 1001.0188.

[39]  Susan Athey,et al.  Recursive partitioning for heterogeneous causal effects , 2015, Proceedings of the National Academy of Sciences.

[40]  Sendhil Mullainathan,et al.  Machine Learning: An Applied Econometric Approach , 2017, Journal of Economic Perspectives.

[41]  Abel Brodeur,et al.  Star Wars: The Empirics Strike Back , 2012, SSRN Electronic Journal.

[42]  J. Heckman,et al.  Abducting Economics. , 2017, The American economic review.

[43]  K. Hirano,et al.  Location Properties of Point Estimators in Linear Instrumental Variables and Related Models , 2015 .

[44]  A. Banerjee,et al.  A Theory of Experimenters , 2017, SSRN Electronic Journal.

[45]  Jeffrey S. Racine,et al.  Jackknife model averaging , 2012 .

[46]  J. Robins,et al.  Double/Debiased Machine Learning for Treatment and Structural Parameters , 2017 .

[47]  Frank Schorfheide,et al.  To Hold Out or Not to Hold Out , 2013 .

[48]  Cun-Hui Zhang,et al.  Lasso adjustments of treatment effect estimates in randomized experiments , 2015, Proceedings of the National Academy of Sciences.

[49]  Donald B. Rubin,et al.  Bayesian Inference for Causal Effects: The Role of Randomization , 1978 .

[50]  Maximilian Kasy,et al.  Why Experimenters Might Not Always Want to Randomize, and What They Could Do Instead , 2016, Political Analysis.

[51]  Michael C. Frank,et al.  Estimating the reproducibility of psychological science , 2015, Science.

[52]  E. Lehmann Testing Statistical Hypotheses , 1960 .

[53]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[54]  A. Belloni,et al.  Inference on Treatment Effects after Selection Amongst High-Dimensional Controls , 2011, 1201.0224.

[55]  Toniann Pitassi,et al.  The reusable holdout: Preserving validity in adaptive data analysis , 2015, Science.

[56]  Isaiah Andrews,et al.  Identification of and Correction for Publication Bias , 2017, American Economic Review.

[57]  David A. Freedman,et al.  On regression adjustments to experimental data , 2008, Adv. Appl. Math..

[58]  M. Fafchamps,et al.  Using Split Samples to Improve Inference About Causal Effects , 2016 .

[59]  Christian Hansen,et al.  Double/Debiased/Neyman Machine Learning of Treatment Effects , 2017, 1701.08687.

[60]  W. Lin,et al.  Agnostic notes on regression adjustments to experimental data: Reexamining Freedman's critique , 2012, 1208.2301.

[61]  Jonathan H. Wright,et al.  Forecasting With Model Uncertainty: Representations and Risk Reduction , 2017 .

[62]  Benjamin A. Olken,et al.  Promises and Perils of Pre-analysis Plans , 2015 .

[63]  Alwyn Young Channeling Fisher: Randomization Tests and the Statistical Insignificance of Seemingly Significant Experimental Results* , 2018, The Quarterly Journal of Economics.