A General Weighted Average Representation of the Ordinary and Two-Stage Least Squares Estimands

It is standard practice in applied work to study the effect of a binary variable ("treatment") on an outcome of interest using linear models with additive effects. In this paper I study the interpretation of the ordinary and two-stage least squares estimands in such models when treatment effects are in fact heterogeneous. I show that in both cases the coefficient on treatment is identical to a convex combination of two other parameters (different for OLS and 2SLS), which can be interpreted as the average treatment effects on the treated and controls under additional assumptions. Importantly, the OLS and 2SLS weights on these parameters are inversely related to the proportion of each group. The more units get treatment, the less weight is placed on the effect on the treated. What follows, the reliance on these implicit weights can have serious consequences for applied work. I illustrate some of these issues in four empirical applications from different fields of economics. I also develop a weighted least squares correction and simple diagnostic tools that applied researchers can use to avoid potential biases. In an important special case, my diagnostics only require the knowledge of the proportion of treated units.

[1]  James J. Heckman,et al.  Characterizing Selection Bias Using Experimental Data , 1998 .

[2]  Isaiah Andrews On the structure of IV estimands , 2019, Journal of Econometrics.

[3]  J. Heckman Micro Data, Heterogeneity, and the Evaluation of Public Policy: Nobel Lecture , 2001, Journal of Political Economy.

[4]  A. Deaton The Analysis of Household Surveys : A Microeconometric Approach to Development Policy , 1997 .

[5]  S. Yitzhaki On Using Linear Regressions in Welfare Economics , 1996 .

[6]  Liyang Sun,et al.  Estimating Dynamic Treatment Effects in Event Studies With Heterogeneous Treatment Effects , 2018, Journal of Econometrics.

[7]  Matthew Wiswall,et al.  What Linear Estimators Miss: The Effects of Family Income on Child Outcomes , 2011 .

[8]  Nathan Nunn,et al.  Commercial Imperialism? Political Influence and Trade During the Cold War , 2010 .

[9]  Joshua D. Angrist,et al.  Mostly Harmless Econometrics: An Empiricist's Companion , 2008 .

[10]  Pedro H. C. Sant'Anna,et al.  Difference-in-Differences with Multiple Time Periods , 2018, Journal of Econometrics.

[11]  E. Bettinger,et al.  Virtual Classrooms: How Online College Courses Affect Student Success , 2017 .

[12]  J. Angrist,et al.  Identification and Estimation of Local Average Treatment Effects , 1994 .

[13]  Rosa L. Matzkin,et al.  Control functions in nonseparable simultaneous equations models , 2014 .

[14]  Parag A. Pathak,et al.  Forced Sales and House Prices , 2009 .

[15]  L. Telser,et al.  Iterative Estimation of a Set of Linear Regression Equations , 1964 .

[16]  Lowell J. Taylor,et al.  The Impact of the Great Migration on Mortality of African Americans: Evidence from the Deep South. , 2015, The American economic review.

[17]  Magne Mogstad,et al.  Beyond LATE with a Discrete Instrument , 2017, Journal of Political Economy.

[18]  James J Heckman,et al.  Understanding Instrumental Variables in Models with Essential Heterogeneity , 2006, The Review of Economics and Statistics.

[19]  M. Kolesár,et al.  Inference in Instrumental Variables Analysis with Heterogeneous Treatment E ects ∗ , 2017 .

[20]  J. Hausman Specification tests in econometrics , 1978 .

[21]  W. Rhodes Heterogeneous Treatment Effects: What Does a Regression Estimate? , 2010, Evaluation review.

[22]  J. Wooldridge Control Function Methods in Applied Econometrics , 2015, The Journal of Human Resources.

[23]  Damon Clark,et al.  The Long-Run Effects of Attending an Elite School: Evidence from the UK , 2014, SSRN Electronic Journal.

[24]  R. Oaxaca Male-Female Wage Differentials in Urban Labor Markets , 1973 .

[25]  Kosuke Imai,et al.  When Should We Use Unit Fixed Effects Regression Models for Causal Inference with Longitudinal Data? , 2019, American Journal of Political Science.

[26]  R. Olsen,et al.  A Least Squares Correction for Selectivity Bias , 1980 .

[27]  Christopher R. Taber,et al.  An Evaluation of Instrumental Variable Strategies for Estimating the Effects of Catholic Schooling , 2002, The Journal of Human Resources.

[28]  Sharon Belenzon,et al.  Eponymous Entrepreneurs ⇤ , 2014 .

[29]  Bryan S. Graham,et al.  Semiparametrically Efficient Estimation of the Average Linear Regression Function , 2018, Journal of Econometrics.

[30]  J. Angrist,et al.  Two-Stage Least Squares Estimation of Average Causal Effects in Models with Variable Treatment Intensity , 1995 .

[31]  J. Heckman Dummy Endogenous Variables in a Simultaneous Equation System , 1977 .

[32]  Stelios Michalopoulos,et al.  The Long-Run E ff ects of the Scramble for Africa , 2012 .

[33]  D. Freedman Statistical Models and Causal Inference: On Regression Adjustments in Experiments with Several Treatments , 2008, 0803.3757.

[34]  M. Kolesár ESTIMATION IN AN INSTRUMENTAL VARIABLES MODEL WITH TREATMENT EFFECT HETEROGENEITY , 2012 .

[35]  Patrick M. Kline,et al.  On Heckits, Late, and Numerical Equivalence , 2017, Econometrica.

[36]  Anton Strezhnev Semiparametric weighting estimators for multi-period difference-in-differences designs , 2018 .

[37]  B. Kiker Male-female wage differentials: Additional evidence , 1978 .

[38]  Markus Frölich,et al.  Nonparametric IV Estimation of Local Average Treatment Effects with Covariates , 2002, SSRN Electronic Journal.

[39]  P. Lundborg,et al.  UvA-DARE ( Digital Academic Repository ) Can women have children and a career ? IV evidence from IVF treatments , 2017 .

[40]  Nico Voigtländer,et al.  Persecution Perpetuated: The Medieval Origins of Anti-Semitic Violence in Nazi Germany , 2012 .

[41]  A. Aizer,et al.  The Long-Run Impact of Cash Transfers to Poor Families. , 2016, The American economic review.

[42]  Mark C. Berger,et al.  Is the Threat of Reemployment Services More Effective than the Services Themselves? Evidence from Random Assignment in the UI System * , 2003 .

[43]  Monica Costa Dias,et al.  Alternative approaches to evaluation in empirical microeconomics , 2002, The Journal of Human Resources.

[44]  Ahmed Khwaja,et al.  A comparison of treatment effects estimators using a structural model of AMI treatment choices and severity of illness information from hospital charts , 2011 .

[45]  What Mean Impacts Miss , 2004 .

[46]  J. Wooldridge Fixed-Effects and Related Estimators for Correlated Random-Coefficient and Treatment-Effect Panel Data Models , 2005, Review of Economics and Statistics.

[47]  Jeffrey M. Wooldridge,et al.  What Are We Weighting For? , 2013, The Journal of Human Resources.

[48]  J. Heckman The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models , 1976 .

[49]  D. Rubin,et al.  Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction , 2016 .

[50]  Does Disability Insurance Receipt Discourage Work? Using Examiner Assignment to Estimate Causal Effects of SSDI Receipt , 2012 .

[51]  Michael D. Frakes The Impact of Medical Liability Standards on Regional Variations in Physician Behavior: Evidence from the Adoption of National-Standard Rules , 2013 .

[52]  Yuya Sasaki,et al.  ON USING LINEAR QUANTILE REGRESSIONS FOR CAUSAL INFERENCE , 2017, Econometric Theory.

[53]  A. Blinder Wage Discrimination: Reduced Form and Structural Estimates , 1973 .

[54]  D. Almond,et al.  The Costs of Low Birth Weight , 2004 .

[55]  R. Moffitt Estimating Marginal Treatment Effects in Heterogeneous Populations , 2008 .

[56]  Susan Athey,et al.  Design-Based Analysis in Difference-in-Differences Settings with Staggered Adoption , 2018, Journal of Econometrics.

[57]  Xavier Jaravel,et al.  Revisiting Event Study Designs , 2017 .

[58]  J. Angrist,et al.  Estimating the Labor Market Impact of Voluntary Military Service Using Social Security Data on Military Applicants , 1995 .

[59]  J. Angrist,et al.  Identification and Estimation of Local Average Treatment Effects , 1995 .

[60]  R. Schwabe,et al.  Monitoring Corruptible Politicians , 2016 .

[61]  Patrick M. Kline A Note on Variance Estimation for the Oaxaca Estimator of Average Treatment Effects , 2014 .

[62]  Karthik Muralidharan,et al.  Quality and Accountability in Healthcare Delivery: Audit-Study Evidence from Primary Care in India , 2015, The American economic review.

[63]  Cyrus Samii,et al.  Does Regression Produce Representative Estimates of Causal Effects? , 2015 .

[64]  Susan Athey,et al.  Sampling‐Based versus Design‐Based Uncertainty in Regression Analysis , 2017, Econometrica.

[65]  H. White Using Least Squares to Approximate Unknown Regression Functions , 1980 .

[66]  D. Atkin The Caloric Costs of Culture: Evidence from Indian Migrants , 2013 .

[67]  P. Holland Statistics and Causal Inference , 1985 .

[68]  Taryn Dinkelman The Effects of Rural Electrification on Employment: New Evidence from South Africa , 2011 .

[69]  R. Lalonde Evaluating the Econometric Evaluations of Training Programs with Experimental Data , 1984 .

[70]  Damon Clark,et al.  The Long-Run Effects of Attending an Elite School: Evidence from the United Kingdom , 2016 .

[71]  A. Alesina,et al.  On the Origins of Gender Roles: Women and the Plough , 2011, SSRN Electronic Journal.

[72]  M. Parey,et al.  Downloading Wisdom from Online Crowds , 2007, SSRN Electronic Journal.

[73]  Justine S. Hastings,et al.  School Choice, School Quality and Postsecondary Attainment , 2011, The American economic review.

[74]  Justin L. Tobias,et al.  Simple Estimators for Treatment Parameters in a Latent-Variable Framework , 2003, Review of Economics and Statistics.

[75]  Alberto Abadie Semiparametric instrumental variable estimation of treatment response models , 2003 .

[76]  E. Moretti,et al.  Estimating and Testing Models with Many Treatment Levels and Limited Instruments , 2011, Review of Economics and Statistics.

[77]  Rembert De Blander,et al.  Mostly Harmless Econometrics: An Empiricist's Companion , 2011 .

[78]  R. Frisch,et al.  Partial Time Regressions as Compared with Individual Trends , 1933 .

[79]  W. Lin,et al.  Agnostic notes on regression adjustments to experimental data: Reexamining Freedman's critique , 2012, 1208.2301.

[80]  Guido W. Imbens,et al.  Matching Methods in Practice: Three Examples , 2014, The Journal of Human Resources.

[81]  Matias D. Cattaneo,et al.  Econometric Methods for Program Evaluation , 2018, Annual Review of Economics.

[82]  Xavier D’Haultfœuille,et al.  Two-Way Fixed Effects Estimators with Heterogeneous Treatment Effects , 2019 .

[83]  Andrew Goodman-Bacon Difference-in-Differences with Variation in Treatment Timing , 2018, Journal of Econometrics.

[84]  Monica Martinez-Bravo,et al.  The Role of Local Officials in New Democracies: Evidence from Indonesia , 2014 .

[85]  J. Ludwig,et al.  The Effects of Housing Assistance on Labor Supply: Evidence from a Voucher Lottery , 2008 .

[86]  Peter Hull Estimating Treatment Effects in Mover Designs , 2018, 1804.06721.

[87]  Jeffrey M. Woodbridge Econometric Analysis of Cross Section and Panel Data , 2002 .

[88]  Jeremiah Dittmar Information Technology and Economic Change: The Impact of The Printing Press , 2011 .

[89]  Jeffrey M. Wooldridge,et al.  Solutions Manual and Supplementary Materials for Econometric Analysis of Cross Section and Panel Data , 2003 .

[90]  Alberto Abadie Semiparametric Difference-in-Differences Estimators , 2005 .

[91]  Jonah B. Gelbach,et al.  Distributional Impacts of the Self-Sufficiency Project , 2005 .

[92]  Jeffrey A. Smith,et al.  Does Matching Overcome Lalonde's Critique of Nonexperimental Estimators? , 2000 .

[93]  Patrick M. Kline Oaxaca-Blinder as a Reweighting Estimator , 2011 .

[94]  Michael B. Urbancic,et al.  Broken or Fixed Effects? , 2014 .

[95]  Todd E. Elder,et al.  Unexplained Gaps and Oaxaca-Blinder Decompositions , 2009, SSRN Electronic Journal.

[96]  Petra Moser,et al.  German-Jewish Emigres and U.S. Invention , 2013 .

[97]  David A. Freedman,et al.  On regression adjustments to experimental data , 2008, Adv. Appl. Math..

[98]  Izzet Kale,et al.  Virtual classroom , 2001, Proceedings IEEE International Conference on Advanced Learning Technologies.

[99]  Guido W. Imbens,et al.  The Interpretation of Instrumental Variables Estimators in Simultaneous Equations Models with an Application to the Demand for Fish , 2000 .

[100]  M. Humphreys Bounds on least squares estimates of causal effects in the presence of heterogeneous assignment probabilities , 2009 .