Estimating Heterogeneous Treatment Effects in the Presence of Self-Selection : A Propensity Score Perspective *

An essential feature common to all empirical social research is variability across units of analysis. Individuals differ not only in background characteristics and outcomes of interest, but also in how they respond to a particular treatment, intervention, or stimulation. Moreover, individuals may self-select into treatment on the basis of their anticipated treatment effects. To study heterogeneous treatment effects in the presence of self-selection, Heckman and his colleagues have developed a structural approach that builds on the marginal treatment effect (MTE). In this paper, we propose a revision of the MTE-based approach through a redefinition of the MTE. Specifically, we redefine MTE as the expected treatment effect conditional on the propensity score (instead of the vector of observed covariates) as well as a latent variable representing unobserved resistance to treatment. The redefined MTE is superior to the original MTE in a number of aspects. First, while it is conditional on a unidimensional summary of covariates, it is sufficient to capture all of the treatment effect heterogeneity that is consequential for selection bias. Second, the new MTE is a bivariate function, thus much easier than the original MTE to be visualized. Third, as with the original MTE, the new MTE can also be used as a building block with which to estimate causal effects in the aggregate. However, the weights associated with the new MTE are simpler, more intuitive, and much easier to implement. Finally, the newly defined MTE allows us to directly assess treatment effect heterogeneity among individuals who are at the margin of treatment. As a result, it can be used to evaluate a wide range of policy changes with little analytical effort, and to design policy interventions that maximize the marginal benefits of treatment.

[1]  Xiaoping Zhou,et al.  Propensity Score–based Methods Versus MTE-based Methods in Causal Inference , 2016, Sociological methods & research.

[2]  R. Breen,et al.  Heterogeneous causal effects and sample selection bias , 2015 .

[3]  Yu Xie,et al.  Population heterogeneity and causal inference , 2013, Proceedings of the National Academy of Sciences.

[4]  Yu Xie,et al.  Estimating Heterogeneous Treatment Effects with Observational Data , 2012, Sociological methodology.

[5]  Michael Hout,et al.  Social and Economic Returns to College Education in the United States , 2012 .

[6]  A. Krueger,et al.  Estimating the Return to College Selectivity Over the Career Using Administrative Earning Data , 2011, SSRN Electronic Journal.

[7]  Jennifer L. Hill,et al.  Bayesian Nonparametric Modeling for Causal Inference , 2011 .

[8]  Edward Vytlacil,et al.  Estimating Marginal Returns to Education , 2010, The American economic review.

[9]  M. Gangl Causal Inference in Sociological Research , 2010 .

[10]  R. Dechter,et al.  Heuristics, Probability and Causality. A Tribute to Judea Pearl , 2010 .

[11]  Edward Vytlacil,et al.  Evaluating Marginal Policy Changes and the Average Effect of Treatment for Individuals at the Margin , 2009, Econometrica : journal of the Econometric Society.

[12]  Douglas B. Downey,et al.  Passing the Torch: Does Higher Education for the Disadvantaged Pay Off across the Generations?By Paul Attewell and David E. Lavin. New York: Russell Sage Foundation, 2007. Pp. xiii+268. $39.95. , 2009 .

[13]  Arne Henningsen,et al.  Sample Selection Models in R: Package sampleSelection , 2008 .

[14]  M. Hout Passing the Torch: Does Higher Education for the Disadvantaged Pay Off Across the Generations? , 2008 .

[15]  S. Mcnally,et al.  Vive la Révolution! Long‐Term Educational Returns of 1968 to the Angry Students , 2008, Journal of Labor Economics.

[16]  Christopher Winship,et al.  Counterfactuals and Causal Inference: Methods and Principles for Social Research , 2007 .

[17]  Gary King,et al.  Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference , 2007, Political Analysis.

[18]  Yu Xie,et al.  Otis Dudley Duncan's legacy: The demographic approach to quantitative reasoning in social science☆ , 2007 .

[19]  J. Heckman,et al.  Policy-Relevant Treatment Effects , 2001 .

[20]  J. Robins,et al.  Marginal Structural Models and Causal Inference in Epidemiology , 2000, Epidemiology.

[21]  Edward Vytlacil,et al.  Local Instrumental Variables , 2000 .

[22]  P. Smock,et al.  The Effect of Marriage and Divorce on Women's Economic Well-Being , 1999, American Sociological Review.

[23]  Christopher Winship,et al.  THE ESTIMATION OF CAUSAL EFFECTS FROM OBSERVATIONAL DATA , 1999 .

[24]  J J Heckman,et al.  Local instrumental variables and latent variable models for identifying and bounding treatment effects. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[25]  James C. Blackburn The Shape of the River , 1999 .

[26]  D. Woodruff,et al.  The Shape of the River: Long-Term Consequences of Considering Race in College and University Admissions , 1998 .

[27]  J. Angrist,et al.  Identification and Estimation of Local Average Treatment Effects , 1994 .

[28]  Joshua D. Angrist,et al.  Identification of Causal Effects Using Instrumental Variables , 1993 .

[29]  Meichu D. Chen,et al.  INEQUALITY AND ATTAINMENT IN A DUAL LABOR MARKET , 1991 .

[30]  R. Mare,et al.  Secondary School Tracking and Educational Inequality: Compensation, Reinforcement, or Neutrality? , 1989, American Journal of Sociology.

[31]  Robert A. Moffitt,et al.  The Estimation of Wage Gains and Welfare Gains in Self-selection , 1987 .

[32]  P. Holland Statistics and Causal Inference , 1985 .

[33]  D. Rubin,et al.  The central role of the propensity score in observational studies for causal effects , 1983 .

[34]  P. Schmidt,et al.  An Investigation of the Robustness of the Tobit Estimator to Non-Normality , 1982 .

[35]  J. Heckman Dummy Endogenous Variables in a Simultaneous Equation System , 1977 .

[36]  D. Rubin Estimating causal effects of treatments in randomized and nonrandomized studies. , 1974 .

[37]  D. Campbell,et al.  Regression-Discontinuity Analysis: An Alternative to the Ex-Post Facto Experiment , 1960 .

[38]  A. Roy Some thoughts on the distribution of earnings , 1951 .

[39]  R. Moffitt Estimating Marginal Treatment Effects in Heterogeneous Populations , 2008 .

[40]  James J. Heckman,et al.  Econometric Evaluation of Social Programs, Part I: Causal Models, Structural Models and Econometric Policy Evaluation , 2007 .

[41]  E. Vytlacil Independence, Monotonicity, and Latent Index Models: An Equivalence Result , 2002 .

[42]  Petra E. Todd,et al.  IDENTIFICATION AND ESTIMATION OF TREATMENT EFFECTS WITH A REGRESSION-DISCONTINUITY DESIGN , 2001 .

[43]  Francine D. Blau,et al.  Handbook of Labor Economics , 1999 .

[44]  Jianqing Fan,et al.  Local polynomial modelling and its applications , 1994 .

[45]  Christopher Winship,et al.  Models for Sample Selection Bias , 1992 .