Models for Sample Selection Bias

When observations in social research are selected so that they are not independent of the outcome variables in a study, sample selection leads to biased inferences about social processes. Nonrandom selection is both a source of bias in empirical research and a fundamental aspect of many social processes. This chapter reviews models that attempt to take account of sample selection and their applications in research on labor markets, schooling, legal processes, social mobility, and social networks. Variants of these models apply to outcome variables that are censored or truncated—whether explicitly or incidentally—and include the tobit model, the standard selection model, models for treatment effects in quasi-experimental designs, and endogenous switching models. Heckman’s two-stage estimator is the most widely used approach to selection bias, but its results may be sensitive to violations of its assumptions about the way that selection occurs. Recent econometric research has developed a wide variety of pro...

[37]  Christopher Winship,et al.  THE PARADOX OF LESSENING RACIAL INEQUALITY AND JOBLESSNESS AMONG BLACK YOUTH: ENROLLMENT, ENLISTMENT, AND EMPLOYMENT, 1964-1981* , 1984 .

[38]  Marjorie S. Zatz,et al.  Crime, time, and punishment: An exploration of selection bias in sentencing research , 1985 .

[39]  J. Heckman,et al.  Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market , 1985, Journal of Political Economy.

[40]  J. Heckman,et al.  Longitudinal Analysis of Labor Market Data: Alternative methods for evaluating the impact of interventions , 1985 .

[41]  James J. Heckman,et al.  Alternative methods for solving the problem of selection bias in evaluating the impact of treatments , 1986 .

[42]  Paul A. Ruud,et al.  Consistent estimation of limited dependent variable models despite misspecification of distribution , 1986 .

[43]  P. Holland Statistics and Causal Inference , 1985 .

[44]  Howard Wainer,et al.  Drawing inferences from self-selected samples , 1986 .

[45]  G. Chamberlain Asymptotic efficiency in semi-parametric models with censoring , 1986 .

[46]  Robert L,et al.  How Precise Are Evaluations of Employment and Training Programs , 1987 .

[47]  Stanley Lieberson,et al.  Making It Count: The Improvement of Social Research and Theory. , 1987 .

[48]  T. Mroz,et al.  The Sensitivity of an Empirical Model of Married Women's Hours of Work to Economic and Statistical Assumptions , 1987 .

[49]  H. Bierens Advances in Econometrics: Kernel estimators of regression functions , 1987 .

[50]  Peter V. Marsden,et al.  Small networks and selectivity bias in the analysis of survey network data , 1987 .

[51]  D. Rubin,et al.  Statistical Analysis with Missing Data. , 1989 .

[52]  P. Robinson ROOT-N-CONSISTENT SEMIPARAMETRIC REGRESSION , 1988 .

[53]  Richard A. Berk,et al.  Causal inference for sociological data. , 1988 .

[54]  J. S. Long,et al.  Endogenous Switching Regression Models for the Causes and Effects of Discrete Variables , 1988 .

[55]  B. Singer,et al.  Causality in the Social Sciences , 1988 .

[56]  George Farkas,et al.  Explaining Occupational Sex Segregation and Wages: Findings from a Model with Fixed Effects , 1988 .

[57]  C. Manski Anatomy of the Selection Problem , 1989 .

[58]  Thomas M. Stoker,et al.  Semiparametric Estimation of Index Coefficients , 1989 .

[59]  M. Hardy Estimating Selection Effects in Occupational Mobility in a 19th-Century City , 1989 .

[60]  R. Mare,et al.  Secondary School Tracking and Educational Inequality: Compensation, Reinforcement, or Neutrality? , 1989, American Journal of Sociology.

[61]  C. Manski Nonparametric Bounds on Treatment Effects , 1989 .

[62]  Brian Powell,et al.  Acquiring Capital for College: The Constraints of Family Configuration , 1989 .

[63]  James L. Powell,et al.  Semiparametric Estimation of Selection Models: Some Empirical Results , 1990 .

[64]  James J. Heckman,et al.  Self-Selection and the Distribution of Hourly Wages , 1990, Journal of Labor Economics.

[65]  W. Härdle Applied Nonparametric Regression , 1991 .

[66]  J. Hagan The Gender Stratification of Income Inequality Among Lawyers , 1990 .

[67]  I. Garfinkel,et al.  Inequality in divorce settlements: An investigation of property settlements and child support awards , 1990 .

[68]  Bo E. Honoré,et al.  The Empirical Content of the Roy Model , 1990 .

[69]  D. Relles,et al.  Theory Testing in a World of Constrained Research Design , 1990 .

[70]  Charles F. Manski,et al.  The Selection Problem , 1990 .

[71]  Tony Lancaster,et al.  The econometric analysis of transition data , 1990 .

[72]  J. Seltzer Legal Custody Arrangements and Children's Economic Welfare , 1991, American Journal of Sociology.

[73]  J. Powell,et al.  Nonparametric and Semiparametric Methods in Econometrics and Statistics , 1993 .

[74]  Charles F. Manski,et al.  Alternative Estimates of the Effect of Family Structure during Adolescence on High School Graduation , 1992 .

[75]  J. Powell,et al.  Semiparametric estimation of censored selection models with a nonparametric selection mechanism , 1993 .

[76]  John Hagan,et al.  White Collar Crime and Punishment , 1994 .