Monte Carlo evidence on the choice between sample selection and two-part models

Abstract This Monte Carlo study examines the relative performance of sample selection and two-part models for data with a cluster at zero. The data are drawn from a bivariate normal distribution with a positive correlation. The alternative estimators are examined in terms of means squared error, mean bias and pointwise bias. The sample selection estimators include LIML and FIML. The two-part estimators include a naive (the true specification, omitting the correlation coefficient) and a data-analytic (testimator) variant. In the absence of exclusion restrictions, the two-part models are no worse, and often appreciably better than selection models in terms of mean behavior, but can behave poorly for extreme values of the independent variable. LIML had the worst performance of all four models. Empirically, selection effects are difficult to distinguish from a non-linear (e.g., quadratic) response. With exclusion restrictions, simple selection models were significantly better behaved than a naive two-part model over subranges of the data, but were negligibly better than the data-analytic version.

[1]  N. Duan Smearing Estimate: A Nonparametric Retransformation Method , 1983 .

[2]  W. V. D. Ven,et al.  The demand for deductibles in private health insurance: A probit model with sample selection , 1981 .

[3]  M. Hurd Estimation in truncated samples when there is heteroscedasticity , 1979 .

[4]  W. Manning,et al.  The demand for dental care: evidence from a randomized trial in health insurance. , 1985, Journal of the American Dental Association.

[5]  C. L. Mallows Some comments on C_p , 1973 .

[6]  Takeshi Amemiya,et al.  Regression Analysis when the Dependent Variable is Truncated Normal , 1973 .

[7]  J. Heckman The Common Structure of Statistical Models of Truncation, Sample Selection and Limited Dependent Variables and a Simple Estimator for Such Models , 1976 .

[8]  R. Olsen,et al.  A Least Squares Correction for Selectivity Bias , 1980 .

[9]  T. Amemiya Tobit models: A survey , 1984 .

[10]  Randall J. Olsen,et al.  Let Them Eat Cake: A Note on Comparing Alternative Models of the Demand for Medical Care , 1984 .

[11]  Willard G. Manning,et al.  Choosing Between the Sample-Selection Model and the Multi-Part Model , 1984 .

[12]  F. Mehran Variance of the MVUE for the Lognormal Mean , 1973 .

[13]  C E Phelps,et al.  Some interim results from a controlled trial of cost sharing in health insurance. , 1981, The New England journal of medicine.

[14]  R. Little,et al.  A note about models for selectivity bias. , 1985 .

[15]  J. Tobin Estimation of Relationships for Limited Dependent Variables , 1958 .

[16]  Paul A. Ruud,et al.  On the appropriateness of endogenous switching , 1981 .

[17]  Arthur S. Goldberger,et al.  ABNORMAL SELECTION BIAS , 1983 .

[18]  J. Heckman Sample selection bias as a specification error , 1979 .

[19]  W. Manning,et al.  A controlled trial of the effect of a prepaid group practice on use of services. , 1984, The New England journal of medicine.

[20]  Lung-fei Lee,et al.  Tests for the Bivariate Normal Distribution in Econometric Models with Selectivity , 1984 .

[21]  A survey of the literature on selectivity bias as it pertains to health care markets. , 1985 .

[22]  P. Schmidt,et al.  Limited-Dependent and Qualitative Variables in Econometrics. , 1984 .

[23]  J. Heckman Shadow prices, market wages, and labor supply , 1974 .

[24]  C. Morris,et al.  A Comparison of Alternative Models for the Demand for Medical Care , 1983 .