Measurement Error Models with Auxiliary Data

We study the problem of parameter inference in (possibly non-linear and non-smooth) econometric models when the data are measured with error. We allow for arbitrary correlation between the true variables and the measurement errors. To solve the identification problem, we require the existence of an auxiliary data-set that contains information about the conditional distribution of the true variables given the mismeasured variables. Our main assumption requires that the conditional distribution of the true variables given the mismeasured variables is the same in the primary and auxiliary data. Our methods allow the auxiliary data to be a validation sample, where the primary and validation data are from the same distribution, and more importantly, a stratified sample where the auxiliary data-set is not from the same distribution as the primary data. We also show how to combine the two data-sets to obtain a more efficient estimator of the parameter of interest. We establish the large sample properties of the sieve based estimators under verifiable conditions. In particular, we allow for the mismeasured variables to have unbounded supports without employing the tedious trimming scheme typically used in kernel based methods. We illustrate our methods by estimating a returns to schooling censored quantile regression using the CPS/SSR 1978 exact match files where the dependent variable is measured with error of arbitrary kind. Copyright 2005, Wiley-Blackwell.

[1]  C. Bollinger,et al.  Measurement Error in the Current Population Survey: A Nonparametric Look , 1998, Journal of Labor Economics.

[2]  Joel L. Horowitz,et al.  Identification and Robustness with Contaminated and Corrupted Data , 1995 .

[3]  Jeffrey M. Wooldridge,et al.  Selection corrections for panel data models under conditional mean independence assumptions , 1995 .

[4]  D. Ruppert,et al.  Measurement Error in Nonlinear Models , 1995 .

[5]  A. Chesher The effect of measurement error , 1991 .

[6]  W. Newey,et al.  Large sample estimation and hypothesis testing , 1986 .

[7]  J. Powell,et al.  Least absolute deviations estimation for the censored regression model , 1984 .

[8]  E. Tamer,et al.  A simple estimator for nonlinear error in variable models , 2003 .

[9]  Petra E. Todd,et al.  Matching As An Econometric Evaluation Estimator , 1998 .

[10]  Halbert White,et al.  Estimation, inference, and specification analysis , 1996 .

[11]  Quang Vuong,et al.  Nonparametric Selection of Regressors: The Nonnested Case , 1996 .

[12]  Jerry A. Hausman,et al.  Nonlinear errors in variables Estimation of some Engel curves , 1995 .

[13]  A. Gallant,et al.  Semi-nonparametric Maximum Likelihood Estimation , 1987 .

[14]  Lung-fei Lee,et al.  Estimation of Linear and Nonlinear Errors-in-Variables Models Using Validation Data , 1995 .

[15]  Raymond J. Carroll,et al.  Semiparametric Estimation in Logistic Measurement Error Models , 1989 .

[16]  Whitney K. Newey Flexible Simulated Moment Estimation of Nonlinear Errors-in-Variables Models , 2001, Review of Economics and Statistics.

[17]  Xiaotong Shen,et al.  Sieve extremum estimates for weakly dependent data , 1998 .

[18]  Hidehiko Ichimura,et al.  Identification and estimation of polynomial errors-in-variables models , 1991 .

[19]  Susanne M. Schennach,et al.  Estimation of Nonlinear Models with Measurement Error , 2004 .

[20]  W. Newey,et al.  The asymptotic variance of semiparametric estimators , 1994 .

[21]  Marie-Luce Taupin,et al.  Semi-Parametric Estimation in the Nonlinear Structural Errors-in-Variables Model , 2001 .

[22]  W. Newey,et al.  16 Efficient estimation of models with conditional moment restrictions , 1993 .

[23]  Jeffrey M. Wooldridge,et al.  Inverse probability weighted M-estimators for sample selection, attrition, and stratification , 2002 .

[24]  Xiaohong Chen,et al.  Efficient Estimation of Models with Conditional Moment Restrictions Containing Unknown Functions , 2003 .

[25]  John Bound,et al.  The Extent of Measurement Error in Longitudinal Earnings Data: Do Two Wrongs Make a Right? , 1988, Journal of Labor Economics.

[26]  Tong Li,et al.  Robust and consistent estimation of nonlinear errors-in-variables models , 2002 .

[27]  John Bound,et al.  Measurement error in survey data , 2001 .

[28]  G. Duncan,et al.  Evidence on the Validity of Cross-Sectional and Longitudinal Labor Market Data , 1994, Journal of Labor Economics.

[29]  Xiaohong Chen,et al.  Semi‐Nonparametric IV Estimation of Shape‐Invariant Engel Curves , 2003 .

[30]  Moshe Buchinsky,et al.  The dynamics of changes in the female wage distribution in the USA: a quantile regression approach , 1998 .

[31]  Raymond J. Carroll,et al.  Semiparametric quasilikelihood and variance function estimation in measurement error models , 1993 .