Estimation of Partial Linear Error-in-Variables Models with Validation Data

Consider the partial linear models of the formY=X?s+g(T)+e, where thep-variate explanatoryXis erroneously measured, and bothTand the responseYare measured exactly. LetXbe the surrogate variable forXwith measurement error. Let the primary data set be that containing independent observations on (Y,X,T) and the validation data set be that containing independent observations on (X,X,T), where the exact observations onXmay be obtained by some expensive or difficult procedures for only a small subset of subjects enrolled in the study. In this paper, without specifying any structure equation and the distribution assumption ofXgivenX, a semiparametric method with the primary data is employed to obtain the estimators ofsandg(·) based on the least-squares criterion with the help of validation data. The proposed estimators are proved to be strongly consistent. The asymptotic representation and the asymptotic normality of the estimator ofsare derived, respectively. The rate of the weak consistency of the estimator ofg(·) is also obtained.

[1]  Margaret S. Pepe,et al.  Inference using surrogate outcome data and a validation sample , 1992 .

[2]  B. L. S. Prakasa Rao Asymptotic theory of statistical inference , 1987 .

[3]  Raymond J. Carroll,et al.  Semiparametric quasilikelihood and variance function estimation in measurement error models , 1993 .

[4]  A. A. Weiss,et al.  Semiparametric estimates of the relation between weather and electricity sales , 1986 .

[5]  Grace Wahba,et al.  Partial Spline Models for the Inclusion of Tropopause and Frontal Boundary Information in Otherwise Smooth Two- and Three-Dimensional Objective Analysis , 1986 .

[6]  Hung Chen,et al.  Convergence Rates for Parametric Components in a Partly Linear Model , 1988 .

[7]  Raymond J. Carroll,et al.  On errors-in-variables for binary regression models , 1984 .

[8]  Zhongguo Zheng,et al.  Asymptotic properties for the semiparametric regression model with randomly censored data , 1997 .

[9]  A. Seheult,et al.  Analysis of Field Experiments by Least Squares Smoothing , 1985 .

[10]  G. Wahba Partial and interaction spline models for the semiparametric estimation of functions of several variables , 1986 .

[11]  Thomas R. Fleming,et al.  A Nonparametric Method for Dealing with Mismeasured Covariate Data , 1991 .

[12]  P. Speckman Kernel smoothing in partial linear models , 1988 .

[13]  Yasuo Amemiya,et al.  Estimation for the Multivariate Errors-in-Variables Model with Estimated Error Covariance Matrix , 1984 .

[14]  Prakasa Rao Nonparametric functional estimation , 1983 .

[15]  Nancy E. Heckman,et al.  Spline Smoothing in a Partly Linear Model , 1986 .

[16]  Raymond J. Carroll,et al.  A Semiparametric Correction for Attenuation , 1994 .

[17]  J. Wittes,et al.  Surrogate endpoints in clinical trials: cardiovascular diseases. , 1989, Statistics in medicine.

[18]  Lung-fei Lee,et al.  Semiparametric estimation of nonlinear errors-in-variables models with validation study , 1995 .

[19]  Greg J. Duncan,et al.  An Investigation of the Extent and Consequences of Measurement Error in Labor-Economic Survey Data , 1985, Journal of Labor Economics.