Regression calibration in failure time regression.

In this paper we study a regression calibration method for failure time regression analysis when data on some covariates are missing or mismeasured. The method estimates the missing data based on the data structure estimated from a validation data set, a random subsample of the study cohort in which covariates are always observed. Ordinary Cox (1972; Journal of the Royal Statistical Society, Series B 34, 187-220) regression is then applied to estimate the regression coefficients, using the observed covariates in the validation data set and the estimated covariates in the nonvalidation data set. The method can be easily implemented. We present the asymptotic theory of the proposed estimator. Finite sample performance is examined and compared with an estimated partial likelihood estimator and other related methods via simulation studies, where the proposed method performs well even though it is technically inconsistent. Finally, we illustrate the method with a mouse leukemia data set.

[1]  D.Sc. Joseph Berkson Are there Two Regressions , 1950 .

[2]  D. Horvitz,et al.  A Generalization of Sampling Without Replacement from a Finite Universe , 1952 .

[3]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[4]  Marcel G. Dagenais,et al.  The use of incomplete observations in multiple regression analysis: A generalized least squares approach , 1973 .

[5]  Robert V. Foutz,et al.  On the Unique Consistent Solution to the Likelihood Equations , 1977 .

[6]  R. Gill,et al.  Cox's regression model for counting processes: a large sample study : (preprint) , 1982 .

[7]  J. Kalbfleisch,et al.  The Statistical Analysis of Failure Time Data , 1980 .

[8]  R. Prentice Covariate measurement errors and parameter estimation in a failure time regression model , 1982 .

[9]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[10]  R. L. Prentice,et al.  A case-cohort design for epidemiologic cohort studies and disease prevention trials , 1986 .

[11]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[12]  R. Prentice,et al.  Further results on covariate measurement errors in cohort studies with time to response data. , 1989, Statistics in medicine.

[13]  B Rosner,et al.  Correction of logistic regression relative risk estimates and confidence intervals for systematic within-person measurement error. , 2006, Statistics in medicine.

[14]  Raymond J. Carroll,et al.  Approximate Quasi-likelihood Estimation in Models with Surrogate Predictors , 1990 .

[15]  Daniel O. Stram,et al.  The Errors-in-Variables Problem: Considerations Provided by Radiation Dose-Response Analyses of the A-Bomb Survivor Data , 1992 .

[16]  Z. Ying,et al.  Cox Regression with Incomplete Covariate Measurements , 1993 .

[17]  M. Hughes,et al.  Regression dilution in the proportional hazards model. , 1993, Biometrics.

[18]  Raymond J. Carroll,et al.  A Semiparametric Correction for Attenuation , 1994 .

[19]  J. Robins,et al.  Estimation of Regression Coefficients When Some Regressors are not Always Observed , 1994 .

[20]  R. Carroll,et al.  Measurement error, instrumental variables and corrections for attenuation with applications to meta-analyses. , 1994, Statistics in medicine.

[21]  W. Barlow,et al.  Robust variance estimation for the case-cohort design. , 1994, Biometrics.

[22]  M. Pepe,et al.  Auxiliary covariate data in failure time regression , 1995 .

[23]  D. Ruppert,et al.  Measurement Error in Nonlinear Models , 1995 .

[24]  L. Zhao,et al.  Weighted Semiparametric Estimation in Regression Analysis with Missing Covariate Data , 1997 .