Analyzing Length-Biased Data With Semiparametric Transformation and Accelerated Failure Time Models

Right-censored time-to-event data are often observed from a cohort of prevalent cases that are subject to length-biased sampling. Informative right censoring of data from the prevalent cohort within the population often makes it difficult to model risk factors on the unbiased failure times for the general population, because the observed failure times are length biased. In this paper, we consider two classes of flexible semiparametric models: the transformation models and the accelerated failure time models, to assess covariate effects on the population failure times by modeling the length-biased times. We develop unbiased estimating equation approaches to obtain the consistent estimators of the regression coefficients. Large sample properties for the estimators are derived. The methods are confirmed through simulations and illustrated by application to data from a study of a prevalent cohort of dementia patients.

[1]  Donglin Zeng,et al.  Maximum likelihood estimation in semiparametric regression models with censored data , 2007, Statistica Sinica.

[2]  Marvin Zelen,et al.  On the theory of screening for chronic diseases , 1969 .

[3]  Y. Vardi,et al.  Nonparametric Estimation in the Presence of Length Bias , 1982 .

[4]  Richard D. Gill,et al.  Large sample theory of empirical distributions in biased sampling models , 1988 .

[5]  M S Pepe,et al.  Weighted Kaplan-Meier statistics: a class of distance tests for censored survival data. , 1989, Biometrics.

[6]  Ya'acov Ritov,et al.  Estimation in a Linear Regression Model with Censored Data , 1990 .

[7]  Kjell A. Doksum,et al.  Estimation and Testing in a Two-Sample Generalized Odds-Rate Model , 1988 .

[8]  C. Begg On the use of familial aggregation in population-based case probands for calculating penetrance. , 2002, Journal of the National Cancer Institute.

[9]  Y. Vardi Empirical Distributions in Selection Bias Models , 1985 .

[10]  R. Prentice Linear rank tests with right censored data , 1978 .

[11]  Marvin Zelen,et al.  Forward and Backward Recurrence Times and Length Biased Sampling: Age Specific Models , 2004, Lifetime data analysis.

[12]  J. Kalbfleisch,et al.  The Statistical Analysis of Failure Time Data , 1980 .

[13]  Olcay Akman,et al.  Transformations of the Lognormal Distribution as a Selection Model , 2000 .

[14]  Z. Ying,et al.  Analysis of transformation models with censored data , 1995 .

[15]  Laurence L. George,et al.  The Statistical Analysis of Failure Time Data , 2003, Technometrics.

[16]  Zhiliang Ying,et al.  Large Sample Theory of a Modified Buckley-James Estimator for Regression Analysis with Censored Data , 1991 .

[17]  Lee-Jen Wei,et al.  Combining dependent tests with incomplete repeated measurements , 1985 .

[18]  I. James,et al.  Linear regression with censored data , 1979 .

[19]  R. Milner Mathematical Centre Tracts , 1976 .

[20]  Masoud Asgharian,et al.  Asymptotic behavior of the unconditional NPMLE of the length-biased survivor function from right censored prevalent cohort data , 2005, math/0602239.

[21]  B. Turnbull The Empirical Distribution Function with Arbitrarily Grouped, Censored, and Truncated Data , 1976 .

[22]  H. D. Miller,et al.  The Theory Of Stochastic Processes , 1977, The Mathematical Gazette.

[23]  Donglin Zeng,et al.  Efficient Estimation for the Accelerated Failure Time Model , 2007 .

[24]  David B Wolfson,et al.  Length-Biased Sampling With Right Censoring , 2002 .

[25]  Estimation of Regression Parameters and the Hazard Function in Transformed Linear Survival Models , 2000, Biometrics.

[26]  R Simon,et al.  Length biased sampling in etiologic studies. , 1980, American journal of epidemiology.

[27]  R. Gill Censoring and stochastic integrals , 1980 .

[28]  Yehuda Vardi,et al.  Multiplicative censoring, renewal processes, deconvolution and decreasing density: Nonparametric estimation , 1989 .

[29]  David B Wolfson,et al.  A formal test for the stationarity of the incidence rate using data from a prevalent cohort study with follow-up , 2006, Lifetime data analysis.

[30]  David B Wolfson,et al.  Checking stationarity of the incidence rate using prevalent cohort survival data , 2006, Statistics in medicine.

[31]  Thomas R. Fleming,et al.  Weighted Kaplan‐Meier Statistics: Large Sample and Optimality Considerations , 1991 .

[32]  V. De Gruttola,et al.  Nonparametric analysis of truncated survival data, with application to AIDS , 1988 .

[33]  T Ostbye,et al.  A reevaluation of the duration of survival after the onset of dementia. , 2001, The New England journal of medicine.

[34]  Ing Rj Ser Approximation Theorems of Mathematical Statistics , 1980 .

[35]  Mei-Cheng Wang,et al.  Hazards regression analysis for length-biased data , 1996 .

[36]  Mei-Cheng Wang,et al.  Nonparametric Estimation from Cross-Sectional Survival Data , 1991 .

[37]  Y. D. Semiparametric inference for the accelerated life model with time-dependent covariates , 1993 .

[38]  A. Tsiatis Estimating Regression Parameters Using Linear Rank Tests for Censored Data , 1990 .

[39]  Jason P. Fine,et al.  Analysing competing risks data with transformation models , 1999 .