Semiparametric likelihood estimation in survival models with informative censoring

Semiparametric proportional hazard regression models are the cornerstone in modern survival analysis. Most estimation methodologies developed in the literature, such as the famous partial likelihood based estimation, are built on the ground that the censoring is noninformative. However, in many applications, the censoring is indeed informative. In this paper, we study the survival regression models with an informative censoring that is easy to detect and apply. A very important problem in practice is how to estimate the survival models more efficiently with the information from the informative censoring. We propose a semiparametric maximum likelihood approach that is easily implementable to estimate both the nonparametric baseline hazard and the parametric coefficients in the survival models with informative censoring. Different from the methods in the literature, we do not apply least informative approach to the baseline, which does not work well in our simulation. We solve the difficulty in semiparametric estimation by suggesting an indirect application of local kernel smoothing to the baseline. Asymptotic theory of the proposed estimators is established under informative and noninformative likelihoods, respectively. We suggest a cross-validation method to detect the informative censoring in application. The performances of the estimators in finite samples are investigated by Monte Carlo simulation. Both asymptotic theory and simulation show that the suggested semiparametric approach provides more efficient estimators of the parameters for informative censoring, and estimates the baseline function accurately. The proposed method is applied to analyse the data about the infants hospitalised for pneumonia, which leads to interesting findings.

[1]  Jianwen Cai,et al.  Partially linear hazard regression with varying coefficients for multivariate survival data , 2008 .

[2]  J. Copas,et al.  Sensitivity analysis for informative censoring in parametric survival models. , 2005, Biostatistics.

[3]  Lagakos Sw General right censoring and its impact on the analysis of survival data. , 1979 .

[4]  Jianqing Fan,et al.  Generalized Partially Linear Single-Index Models , 1997 .

[5]  R. Serfling Approximation Theorems of Mathematical Statistics , 1980 .

[6]  R. Little,et al.  Proportional hazards regression with missing covariates , 1999 .

[7]  Xin-Yuan Song,et al.  Local Polynomial Fitting in Semivarying Coefficient Model , 2002 .

[8]  Jianqing Fan,et al.  Statistical Estimation in Varying-Coefficient Models , 1999 .

[9]  Jianqing Fan,et al.  Simultaneous Confidence Bands and Hypothesis Testing in Varying‐coefficient Models , 2000 .

[10]  Trevor Hastie,et al.  Polynomial splines and their tensor products in extended linear modeling. Discussion and rejoinder , 1997 .

[11]  Thomas M. Stoker,et al.  Semiparametric Estimation of Index Coefficients , 1989 .

[12]  Fotios Siannis,et al.  Applications of a Parametric Model for Informative Censoring , 2004, Biometrics.

[13]  H. Tong,et al.  Estimation of the covariance matrix of random effects in longitudinal studies , 2007, 0803.4112.

[14]  Michael R. Kosorok,et al.  Robust Inference for Univariate Proportional Hazards Frailty Regression Models , 2004 .

[15]  Runze Li,et al.  Variable Selection in Semiparametric Regression Modeling. , 2008, Annals of statistics.

[16]  Local partial likelihood estimation in proportional hazards regression , 2007, 0708.2150.

[17]  James M. Robins,et al.  Estimation of the failure time distribution in the presence of informative censoring , 2002 .

[18]  D B Dunson,et al.  Factor Analytic Models of Clustered Multivariate Data with Informative Censoring , 2001, Biometrics.

[19]  Yingcun Xia,et al.  Dimension Reduction and Semiparametric Estimation of Survival Models , 2010 .

[20]  Donglin Zeng,et al.  Maximum likelihood estimation in semiparametric regression models with censored data , 2007, Statistica Sinica.

[21]  Jon A. Wellner,et al.  Information bounds for Cox regression models with missing data , 2004, math/0406452.

[22]  D.,et al.  Regression Models and Life-Tables , 2022 .

[23]  Irène Gijbels,et al.  Local likelihood and local partial likelihood in hazard regression , 1997 .

[24]  Jianqing Fan,et al.  Local polynomial modelling and its applications , 1994 .

[25]  Dag Tjøstheim,et al.  Adaptive varying-coefficient linear models for stochastic processes: asymptotic theory , 2007 .

[26]  Br Uce E. Ha,et al.  UNIFORM CONVERGENCE RATES FOR KERNEL ESTIMATION WITH DEPENDENT DATA , 2008 .

[27]  James A. Koziol,et al.  A Cramér-von Mises statistic for randomly censored data , 1976 .

[28]  James J. Heckman,et al.  The identifiability of the competing risks model , 1989 .

[29]  Jianqing Fan,et al.  Variable Selection for Cox's proportional Hazards Model and Frailty Model , 2002 .

[30]  R. Tibshirani,et al.  Generalized Additive Models , 1991 .

[31]  Torben Martinussen,et al.  Maximum Likelihood Estimation for Cox's Regression Model Under Case–Cohort Sampling , 2004 .

[32]  Arvid Lundervold,et al.  Exploring spatial nonlinearity using additive approximation , 2007 .

[33]  J. Cleland The Determinants of Reproductive Change in Bangladesh: Success in a Challenging Environment , 1994 .

[34]  Young K. Truong,et al.  Polynomial splines and their tensor products in extended linear modeling: 1994 Wald memorial lecture , 1997 .

[35]  W. Hoeffding A Class of Statistics with Asymptotically Normal Distribution , 1948 .

[36]  Jianqing Fan,et al.  Profile likelihood inferences on semiparametric varying-coefficient partially linear models , 2005 .

[37]  William A. Link,et al.  A Model for Informative Censoring , 1989 .

[38]  R. Tibshirani,et al.  Varying‐Coefficient Models , 1993 .

[39]  M. Yuan Semiparametric censorship model with covariates , 2005 .

[40]  Jiti Gao,et al.  Estimation in semiparametric spatial regression , 2003 .

[41]  S W Lagakos,et al.  General right censoring and its impact on the analysis of survival data. , 1979, Biometrics.