A flexible, computationally efficient method for fitting the proportional hazards model to interval‐censored data

The proportional hazards model (PH) is currently the most popular regression model for analyzing time-to-event data. Despite its popularity, the analysis of interval-censored data under the PH model can be challenging using many available techniques. This article presents a new method for analyzing interval-censored data under the PH model. The proposed approach uses a monotone spline representation to approximate the unknown nondecreasing cumulative baseline hazard function. Formulating the PH model in this fashion results in a finite number of parameters to estimate while maintaining substantial modeling flexibility. A novel expectation-maximization (EM) algorithm is developed for finding the maximum likelihood estimates of the parameters. The derivation of the EM algorithm relies on a two-stage data augmentation involving latent Poisson random variables. The resulting algorithm is easy to implement, robust to initialization, enjoys quick convergence, and provides closed-form variance estimates. The performance of the proposed regression methodology is evaluated through a simulation study, and is further illustrated using data from a large population-based randomized trial designed and sponsored by the United States National Cancer Institute.

[1]  G. Satten Rank-based inference in the proportional hazards model for interval censored data , 1996 .

[2]  J. B. Rosen The Gradient Projection Method for Nonlinear Programming. Part I. Linear Constraints , 1960 .

[3]  Xiaoyan Lin,et al.  A Bayesian approach for analyzing case 2 interval-censored data under the semiparametric proportional odds model , 2011 .

[4]  Jianguo Sun,et al.  Interval Censoring , 2003 .

[5]  David B Dunson,et al.  Semiparametric bayes' proportional odds models for current status data with underreporting. , 2011, Biometrics.

[6]  Shuangge Ma,et al.  Survival Analysis in Medicine and Genetics , 2013 .

[7]  J. Ramsay Monotone Regression Splines in Action , 1988 .

[8]  Rebecca A Betensky,et al.  A local likelihood proportional hazards model for interval censored data , 2002, Statistics in medicine.

[9]  J. Gohagan,et al.  Prostate cancer screening in the randomized Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial: mortality results after 13 years of follow-up. , 2012, Journal of the National Cancer Institute.

[10]  Ying Zhang,et al.  On Algorithms for the Nonparametric Maximum Likelihood Estimator of the Failure Function With Censored Data , 2004 .

[11]  Yu Shen,et al.  A Semiparametric Regression Cure Model for Interval-Censored Data , 2009, Journal of the American Statistical Association.

[12]  B. Turnbull The Empirical Distribution Function with Arbitrarily Grouped, Censored, and Truncated Data , 1976 .

[13]  J. B. Rosen The gradient projection method for nonlinear programming: Part II , 1961 .

[14]  A M Zaslavsky,et al.  A Markov chain Monte Carlo EM algorithm for analyzing interval-censored data under the Cox proportional hazards model. , 1998, Biometrics.

[15]  Joshua M Tebbs,et al.  Regression analysis for current status data using the EM algorithm , 2013, Statistics in medicine.

[16]  Xiaoyan Lin,et al.  A semiparametric probit model for case 2 interval‐censored failure time data , 2010, Statistics in medicine.

[17]  Wei Pan,et al.  Extending the Iterative Convex Minorant Algorithm to the Cox Model for Interval Censored Data , 2011 .

[18]  L Ryan,et al.  Semiparametric Regression Analysis of Interval‐Censored Data , 2000, Biometrics.

[19]  D. Finkelstein,et al.  A proportional hazards model for interval-censored failure time data. , 1986, Biometrics.

[20]  Jian Huang,et al.  A Spline‐Based Semiparametric Maximum Likelihood Estimation Method for the Cox Model with Interval‐Censored Data , 2010 .

[21]  Jialiang Li,et al.  Semiparametric varying-coefficient model for interval censored data with a cured proportion. , 2014, Statistics in medicine.

[22]  Ulrich Mansmann,et al.  Iterated Convex Minorant Algorithm for interval censored eventdata , 2013 .

[23]  D.,et al.  Regression Models and Life-Tables , 2022 .

[24]  G. Rücker,et al.  Remission duration: an example of interval-censored observations. , 1988, Statistics in medicine.

[25]  Klaus Langohr,et al.  Tutorial on methods for interval-censored data and their implementation in R , 2009 .

[26]  T. Cai,et al.  Hazard Regression for Interval‐Censored Data with Penalized Spline , 2003, Biometrics.

[27]  T. Louis Finding the Observed Information Matrix When Using the EM Algorithm , 1982 .

[28]  P. Rosenberg,et al.  Hazard function estimation using B-splines. , 1995, Biometrics.

[29]  W Pan,et al.  A Multiple Imputation Approach to Cox Regression with Interval‐Censored Data , 2000, Biometrics.

[30]  Paul D. Allison,et al.  Survival analysis using sas®: a practical guide , 1995 .

[31]  R B D'Agostino,et al.  Maximum likelihood estimation for interval-censored data using a Weibull-based accelerated failure time model. , 1992, Biometrics.

[32]  Xiaoyan Lin,et al.  Bayesian proportional hazards model for current status data with monotone splines , 2011, Comput. Stat. Data Anal..