Survival analysis for recurrent event data: an application to childhood infectious diseases.

Many extensions of survival models based on the Cox proportional hazards approach have been proposed to handle clustered or multiple event data. Of particular note are five Cox-based models for recurrent event data: Andersen and Gill (AG); Wei, Lin and Weissfeld (WLW); Prentice, Williams and Peterson, total time (PWP-CP) and gap time (PWP-GT); and Lee, Wei and Amato (LWA). Some authors have compared these models by observing differences that arise from fitting the models to real and simulated data. However, no attempt has been made to systematically identify the components of the models that are appropriate for recurrent event data. We propose a systematic way of characterizing such Cox-based models using four key components: risk intervals; baseline hazard; risk set, and correlation adjustment. From the definitions of risk interval and risk set there are conceptually seven such Cox-based models that are permissible, five of which are those previously identified. The two new variant models are termed the 'total time - restricted' (TT-R) and 'gap time - unrestricted' (GT-UR) models. The aim of the paper is to determine which models are appropriate for recurrent event data using the key components. The models are fitted to simulated data sets and to a data set of childhood recurrent infectious diseases. The LWA model is not appropriate for recurrent event data because it allows a subject to be at risk several times for the same event. The WLW model overestimates treatment effect and is not recommended. We conclude that PWP-GT and TT-R are useful models for analysing recurrent event data, providing answers to slightly different research questions. Further, applying a robust variance to any of these models does not adequately account for within-subject correlation.

[1]  H. Goldstein Multilevel Statistical Models , 2006 .

[2]  T M Therneau,et al.  rhDNase as an example of recurrent event analysis. , 1997, Statistics in medicine.

[3]  X H Zhou,et al.  An empirical comparison of two semi-parametric approaches for the estimation of covariate effects from multivariate failure time data. , 1997, Statistics in medicine.

[4]  U Barai,et al.  Multiple statistics for multiple events, with application to repeated infections in the growth factor studies. , 1997, Statistics in medicine.

[5]  R. J. Spiegel,et al.  DISCUSSION OF PAPER BY WEI AND GLIDDEN , 1997 .

[6]  L J Wei,et al.  An overview of statistical methods for multiple failure time data in clinical trials. , 1997, Statistics in medicine.

[7]  D. Schoenfeld,et al.  Analysis of multiple failure time data from an AIDS clinical trial. , 1997, Statistics in medicine.

[8]  L. Moulton,et al.  Vitamin A supplementation fails to reduce incidence of acute respiratory illness and diarrhea in preschool-age Indonesian children. , 1996, The Journal of nutrition.

[9]  R Crouchley,et al.  A comparison of frailty models for multivariate survival data. , 1995, Statistics in medicine.

[10]  D. Lin,et al.  Cox regression analysis of multivariate failure time data: the marginal approach. , 1994, Statistics in medicine.

[11]  D. Clayton Some approaches to the analysis of recurrent event data , 1994, Statistical methods in medical research.

[12]  J P Klein,et al.  Semiparametric estimation of random effects using the Cox model based on the EM algorithm. , 1992, Biometrics.

[13]  R. Gill,et al.  Cox's regression model for counting processes: a large sample study : (preprint) , 1982 .

[14]  A. V. Peterson,et al.  On the regression analysis of multivariate failure time data , 1981 .

[15]  Richard M. Royall,et al.  Variance Estimation in Finite Population Sampling , 1978 .

[16]  Tx Station Stata Statistical Software: Release 7. , 2001 .

[17]  Lee-Jen Wei,et al.  Cox-Type Regression Analysis for Large Numbers of Small Groups of Correlated Failure Time Observations , 1992 .

[18]  David Oakes,et al.  Frailty Models For Multiple Event Times , 1992 .

[19]  L. J. Wei,et al.  Regression analysis of multivariate incomplete failure time data by modeling marginal distributions , 1989 .

[20]  L. J. Wei,et al.  The Robust Inference for the Cox Proportional Hazards Model , 1989 .

[21]  H. White Maximum Likelihood Estimation of Misspecified Models , 1982 .

[22]  D. Cox Regression Models and Life-Tables , 1972 .