Martingale-based residuals for survival models

SUMMARY Graphical methods based on the analysis of residuals are considered for the setting of the highly-used Cox (1972) regression model and for the Andersen-Gill (1982) generalization of that model. We start with a class of martingale-based residuals as proposed by Barlow & Prentice (1988). These residuals and/or their transforms are useful for investigating the functional form of a covariate, the proportional hazards assumption, the leverage of each subject upon the estimates of 13, and the lack of model fit to a given subject. 1 1. Model Consider a set of n independent subjects such that the counting process Ni {Ni(t), t } O} for the ith subject in the set indicates the number of observed events experienced over time t. The sample paths of the Ni are step functions with jumps of size +1 and with Ni(0) =0. We assume that the intensity function for Ni(t) is given by Yi(t)dA{t, Zi(t)} = Yi(t) eP'Z(t) dAO(t), (1) where Yi(t) is a 0-1 process indicating whether the ith subject is a risk at time t, 13 is a vector of regression coefficients, Zi(t) is a p dimensional vector of covariate processes, and Ao is the baseline cumulative hazard function. Several familar survival models fit into this framework. The Andersen & Gill (1982) generalization of the Cox (1972) model arises when AO(t) is completely unspecified. The further restriction that Yi(t) = 1 until the first event or censoring, and is 0 thereafter yields the Cox model. The parametric form AO(t) = t yields a Poisson model, or an exponential if restricted to a single event per subject, and AO(t) = tP a Weibull model. Our attention will focus primarily on the Andersen-Gill and Cox models; however, the

[1]  D. Cox,et al.  A General Definition of Residuals , 1968 .

[2]  Wayne Nelson,et al.  Hazard Plotting for Incomplete Failure Data , 1969 .

[3]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[4]  N. Breslow Covariance analysis of censored survival data. , 1974, Biometrics.

[5]  James A. Koziol,et al.  Percentage Points of the Asymptotic Distributions of One and Two Sample K-S Statistics for Truncated or Censored Data , 1975 .

[6]  J. Crowley,et al.  Covariance Analysis of Heart Transplant Survival Data , 1977 .

[7]  R. Kay,et al.  Proportional Hazard Regression Models and the Analysis of Censored Survival Data , 1977 .

[8]  R. Gill Censoring and stochastic integrals , 1980 .

[9]  David A. Schoenfeld,et al.  Chi-squared goodness-of-fit tests for the proportional hazards regression model , 1980 .

[10]  R. Gill,et al.  Cox's regression model for counting processes: a large sample study : (preprint) , 1982 .

[11]  Richard D. Gill,et al.  Understanding Cox's regression model: a martingale approach : (preprint) , 1984 .

[12]  David A. Schoenfeld,et al.  Partial residuals for the proportional hazards regression model , 1982 .

[13]  Per Kragh Andersen,et al.  Testing Goodness of Fit of Cox's Regression and Life Model , 1982 .

[14]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[15]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[16]  F. J. Aranda-Ordaz An extension of the proportional-hazards model for grouped data. , 1983, Biometrics.

[17]  N. Lange,et al.  Approximate case influence for the proportional hazards regression model with censored data. , 1984, Biometrics.

[18]  L. J. Wei,et al.  Testing Goodness of Fit for Proportional Hazards Model with Censored Observations , 1984 .

[19]  J. Crowley,et al.  A Diagnostic for Cox Regression and General Conditional Likelihoods , 1985 .

[20]  T. Therneau,et al.  Stage D1 prostatic adenocarcinoma: significance of nuclear DNA ploidy patterns studied by flow cytometry. , 1988, Mayo Clinic proceedings.

[21]  W. Barlow,et al.  Residuals for relative risk regression , 1988 .

[22]  Stage D1 prostatic adenocarcinoma: significance of nuclear DNA ploidy patterns studied by flow cytometry. , 1988 .

[23]  P. Grambsch,et al.  Prognosis in primary biliary cirrhosis: Model for decision making , 1989, Hepatology.