Goftte: A R package for assessing goodness-of-fit in proportional (sub) distributions hazards regression models

BACKGROUND AND OBJECTIVE In this paper, we introduce a new R package goftte for goodness-of-fit assessment based on cumulative sums of model residuals useful for checking key assumptions in the Cox regression and Fine and Gray regression models. METHODS Monte-Carlo methods are used to approximate the null distribution of cumulative sums of model residuals. To limit the computational burden, the main routines used to approximate the null distributions are implemented in a parallel C++ programming environment. Numerical studies are carried out to evaluate the empirical type I error rates of the different testing procedures. The package and the documentation are available to users from CRAN R repositories. RESULTS Results from simulation studies suggested that all statistical tests implemented in goftte yielded excellent control of the type I error rate even with modest sample sizes with high censoring rates. CONCLUSIONS As compared to other R packages goftte provides new useful method for testing functionals, such as Anderson-Darling type test statistics for checking assumptions about proportional (sub-) distribution hazards. Approximations for the null distributions of test statistics have been validated through simulation experiments. Future releases will provide similar tools for checking model assumptions in multiplicative intensity models for recurrent data. The package may help to spread the use of recent advocated goodness-of-fit techniques in semiparametric regression for time-to-event data.

[1]  John D. Kalbfleisch,et al.  Misspecified proportional hazard models , 1986 .

[2]  Wenbin Lu,et al.  A Monte Carlo approach for change‐point detection in the Cox proportional hazards model , 2008, Statistics in medicine.

[3]  Z. Ying,et al.  Checking the Cox model with cumulative sums of martingale-based residuals , 1993 .

[4]  R. Tremblay,et al.  Childhood Hyperactivity, Physical Aggression and Criminality: A 19-Year Prospective Population-Based Study , 2013, PloS one.

[5]  Thomas A. Gerds,et al.  On functional misspecification of covariates in the Cox regression model , 2001 .

[6]  L. J. Wei,et al.  The Robust Inference for the Cox Proportional Hazards Model , 1989 .

[7]  P. Grambsch,et al.  Prognosis in primary biliary cirrhosis: Model for decision making , 1989, Hepatology.

[8]  Chih-Ling Tsai,et al.  Functional Form Diagnostics for Cox's Proportional Hazards Model , 2004, Biometrics.

[9]  Tests for the Proportional Intensity Assumption Based on the Score Process , 2004, Lifetime data analysis.

[10]  Martin Schumacher,et al.  Proportional subdistribution hazards modeling offers a summary analysis, even if misspecified , 2010, Statistics in medicine.

[11]  Robert Gray,et al.  A Proportional Hazards Model for the Subdistribution of a Competing Risk , 1999 .

[12]  Xiao-Sheng Si,et al.  An Adaptive Prognostic Approach via Nonlinear Degradation Modeling: Application to Battery Data , 2015, IEEE Transactions on Industrial Electronics.

[13]  D. Cox Regression Models and Life-Tables , 1972 .

[14]  D. Weatherburn Effect of prison on adult re-offending, The , 2010 .

[15]  S. W. Lagakos The loss in efficiency from misspecifying covariates in proportional hazards regression models , 1988 .

[16]  Ming-Chang Lee,et al.  Business Bankruptcy Prediction Based on Survival Analysis Approach , 2014 .

[17]  S W Lagakos,et al.  Properties of proportional-hazards score tests under misspecified regression models. , 1984, Biometrics.

[18]  Mei-Jie Zhang,et al.  Checking Fine and Gray subdistribution hazards model with cumulative sums of residuals , 2014, Lifetime Data Analysis.

[19]  Suresh K. Choubey,et al.  Failure event prediction using the Cox proportional hazard model driven by frequent failure signatures , 2007 .

[20]  S Chevret,et al.  Misspecified regression model for the subdistribution hazard of a competing risk , 2007, Statistics in medicine.

[21]  R. Cook,et al.  Goodness‐of‐fit test for monotone proportional subdistribution hazards assumptions based on weighted residuals , 2017, Statistics in medicine.

[22]  Yaguo Lei,et al.  Degradation data analysis and remaining useful life estimation: A review on Wiener-process-based methods , 2018, Eur. J. Oper. Res..