Modeling Time-Varying Effects With Large-Scale Survival Data: An Efficient Quasi-Newton Approach

ABSTRACT Nonproportional hazards models often arise in biomedical studies, as evidenced by a recent national kidney transplant study. During the follow-up, the effects of baseline risk factors, such as patients’ comorbidity conditions collected at transplantation, may vary over time. To model such dynamic changes of covariate effects, time-varying survival models have emerged as powerful tools. However, traditional methods of fitting time-varying effects survival model rely on an expansion of the original dataset in a repeated measurement format, which, even with a moderate sample size, leads to an extremely large working dataset. Consequently, the computational burden increases quickly as the sample size grows, and analyses of a large dataset such as our motivating example defy any existing statistical methods and software. We propose a novel application of quasi-Newton iteration method to model time-varying effects in survival analysis. We show that the algorithm converges superlinearly and is computationally efficient for large-scale datasets. We apply the proposed methods, via a stratified procedure, to analyze the national kidney transplant data and study the impact of potential risk factors on post-transplant survival. Supplementary materials for this article are available online.

[1]  A. Karr,et al.  Nonparametric Survival Analysis with Time-Dependent Covariate Effects: A Penalized Partial Likelihood Approach , 1990 .

[2]  Stanley R. Johnson,et al.  Varying Coefficient Models , 1984 .

[3]  B. Ripley,et al.  Semiparametric Regression: Preface , 2003 .

[4]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[5]  Wei Pan,et al.  A Note on the Use of Marginal Likelihood and Conditional Likelihood in Analyzing Clustered Data , 2002 .

[6]  J. J. Moré,et al.  Quasi-Newton Methods, Motivation and Theory , 1974 .

[7]  J. D. Pearson ON VARIABLE METRIC METHODS OF MINIMIZATION , 1968 .

[8]  C. G. Broyden The Convergence of a Class of Double-rank Minimization Algorithms 1. General Considerations , 1970 .

[9]  Hans C. van Houwelingen,et al.  A fast routine for fitting Cox models with time varying effects of the covariates , 2006, Comput. Methods Programs Biomed..

[10]  Renée de Mutsert,et al.  Survival analysis: time-dependent effects and time-varying risk factors. , 2008, Kidney international.

[11]  Kamyar Kalantar-Zadeh,et al.  Causes and consequences of the reverse epidemiology of body mass index in dialysis patients. , 2005, Journal of renal nutrition : the official journal of the Council on Renal Nutrition of the National Kidney Foundation.

[12]  John D. Kalbfleisch,et al.  On Monitoring Outcomes of Medical Providers , 2013 .

[13]  P. Grambsch,et al.  Proportional hazards tests and diagnostics based on weighted residuals , 1994 .

[14]  Larry Nazareth,et al.  A family of variable metric updates , 1977, Math. Program..

[15]  H C van Houwelingen,et al.  Time-dependent effects of fixed covariates in Cox regression. , 1995, Biometrics.

[16]  Juliane Schäfer,et al.  Dynamic Cox modelling based on fractional polynomials: time‐variations in gastric cancer prognosis , 2003, Statistics in medicine.

[17]  M. Akritas Nonparametric Survival Analysis , 2004 .

[18]  R. Fletcher,et al.  A New Approach to Variable Metric Algorithms , 1970, Comput. J..

[19]  Phillip L. Gould,et al.  Time-Dependent Effects , 1994 .

[20]  R. Gray,et al.  Spline-based tests in survival analysis. , 1994, Biometrics.

[21]  D. Shanno Conditioning of Quasi-Newton Methods for Function Minimization , 1970 .

[22]  David A. Schoenfeld,et al.  Partial residuals for the proportional hazards regression model , 1982 .

[23]  Robert Gray,et al.  Flexible Methods for Analyzing Survival Data Using Splines, with Applications to Breast Cancer Prognosis , 1992 .

[24]  Zhiliang Ying,et al.  Semiparametric regression for the mean and rate functions of recurrent events , 2000 .

[25]  D. Goldfarb A family of variable-metric methods derived by variational means , 1970 .

[26]  D.,et al.  Regression Models and Life-Tables , 2022 .

[27]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[28]  Renée de Mutsert,et al.  Association between body mass index and mortality is similar in the hemodialysis population and the general population at high age and equal duration of follow-up. , 2007, Journal of the American Society of Nephrology : JASN.