Model building in nonproportional hazard regression

Recent developments of statistical methods allow for a very flexible modeling of covariates affecting survival times via the hazard rate, including also the inspection of possible time-dependent associations. Despite their immediate appeal in terms of flexibility, these models typically introduce additional difficulties when a subset of covariates and the corresponding modeling alternatives have to be chosen, that is, for building the most suitable model for given data. This is particularly true when potentially time-varying associations are given. We propose to conduct a piecewise exponential representation of the original survival data to link hazard regression with estimation schemes based on of the Poisson likelihood to make recent advances for model building in exponential family regression accessible also in the nonproportional hazard regression context. A two-stage stepwise selection approach, an approach based on doubly penalized likelihood, and a componentwise functional gradient descent approach are adapted to the piecewise exponential regression problem. These three techniques were compared via an intensive simulation study. An application to prognosis after discharge for patients who suffered a myocardial infarction supplements the simulation to demonstrate the pros and cons of the approaches in real data analyses.

[1]  L. Fahrmeir,et al.  A Mixed Model Approach for Geoadditive Hazard Regression , 2007 .

[2]  M. Abrahamowicz,et al.  Empirical study of the dependence of the results of multivariable flexible survival analyses on model selection strategy , 2008, Statistics in medicine.

[3]  L. Fahrmeir,et al.  Geoadditive Survival Models , 2006 .

[4]  Ralf Bender,et al.  Generating survival times to simulate Cox proportional hazards models , 2005, Statistics in medicine.

[5]  R. Tibshirani,et al.  Varying‐Coefficient Models , 1993 .

[6]  Peter Buhlmann,et al.  BOOSTING ALGORITHMS: REGULARIZATION, PREDICTION AND MODEL FITTING , 2007, 0804.2752.

[7]  R. Giugliano,et al.  Association of Hemoglobin Levels With Clinical Outcomes in Acute Coronary Syndromes , 2005, Circulation.

[8]  Gerhard Tutz,et al.  Variable Selection and Model Choice in Geoadditive Regression Models , 2009, Biometrics.

[9]  G. Stone,et al.  Impact of Leukocyte Count on Mortality and Bleeding in Patients With Myocardial Infarction Undergoing Primary Percutaneous Coronary Interventions: Analysis From the Harmonizing Outcome With Revascularization and Stent in Acute Myocardial Infarction Trial , 2011, Circulation.

[10]  G. Tutz,et al.  Generalized Additive Modeling with Implicit Variable Selection by Likelihood‐Based Boosting , 2006, Biometrics.

[11]  Nan M. Laird,et al.  Covariance Analysis of Censored Survival Data Using Log-Linear Analysis Techniques , 1981 .

[12]  P. Royston,et al.  Building multivariable prognostic and diagnostic models: transformation of the predictors by using fractional polynomials , 1999 .

[13]  Marco Valgimigli,et al.  Standardized Bleeding Definitions for Cardiovascular Clinical Trials: A Consensus Report From the Bleeding Academic Research Consortium , 2011, Circulation.

[14]  R Roberts,et al.  Thrombolysis in Myocardial Infarction (TIMI) Trial, Phase I: A comparison between intravenous tissue plasminogen activator and intravenous streptokinase. Clinical findings through hospital discharge. , 1987, Circulation.

[15]  S. Wood Thin plate regression splines , 2003 .

[16]  Giampiero Marra,et al.  Practical variable selection for generalized additive models , 2011, Comput. Stat. Data Anal..

[17]  Benjamin Hofner,et al.  Building Cox-type structured hazard regression models with time-varying effects , 2011 .

[18]  Paul W Dickman,et al.  Regression models for relative survival , 2004, Statistics in medicine.

[19]  Torsten Hothorn,et al.  Model-based Boosting 2.0 , 2010, J. Mach. Learn. Res..

[20]  P. Royston,et al.  Selection of important variables and determination of functional form for continuous predictors in multivariable model building , 2007, Statistics in medicine.

[21]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .

[22]  P. Royston,et al.  Regression using fractional polynomials of continuous covariates: parsimonious parametric modelling. , 1994 .

[23]  T. Holford The analysis of rates and of survivorship using log-linear models. , 1980, Biometrics.