Penalized variable selection in competing risks regression

Penalized variable selection methods have been extensively studied for standard time-to-event data. Such methods cannot be directly applied when subjects are at risk of multiple mutually exclusive events, known as competing risks. The proportional subdistribution hazard (PSH) model proposed by Fine and Gray (J Am Stat Assoc 94:496–509, 1999) has become a popular semi-parametric model for time-to-event data with competing risks. It allows for direct assessment of covariate effects on the cumulative incidence function. In this paper, we propose a general penalized variable selection strategy that simultaneously handles variable selection and parameter estimation in the PSH model. We rigorously establish the asymptotic properties of the proposed penalized estimators and modify the coordinate descent algorithm for implementation. Simulation studies are conducted to demonstrate the good performance of the proposed method. Data from deceased donor kidney transplants from the United Network of Organ Sharing illustrate the utility of the proposed method.

[1]  E. Steyerberg,et al.  Prognosis Research Strategy (PROGRESS) 2: Prognostic Factor Research , 2013, PLoS medicine.

[2]  Ravi Varadhan,et al.  Model selection in competing risks regression , 2013, Statistics in medicine.

[3]  Jason Fine,et al.  Competing Risks Regression for Stratified Data , 2011, Biometrics.

[4]  Jian Huang,et al.  Group descent algorithms for nonconvex penalized linear and logistic regression models with grouped predictors , 2012, Statistics and Computing.

[5]  D Faraggi,et al.  Bayesian variable selection method for censored survival data. , 1998, Biometrics.

[6]  Cun-Hui Zhang Nearly unbiased variable selection under minimax concave penalty , 2010, 1002.4734.

[7]  Hee-Seok Oh,et al.  A new sparse variable selection via random-effect model , 2014, J. Multivar. Anal..

[8]  Xin Xin,et al.  Model determination and estimation for the growth curve model via group SCAD penalty , 2014, J. Multivar. Anal..

[9]  Runze Li,et al.  Tuning parameter selectors for the smoothly clipped absolute deviation method. , 2007, Biometrika.

[10]  Jian Huang,et al.  THE Mnet method for variable selection , 2016 .

[11]  Myriam Labopin,et al.  Competing risks regression for clustered data. , 2012, Biostatistics.

[12]  Ewout W. Steyerberg,et al.  Prognostic Models With Competing Risks: Methods and Application to Coronary Risk Prediction , 2009, Epidemiology.

[13]  Hongzhe Li,et al.  Group SCAD regression analysis for microarray time course gene expression data , 2007, Bioinform..

[14]  Chenlei Leng,et al.  Unified LASSO Estimation by Least Squares Approximation , 2007 .

[15]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[16]  Hao Helen Zhang,et al.  Adaptive Lasso for Cox's proportional hazards model , 2007 .

[17]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[18]  R. Tibshirani The lasso method for variable selection in the Cox model. , 1997, Statistics in medicine.

[19]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[20]  Trevor Hastie,et al.  Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent. , 2011, Journal of statistical software.

[21]  R. Tibshirani,et al.  PATHWISE COORDINATE OPTIMIZATION , 2007, 0708.1485.

[22]  Jian Huang,et al.  COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION. , 2011, The annals of applied statistics.

[23]  Hansheng Wang,et al.  Computational Statistics and Data Analysis a Note on Adaptive Group Lasso , 2022 .

[24]  Robert Gray,et al.  A Proportional Hazards Model for the Subdistribution of a Competing Risk , 1999 .

[25]  R. Tibshirani,et al.  Generalized additive models for medical research , 1986, Statistical methods in medical research.

[26]  H. Zou,et al.  Addendum: Regularization and variable selection via the elastic net , 2005 .

[27]  Runze Li,et al.  Statistical Challenges with High Dimensionality: Feature Selection in Knowledge Discovery , 2006, math/0602133.

[28]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[29]  Jian Huang,et al.  A Selective Review of Group Selection in High-Dimensional Models. , 2012, Statistical science : a review journal of the Institute of Mathematical Statistics.

[30]  Brent A. Johnson,et al.  Penalized Estimating Functions and Variable Selection in Semiparametric Regression Models , 2008, Journal of the American Statistical Association.

[31]  Jianqing Fan,et al.  Variable Selection for Cox's proportional Hazards Model and Frailty Model , 2002 .

[32]  Jianqing Fan,et al.  A Selective Overview of Variable Selection in High Dimensional Feature Space. , 2009, Statistica Sinica.

[33]  Fengrong Wei,et al.  Group coordinate descent algorithms for nonconvex penalized regression , 2012, Comput. Stat. Data Anal..

[34]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[35]  Runze Li,et al.  Regularization Parameter Selections via Generalized Information Criterion , 2010, Journal of the American Statistical Association.

[36]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[37]  David I. Warton,et al.  Tuning Parameter Selection for the Adaptive Lasso Using ERIC , 2015 .

[38]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[39]  Youngjo Lee,et al.  Variable selection in subdistribution hazard frailty models with competing risks data , 2014, Statistics in medicine.

[40]  Douglas E Schaubel,et al.  A Comprehensive Risk Quantification Score for Deceased Donor Kidneys: The Kidney Donor Risk Index , 2009, Transplantation.

[41]  Harald Binder,et al.  Quantifying the predictive accuracy of time‐to‐event models in the presence of competing risks , 2011, Biometrical journal. Biometrische Zeitschrift.