Bi-level variable selection in semiparametric transformation models with right-censored data

In this article, we investigate bi-level variable selection in semiparametric transformation models when a grouping structure of the covariates is available. This large class of transformation models includes the Cox proportional hazards model and the proportional odds model as special cases. For this class of models, there are only a few works on variable selection, and all of the existing methods select at the individual variable level. To fill the gap of variable selection at both the group and individual levels, we propose a penalized nonparametric maximum likelihood estimation method with three different penalties, namely group bridge (GB), adaptive group bridge (AGB) and composite group bridge (CGB), and develop their respective computational algorithms. Further, we prove that the estimators resulting from AGB and CGB have desirable oracle properties. Our simulation studies demonstrate that all three penalties work well in bi-level variable selection, while AGB and CGB outperform GB when within-group sparsity is present. The proposed methods are applied to two real datasets for illustration.
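To make the idea of a bi-level penalty concrete, the following is a minimal sketch of the standard group bridge penalty from the group-selection literature, lam * sum_j c_j * (||beta_Aj||_1)^gamma with 0 < gamma < 1. The function name, the default gamma = 0.5, and the weight choice c_j = |A_j|^(1 - gamma) are assumptions made for illustration; the abstract does not state the exact form used in the article, and the AGB and CGB variants are not shown.

```python
def group_bridge_penalty(beta, groups, lam=1.0, gamma=0.5):
    """Illustrative group bridge penalty (hypothetical helper, not the article's code).

    beta   : sequence of regression coefficients
    groups : list of index lists, one list per covariate group A_j
    lam    : overall tuning parameter (assumed name)
    gamma  : bridge exponent in (0, 1); 0.5 is an assumed default
    """
    total = 0.0
    for idx in groups:
        # L1 norm of the coefficients belonging to this group
        l1_norm = sum(abs(beta[k]) for k in idx)
        # Assumed group weight that adjusts for group size
        weight = len(idx) ** (1.0 - gamma)
        # Concave power of the group L1 norm
        total += weight * l1_norm ** gamma
    return lam * total


# Example: the first group is entirely zero, the second is active.
example_beta = [0.0, 0.0, 1.0, -3.0]
example_groups = [[0, 1], [2, 3]]
example_value = group_bridge_penalty(example_beta, example_groups)
```

In this form, a group whose coefficients are all zero contributes nothing, and the concave power applied to the group-level L1 norm is what allows an entire group to be dropped while the L1 norm inside each group still permits individual zeros.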
