hdnom: Building Nomograms for Penalized Cox Models with High-Dimensional Survival Data

Summary We developed hdnom, an R package for survival modeling with high-dimensional data. The package is the first free and open-source software package that streamlines the workflow of penalized Cox model building, validation, calibration, comparison, and nomogram visualization, with nine types of penalized Cox regression methods fully supported. A web application and an online prediction tool maker are offered to enhance interac-tivity and flexibility in high-dimensional survival analysis. Availability The hdnom R package is available from CRAN: https://cran.r-project.org/package=hdnom under GPL. The hdnom web application can be accessed at http://hdnom.io. The web application maker is available from http://hdnom.org/appmaker. The hdnom project website: http://hdnom.org. Contact qsxu@csu.edu.cn or miaozhu.li@duke.edu

[1]  Gustavo Stolovitzky,et al.  The Prostate Cancer DREAM Challenge: A Community-Wide Effort to Use Open Clinical Trial Data for the Quantitative Prediction of Outcomes in Metastatic Prostate Cancer. , 2015, The oncologist.

[2]  Hongzhe Li,et al.  VARIABLE SELECTION AND REGRESSION ANALYSIS FOR GRAPH-STRUCTURED COVARIATES WITH AN APPLICATION TO GENOMICS. , 2010, The annals of applied statistics.

[3]  Axel Benner,et al.  High‐Dimensional Cox Models: The Choice of Penalty as Part of the Model Building Process , 2010, Biometrical journal. Biometrische Zeitschrift.

[4]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[5]  Xiao Song,et al.  A semiparametric approach for the covariate specific ROC curve with survival outcome , 2008 .

[6]  Hans A. Kestler,et al.  On the validity of time-dependent AUC estimators , 2015, Briefings Bioinform..

[7]  Qing-Song Xu,et al.  Multi-step adaptive elastic-net: reducing false positives in high-dimensional variable selection , 2015 .

[8]  Arthur E. Hoerl,et al.  Ridge Regression: Biased Estimation for Nonorthogonal Problems , 2000, Technometrics.

[9]  R. Tibshirani,et al.  The solution path of the generalized lasso , 2010, 1005.1971.

[10]  Cun-Hui Zhang Nearly unbiased variable selection under minimax concave penalty , 2010, 1002.4734.

[11]  Lingmin Zeng,et al.  Group variable selection via SCAD-L2 , 2014 .

[12]  M. Stampfer,et al.  Mounting evidence for prediagnostic use of statins in reducing risk of lethal prostate cancer. , 2014, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[13]  Jian Huang,et al.  COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION. , 2011, The annals of applied statistics.

[14]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[15]  J. Shao Linear Model Selection by Cross-validation , 1993 .

[16]  Hao Helen Zhang,et al.  ON THE ADAPTIVE ELASTIC-NET WITH A DIVERGING NUMBER OF PARAMETERS. , 2009, Annals of statistics.

[17]  Qing-Song Xu,et al.  Grouping Variable Selection by Weight Fused Elastic Net for Multi-Collinear Data , 2012, Commun. Stat. Simul. Comput..

[18]  Guoqing Diao,et al.  Estimation of time‐dependent area under the ROC curve for long‐term risk prediction , 2006, Statistics in medicine.

[19]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[20]  R. Tibshirani The lasso method for variable selection in the Cox model. , 1997, Statistics in medicine.

[21]  Peter Bühlmann,et al.  High-Dimensional Statistics with a View Toward Applications in Biology , 2014 .

[22]  M. Morris,et al.  Updated prognostic model for predicting overall survival in first-line chemotherapy for patients with metastatic castration-resistant prostate cancer. , 2014, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[23]  Sunil J Rao,et al.  Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis , 2003 .

[24]  R. Tibshirani,et al.  Sparsity and smoothness via the fused lasso , 2005 .

[25]  R. Tibshirani,et al.  Spatial smoothing and hot spot detection for CGH data using the fused lasso. , 2008, Biostatistics.

[26]  Wei Pan,et al.  Predictor Network in Penalized Regression with Application to Microarray Data” , 2009 .

[27]  Tianxi Cai,et al.  Evaluating Prediction Rules for t-Year Survivors With Censored Regression Models , 2007 .

[28]  P. Royston,et al.  External validation of a Cox prognostic model: principles and methods , 2013, BMC Medical Research Methodology.

[29]  Trevor Hastie,et al.  Statistical Learning with Sparsity: The Lasso and Generalizations , 2015 .

[30]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[31]  Bin Nan,et al.  Hierarchically penalized Cox regression with grouped variables , 2009 .

[32]  Jian Lu,et al.  Prognostic and predictive value of a microRNA signature in stage II colon cancer: a microRNA expression analysis. , 2013, The Lancet. Oncology.

[33]  Yi-Zeng Liang,et al.  Monte Carlo cross validation , 2001 .

[34]  Jiang Gui,et al.  Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data , 2005, Bioinform..

[35]  G. Raj,et al.  How to build and interpret a nomogram for cancer prognosis. , 2008, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[36]  Richard M. Simon,et al.  Using cross-validation to evaluate predictive accuracy of survival risk classifiers based on high-dimensional data , 2011, Briefings Bioinform..

[37]  Jian Huang,et al.  THE Mnet method for variable selection , 2016 .

[38]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[39]  Trevor Hastie,et al.  Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent. , 2011, Journal of statistical software.

[40]  Frank E. Harrell,et al.  Regression Modeling Strategies: With Applications to Linear Models, Logistic Regression, and Survival Analysis , 2001 .

[41]  Mithat Gonen,et al.  Building a Nomogram for Survey-Weighted Cox Models Using R , 2015 .

[42]  Edi Brogi,et al.  Nomogram for predicting the risk of local recurrence after breast-conserving surgery for ductal carcinoma in situ. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[43]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .

[44]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[45]  Hao Helen Zhang,et al.  Adaptive Lasso for Cox's proportional hazards model , 2007 .