Improved performance on high-dimensional survival data by application of Survival-SVM

MOTIVATION New application areas of survival analysis as for example based on micro-array expression data call for novel tools able to handle high-dimensional data. While classical (semi-) parametric techniques as based on likelihood or partial likelihood functions are omnipresent in clinical studies, they are often inadequate for modelling in case when there are less observations than features in the data. Support vector machines (svms) and extensions are in general found particularly useful for such cases, both conceptually (non-parametric approach), computationally (boiling down to a convex program which can be solved efficiently), theoretically (for its intrinsic relation with learning theory) as well as empirically. This article discusses such an extension of svms which is tuned towards survival data. A particularly useful feature is that this method can incorporate such additional structure as additive models, positivity constraints of the parameters or regression constraints. RESULTS Besides discussion of the proposed methods, an empirical case study is conducted on both clinical as well as micro-array gene expression data in the context of cancer studies. Results are expressed based on the logrank statistic, concordance index and the hazard ratio. The reported performances indicate that the present method yields better models for high-dimensional data, while it gives results which are comparable to what classical techniques based on a proportional hazard model give for clinical data.

[1]  F. Harrell,et al.  Regression models in clinical studies: determining relationships between predictors and response. , 1988, Journal of the National Cancer Institute.

[2]  Laurence L. George,et al.  The Statistical Analysis of Failure Time Data , 2003, Technometrics.

[3]  Sabine Van Huffel,et al.  On the use of a clinical kernel in survival analysis , 2010, ESANN.

[4]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2004 .

[5]  R. Prentice A LOG GAMMA MODEL AND ITS MAXIMUM LIKELIHOOD ESTIMATION , 1974 .

[6]  Jianhua Z. Huang,et al.  Functional ANOVA modeling for proportional hazards regression , 2000 .

[7]  R. Tibshirani,et al.  Prediction by Supervised Principal Components , 2006 .

[8]  D Faraggi,et al.  A neural network model for survival data. , 1995, Statistics in medicine.

[9]  Bert Kappen,et al.  Improving Cox survival analysis with a neural‐Bayesian approach , 2004, Statistics in medicine.

[10]  M J Symons,et al.  A comparison of the logistic risk function and the proportional hazards model in prospective epidemiologic studies. , 1983, Journal of chronic diseases.

[11]  Ludger Evers,et al.  Sparse kernel methods for high-dimensional survival data , 2008, Bioinform..

[12]  R. Tibshirani The lasso method for variable selection in the Cox model. , 1997, Statistics in medicine.

[13]  J. Kalbfleisch,et al.  The Statistical Analysis of Failure Time Data: Kalbfleisch/The Statistical , 2002 .

[14]  J. Goeman L1 Penalized Estimation in the Cox Proportional Hazards Model , 2009, Biometrical journal. Biometrische Zeitschrift.

[15]  J. Kalbfleisch,et al.  The Statistical Analysis of Failure Time Data , 1980 .

[16]  Lu Tian,et al.  Linking gene expression data with patient survival times using partial least squares , 2002, ISMB.

[17]  Johan A. K. Suykens,et al.  Least squares support vector machines for classification and nonlinear modelling , 2000 .

[18]  Sabine Van Huffel,et al.  Support vector machines for survival analysis , 2007 .

[19]  Wei Chu,et al.  A Support Vector Approach to Censored Targets , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[20]  Byar Dp,et al.  The choice of treatment for cancer patients based on covariate information. , 1980 .

[21]  L. Staudt,et al.  The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma. , 2002, The New England journal of medicine.

[22]  David R. Brillinger,et al.  Case studies in biometry , 1995 .

[23]  Arnoldo Frigessi,et al.  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm305 Gene expression Predicting survival from microarray data—a comparative study , 2022 .

[24]  Sabine Van Huffel,et al.  Survival SVM: a practical scalable algorithm , 2008, ESANN.

[25]  W. Sauerbrei,et al.  Randomized 2 x 2 trial evaluating hormonal treatment and the duration of chemotherapy in node-positive breast cancer patients. German Breast Cancer Study Group. , 1994, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[26]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[27]  Sabine Van Huffel,et al.  Learning Transformation Models for Ranking and Survival Analysis , 2011, J. Mach. Learn. Res..

[28]  Danh V. Nguyen,et al.  Partial least squares proportional hazard regression for application to DNA microarray survival data , 2002, Bioinform..

[29]  R. Tibshirani,et al.  Repeated observation of breast tumor subtypes in independent gene expression data sets , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[30]  M LeBlanc,et al.  Adaptive Regression Splines in the Cox Model , 1999, Biometrics.

[31]  A. Atiya,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[32]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machines , 2002 .

[33]  P. Royston,et al.  Building multivariable prognostic and diagnostic models: transformation of the predictors by using fractional polynomials , 1999 .

[34]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[35]  Ståle Nygård,et al.  Partial least squares Cox regression for genome-wide data , 2008, Lifetime data analysis.

[36]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[37]  Bart De Moor,et al.  Development of a kernel function for clinical data , 2009, 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[38]  D. Byar 46. Prognostic Variables for Survival in a Randomized Comparison of Treatments for Prostatic Cancer , 1985 .

[39]  D. F. Andrews,et al.  Prognostic Variables for Survival in a Randomized Comparison of Treatments for Prostatic Cancer , 1985 .

[40]  Paulo J. G. Lisboa,et al.  A Bayesian neural network approach for modelling censored data with an application to prognosis after surgery for breast cancer , 2003, Artif. Intell. Medicine.

[41]  E Biganzoli,et al.  Feed forward neural networks for the analysis of censored survival data: a partial logistic regression approach. , 1998, Statistics in medicine.

[42]  S Van Huffel,et al.  Additive survival least‐squares support vector machines , 2010, Statistics in medicine.

[43]  L. V. van't Veer,et al.  Cross‐validated Cox regression on microarray gene expression data , 2006, Statistics in medicine.

[44]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .

[45]  D. Hosmer,et al.  Empirical comparisons of proportional hazards, poisson, and logistic regression modeling of occupational cohort data. , 1998, American journal of industrial medicine.

[46]  M. F.,et al.  Bibliography , 1985, Experimental Gerontology.

[47]  L. Breiman Better subset regression using the nonnegative garrote , 1995 .

[48]  P. Novotny,et al.  Prospective evaluation of prognostic variables from patient-completed questionnaires. North Central Cancer Treatment Group. , 1994, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[49]  Sabine Van Huffel,et al.  Support vector methods for survival analysis: a comparison between ranking and regression approaches , 2011, Artif. Intell. Medicine.

[50]  Van,et al.  A gene-expression signature as a predictor of survival in breast cancer. , 2002, The New England journal of medicine.