A support vector machine based semiparametric mixture cure model

The mixture cure model is an extension of standard survival models to analyze survival data with a cured fraction. Many developments in recent years focus on the latency part of the model to allow more flexible modeling strategies for the distribution of uncured subjects, and fewer studies focus on the incidence part to model the probability of being uncured/cured. We propose a new mixture cure model that employs the support vector machine (SVM) to model the covariate effects in the incidence part of the cure model. The new model inherits the features of the SVM to provide a flexible model to assess the effects of covariates on the incidence. Unlike the existing nonparametric approaches for the incidence part, the SVM method also allows for potentially high-dimensional covariates in the incidence part. Semiparametric models are also allowed in the latency part of the proposed model. We develop an estimation method to estimate the cure model and conduct a simulation study to show that the proposed model outperforms existing cure models, particularly in incidence estimation. An illustrative example using data from leukemia patients is given.

[1]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[2]  Meng Mao,et al.  Semiparametric Efficient Estimation for a Class of Generalized Proportional Odds Cure Models , 2010, Journal of the American Statistical Association.

[3]  Sudipto Banerjee,et al.  Analysis of cure rate survival data under proportional odds model , 2011, Lifetime data analysis.

[4]  Jiajia Zhang,et al.  An alternative estimation method for the accelerated failure time frailty model , 2007, Comput. Stat. Data Anal..

[5]  John Platt,et al.  Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[6]  Thiago G. Ramires,et al.  Estimating nonlinear effects in the presence of cure fraction using a semi-parametric regression model , 2018, Comput. Stat..

[7]  K. Dear,et al.  A Nonparametric Mixture Model for Cure Rate Estimation , 2000, Biometrics.

[8]  J. P. Sy,et al.  Estimation in a Cox Proportional Hazards Cure Model , 2000, Biometrics.

[9]  Yingwei Peng,et al.  Nonparametric cure rate estimation with covariates , 2014 .

[10]  V. Farewell,et al.  The use of mixture models for the analysis of survival data with long-term survivors. , 1982, Biometrics.

[11]  Chin-Shang Li,et al.  A semi‐parametric accelerated failure time cure model , 2002, Statistics in medicine.

[12]  Christophe Mues,et al.  Mixture cure models in credit scoring: If and when borrowers default , 2012, Eur. J. Oper. Res..

[13]  Vernon T. Farewell,et al.  Mixture models in survival analysis: Are they worth the risk? , 1986 .

[14]  Prediction accuracy for the cure probabilities in mixture cure models , 2017, Statistical methods in medical research.

[15]  Pierre Joly,et al.  A SAS macro for parametric and semiparametric mixture cure models , 2007, Comput. Methods Programs Biomed..

[16]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[17]  J. Klein,et al.  Survival Analysis: Techniques for Censored and Truncated Data , 1997 .

[18]  Guosheng Yin,et al.  Cure Rate Quantile Regression for Censored Data With a Survival Fraction , 2013 .

[19]  N. Breslow Covariance analysis of censored survival data. , 1974, Biometrics.

[20]  J M Taylor,et al.  Semi-parametric estimation in failure time mixture models. , 1995, Biometrics.

[21]  Yingwei Peng,et al.  Accelerated hazards mixture cure model , 2009, Lifetime data analysis.

[22]  Anthony Y. C. Kuk,et al.  A mixture model combining logistic regression with proportional hazards regression , 1992 .

[23]  J W Denham,et al.  The follicular non-Hodgkin's lymphomas--I. The possibility of cure. , 1996, European journal of cancer.

[24]  John P. Klein,et al.  Treatment for acute myelocytic leukemia with allogeneic bone marrow transplantation following preparation with BuCy2. , 1991 .

[25]  M. Amalia Jácome,et al.  Nonparametric incidence estimation and bootstrap bandwidth selection in mixture cure models , 2017, Comput. Stat. Data Anal..

[26]  Chao Cai,et al.  smcure: An R-package for estimating semiparametric mixture cure models , 2012, Comput. Methods Programs Biomed..

[27]  Yingwei Peng,et al.  A new estimation method for the semiparametric accelerated failure time mixture cure model , 2007, Statistics in medicine.

[28]  J. Boag,et al.  Maximum Likelihood Estimates of the Proportion of Patients Cured by Cancer Therapy , 1949 .

[29]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[30]  Yingwei Peng,et al.  Fitting semiparametric cure models , 2003, Comput. Stat. Data Anal..