Support Vector Hazards Machine: A Counting Process Framework for Learning Risk Scores for Censored Outcomes

Learning risk scores to predict dichotomous or continuous outcomes using machine learning approaches has been studied extensively. However, how to learn risk scores for time-to-event outcomes subject to right censoring has received little attention until recently. Existing approaches rely on inverse probability weighting or rank-based regression, which may be inefficient. In this paper, we develop a new support vector hazards machine (SVHM) approach to predict censored outcomes. Our method is based on predicting the counting process associated with the time-to-event outcomes among subjects at risk via a series of support vector machines. Introducing counting processes to represent time-to-event data leads to a connection between support vector machines in supervised learning and hazards regression in standard survival analysis. To account for different at risk populations at observed event times, a time-varying offset is used in estimating risk scores. The resulting optimization is a convex quadratic programming problem that can easily incorporate non-linearity using kernel trick. We demonstrate an interesting link from the profiled empirical risk function of SVHM to the Cox partial likelihood. We then formally show that SVHM is optimal in discriminating covariate-specific hazard function from population average hazard function, and establish the consistency and learning rate of the predicted risk using the estimated risk scores. Simulation studies show improved prediction accuracy of the event times using SVHM compared to existing machine learning methods and standard conventional approaches. Finally, we analyze two real world biomedical study data where we use clinical markers and neuroimaging biomarkers to predict age-at-onset of a disease, and demonstrate superiority of SVHM in distinguishing high risk versus low risk subjects.

[1]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[2]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[3]  S Van Huffel,et al.  Additive survival least‐squares support vector machines , 2010, Statistics in medicine.

[4]  Donglin Zeng,et al.  Efficient estimation of semiparametric transformation models for counting processes , 2006 .

[5]  J. Robins,et al.  Analysis of semiparametric regression models for repeated outcomes in the presence of missing data , 1995 .

[6]  Lionel Tarassenko,et al.  Non‐linear survival analysis using neural networks , 2004, Statistics in medicine.

[7]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[8]  Yi Lin Multicategory Support Vector Machines, Theory, and Application to the Classification of . . . , 2003 .

[9]  David R. Cox,et al.  Regression models and life tables (with discussion , 1972 .

[10]  Z. Ying,et al.  Analysis of transformation models with censored data , 1995 .

[11]  Manish S. Shah,et al.  A novel gene containing a trinucleotide repeat that is expanded and unstable on Huntington's disease chromosomes , 1993, Cell.

[12]  Anthony C. C. Coolen,et al.  Gaussian process regression for survival data with competing risks , 2013, 1312.1591.

[13]  Donglin Zeng,et al.  Targeted Local Support Vector Machine for Age-Dependent Classification , 2014, Journal of the American Statistical Association.

[14]  Shigeo Abe Support Vector Machines for Pattern Classification , 2010, Advances in Pattern Recognition.

[15]  H. Jeremy Bockholt,et al.  Clinical and Biomarker Changes in Premanifest Huntington Disease Show Trial Feasibility: A Decade of the PREDICT-HD Study , 2014, Front. Aging Neurosci..

[16]  Sabine Van Huffel,et al.  Support vector methods for survival analysis: a comparison between ranking and regression approaches , 2011, Artif. Intell. Medicine.

[17]  A. Singleton,et al.  The Parkinson Progression Marker Initiative (PPMI) , 2011, Progress in Neurobiology.

[18]  Ingo Steinwart,et al.  Support Vector Machines are Universally Consistent , 2002, J. Complex..

[19]  Michael R. Kosorok,et al.  Support Vector Regression for Right Censored Data , 2012, 1202.5130.

[20]  P. Gänssler Weak Convergence and Empirical Processes - A. W. van der Vaart; J. A. Wellner. , 1997 .

[21]  Javier M. Moguerza,et al.  Support Vector Machines with Applications , 2006, math/0612817.

[22]  Wei Pan,et al.  Large Margin Hierarchical Classification with Mutually Exclusive Class Membership , 2011, J. Mach. Learn. Res..

[23]  C. Jack,et al.  Ways toward an early diagnosis in Alzheimer’s disease: The Alzheimer’s Disease Neuroimaging Initiative (ADNI) , 2005, Alzheimer's & Dementia.

[24]  A. Mechelli,et al.  Using Support Vector Machine to identify imaging biomarkers of neurological and psychiatric disease: A critical review , 2012, Neuroscience & Biobehavioral Reviews.

[25]  Jane S. Paulsen,et al.  Indexing disease progression at study entry with individuals at‐risk for Huntington disease , 2011, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[26]  Li Li,et al.  Support Vector Machines , 2015 .

[27]  Kjell A. Doksum,et al.  Partial likelihood in transformation models with censored data , 1988 .

[28]  Ingo Steinwart,et al.  Fast rates for support vector machines using Gaussian kernels , 2007, 0708.1838.

[29]  Jane S. Paulsen,et al.  Detection of Huntington’s disease decades before diagnosis: the Predict-HD study , 2007, Journal of Neurology, Neurosurgery, and Psychiatry.

[30]  Brian D. Ripley,et al.  Clinical applications of artificial neural networks: Neural networks as statistical methods in survival analysis , 2001 .

[31]  S. Bennett,et al.  Analysis of survival data by the proportional odds model. , 1983, Statistics in medicine.

[32]  Jane S. Paulsen Cognitive Impairment in Huntington Disease: Diagnosis and Treatment , 2011, Current neurology and neuroscience reports.

[33]  Zhiliang Ying,et al.  Semiparametric analysis of transformation models with censored data , 2002 .

[34]  I. James,et al.  Linear regression with censored data , 1979 .

[35]  Klaus Nordhausen,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition by Trevor Hastie, Robert Tibshirani, Jerome Friedman , 2009 .

[36]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[37]  Michael I. Jordan,et al.  Convexity, Classification, and Risk Bounds , 2006 .

[38]  A. Folsom,et al.  The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators. , 1989, American journal of epidemiology.

[39]  Faisal M. Khan,et al.  Support Vector Regression for Censored Data (SVRc): A Novel Tool for Survival Analysis , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[40]  K. Do,et al.  Efficient and Adaptive Estimation for Semiparametric Models. , 1994 .

[41]  Denis Larocque,et al.  A review of survival trees , 2011 .

[42]  M. Woodward,et al.  Risk of Cardiovascular Disease from Cumulative Cigarette Use and the Impact of Smoking Intensity , 2016, Epidemiology.

[43]  Guodong Guo,et al.  Support Vector Machines Applications , 2014 .

[44]  Donglin Zeng,et al.  Maximum likelihood estimation in semiparametric regression models with censored data , 2007, Statistica Sinica.

[45]  Wei Chu,et al.  A Support Vector Approach to Censored Targets , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).