Deep Lifetime Clustering

The goal of lifetime clustering is to develop an inductive model that maps subjects into $K$ clusters according to their underlying (unobserved) lifetime distribution. We introduce a neural-network based lifetime clustering model that can find cluster assignments by directly maximizing the divergence between the empirical lifetime distributions of the clusters. Accordingly, we define a novel clustering loss function over the lifetime distributions (of entire clusters) based on a tight upper bound of the two-sample Kuiper test p-value. The resultant model is robust to the modeling issues associated with the unobservability of termination signals, and does not assume proportional hazards. Our results in real and synthetic datasets show significantly better lifetime clusters (as evaluated by C-index, Brier Score, Logrank score and adjusted Rand index) as compared to competing approaches.

[1]  F. Harrell,et al.  Evaluating the yield of medical tests. , 1982, JAMA.

[2]  Udaya B. Kogalur,et al.  High-Dimensional Variable Selection for Survival Data , 2010 .

[3]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[4]  Lawrence Carin,et al.  Adversarial Time-to-Event Modeling , 2018, ICML.

[5]  Philip S. Yu,et al.  On using partial supervision for text categorization , 2004, IEEE Transactions on Knowledge and Data Engineering.

[6]  Richard S. Zemel,et al.  Generative Moment Matching Networks , 2015, ICML.

[7]  R. Tibshirani,et al.  Semi-Supervised Methods to Predict Patient Survival from Gene Expression Data , 2004, PLoS biology.

[8]  Yanglan Gan,et al.  Identification of cancer subtypes from single-cell RNA-seq data using a consensus clustering method , 2018, BMC Medical Genomics.

[9]  Vincenzo Lagani,et al.  Structure-based variable selection for survival data , 2010, Bioinform..

[10]  Robert Tibshirani,et al.  Survival analysis with high-dimensional covariates , 2010, Statistical methods in medical research.

[11]  F. Massey The Kolmogorov-Smirnov Test for Goodness of Fit , 1951 .

[12]  N. Mantel Evaluation of survival data and two new rank order statistics arising in its consideration. , 1966, Cancer chemotherapy reports.

[13]  Eric Bair,et al.  Identification of biologically relevant subtypes via preweighted sparse clustering , 2013, ArXiv.

[14]  Bruno Ribeiro,et al.  Modeling Website Popularity Competition in the Attention-Activity Marketplace , 2014, WSDM.

[15]  D.,et al.  Regression Models and Life-Tables , 2022 .

[16]  Uri Shaham,et al.  DeepSurv: personalized treatment recommender system using a Cox proportional hazards deep neural network , 2016, BMC Medical Research Methodology.

[17]  P. Hougaard,et al.  Frailty models for survival data , 1995, Lifetime data analysis.

[18]  Lei Zheng,et al.  Deep Recurrent Survival Analysis , 2018, AAAI.

[19]  R. Wolfe,et al.  A Frailty Model for Informative Censoring , 2002, Biometrics.

[20]  Yoshua Bengio,et al.  Deep Learning for Patient-Specific Kidney Graft Survival Analysis , 2017, ArXiv.

[21]  Ahmed M. Alaa,et al.  Deep Multi-task Gaussian Processes for Survival Analysis with Competing Risks , 2017, NIPS.

[22]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[23]  Mark Tygert,et al.  Statistical tests for whether a given set of independent, identically distributed draws comes from a specified probability density , 2010, Proceedings of the National Academy of Sciences.

[24]  L. Hubert,et al.  Comparing partitions , 1985 .

[25]  Wei Chu,et al.  A Support Vector Approach to Censored Targets , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[26]  E Graf,et al.  Assessment and comparison of prognostic classification schemes for survival data. , 1999, Statistics in medicine.

[27]  R. Tibshirani,et al.  Use of gene-expression profiling to identify prognostic subclasses in adult acute myeloid leukemia. , 2004, The New England journal of medicine.

[28]  T. Dodson,et al.  Frailty Approach for the Analysis of Clustered Failure Time Observations in Dental Research , 2005, Journal of dental research.

[29]  Adler J. Perotte,et al.  Deep Survival Analysis , 2016, MLHC.

[30]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[31]  Robert Tibshirani,et al.  A Framework for Feature Selection in Clustering , 2010, Journal of the American Statistical Association.

[32]  N. Kuiper Tests concerning random points on a circle , 1960 .

[33]  G. Brier VERIFICATION OF FORECASTS EXPRESSED IN TERMS OF PROBABILITY , 1950 .

[34]  Douglas G Altman,et al.  The logrank test , 2004, BMJ : British Medical Journal.

[35]  Léon Bottou,et al.  Wasserstein Generative Adversarial Networks , 2017, ICML.

[36]  Arindam Banerjee,et al.  Semi-supervised Clustering by Seeding , 2002, ICML.

[37]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Charu C. Aggarwal,et al.  When will it happen?: relationship prediction in heterogeneous information networks , 2012, WSDM '12.

[39]  Zaïd Harchaoui,et al.  A Fast, Consistent Kernel Two-Sample Test , 2009, NIPS.

[40]  Hemant Ishwaran,et al.  Random Survival Forests , 2008, Wiley StatsRef: Statistics Reference Online.

[41]  Bernhard Schölkopf,et al.  A Kernel Two-Sample Test , 2012, J. Mach. Learn. Res..

[42]  R. Tibshirani,et al.  Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[43]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[44]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[45]  J. Peto,et al.  Asymptotically Efficient Rank Invariant Test Procedures , 1972 .

[46]  Yan Liu,et al.  Benchmark of Deep Learning Models on Large Healthcare MIMIC Datasets , 2017, ArXiv.

[47]  Bernhard Schölkopf,et al.  Kernel Measures of Conditional Dependence , 2007, NIPS.

[48]  Sebastian Thrun,et al.  Learning to Classify Text from Labeled and Unlabeled Documents , 1998, AAAI/IAAI.

[49]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[50]  E. Kaplan,et al.  Nonparametric Estimation from Incomplete Observations , 1958 .

[51]  Changhee Lee,et al.  DeepHit: A Deep Learning Approach to Survival Analysis With Competing Risks , 2018, AAAI.

[52]  Huilin Chen,et al.  Statistical Inference Methods for Two Crossing Survival Curves: A Comparison of Methods , 2015, PloS one.