Interval Censored Recursive Forests

We propose the interval censored recursive forests (ICRF) which is an iterative tree ensemble method for interval censored survival data. This nonparametric regression estimator makes the best use of censored information by iteratively updating the survival estimate, and can be viewed as a self-consistent estimator with convergence monitored using out-of-bag samples. Splitting rules optimized for interval censored data are developed and kernel-smoothing is applied. The ICRF displays the highest prediction accuracy among competing nonparametric methods in most of the simulations and in an applied example to avalanche data. An R package icrf is available for implementation.

[1]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[2]  Yaming Yu,et al.  Computing the log concave NPMLE for interval censored data , 2016, Stat. Comput..

[3]  M. LeBlanc,et al.  Relative risk trees for censored survival data. , 1992, Biometrics.

[4]  Stefan Wager,et al.  Estimation and Inference of Heterogeneous Treatment Effects using Random Forests , 2015, Journal of the American Statistical Association.

[5]  Robert L. Strawderman,et al.  Censoring Unbiased Regression Trees and Ensembles , 2018, Journal of the American Statistical Association.

[6]  Geurt Jongbloed,et al.  The Iterative Convex Minorant Algorithm for Nonparametric Estimation , 1998 .

[7]  J. Peto,et al.  Asymptotically Efficient Rank Invariant Test Procedures , 1972 .

[8]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[9]  S. Dudoit,et al.  Tree-based multivariate regression and density estimation with right-censored data , 2004 .

[10]  J. Wellner,et al.  Information Bounds and Nonparametric Maximum Likelihood Estimation , 1992 .

[11]  Nonparametric Tree-Structured Modeling for Interval-Censored Survival Data , 2002 .

[12]  Jeffrey S Simonoff,et al.  Survival trees for interval‐censored survival data , 2017, Statistics in medicine.

[13]  Torsten Hothorn,et al.  Bagging survival trees , 2002, Statistics in medicine.

[14]  Yan Zhou,et al.  Rationale and Applications of Survival Tree and Survival Ensemble Methods , 2015, Psychometrika.

[15]  Clifford Anderson-Bergman,et al.  icenReg: Regression Models for Interval Censored Data in R , 2017 .

[16]  Jon A. Wellner,et al.  A Hybrid Algorithm for Computation of the Nonparametric Maximum Likelihood Estimator from Censored Data , 1997 .

[17]  Hemant Ishwaran,et al.  Random Survival Forests , 2008, Wiley StatsRef: Statistics Reference Online.

[18]  R. Olshen,et al.  Tree-structured survival analysis. , 1985, Cancer treatment reports.

[19]  P. Groeneboom,et al.  MAXIMUM SMOOTHED LIKELIHOOD ESTIMATION AND SMOOTHED MAXIMUM LIKELIHOOD ESTIMATION IN THE CURRENT STATUS MODEL , 2010, 1001.1829.

[20]  Jeffrey S. Simonoff,et al.  An ensemble method for interval-censored time-to-event data. , 2019, Biostatistics.

[21]  Michael R Kosorok,et al.  Recursively Imputed Survival Trees , 2012, Journal of the American Statistical Association.

[22]  M. LeBlanc,et al.  Survival Trees by Goodness of Split , 1993 .

[23]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[24]  K. Hornik,et al.  Model-Based Recursive Partitioning , 2008 .

[25]  E Graf,et al.  Assessment and comparison of prognostic classification schemes for survival data. , 1999, Statistics in medicine.

[26]  D. Finkelstein,et al.  A proportional hazards model for interval-censored failure time data. , 1986, Biometrics.

[27]  B. Efron The two sample problem with censored data , 1967 .

[28]  Hermann Brugger,et al.  Comparison of avalanche survival patterns in Canada and Switzerland , 2011, Canadian Medical Association Journal.

[29]  Denis Larocque,et al.  A review of survival trees , 2011 .

[30]  R B Davis,et al.  Exponential survival trees. , 1989, Statistics in medicine.

[31]  P. Bühlmann,et al.  Survival ensembles. , 2006, Biostatistics.

[32]  Guadalupe Gómez,et al.  Interval censoring: Model characterizations for the validity of the simplified likelihood , 2004 .

[33]  Jian Huang,et al.  Interval Censored Survival Data: A Review of Recent Progress , 1997 .

[34]  A Ciampi,et al.  RECPAM: a computer program for recursive partition and amalgamation for survival data and other situations frequently occurring in biostatistics. III. Classification according to a multivariate construct. Application to data on Haemophilus influenzae type b meningitis. , 1991, Computer methods and programs in biomedicine.

[35]  Mark R. Segal,et al.  Regression Trees for Censored Data , 1988 .

[36]  F. Hsu,et al.  EVALUATING PERFORMANCE OF SURVIVAL REGRESSION MODELS WITH INTERVAL CENSORED DATA IN MOTORVEHICLE CRASH EXPERIMENTS , 2016 .

[37]  Susan Athey,et al.  Recursive partitioning for heterogeneous causal effects , 2015, Proceedings of the National Academy of Sciences.