Stratification Methodologies for Neural Networks Models of Survival

Clinical management often relies on stratification of patients by outcome. The application of flexible non-linear time-to-event models to stratification of patient populations into different and clinically meaningful risk groups is currently an important area of research. This paper proposes a definition of prognostic index for neural network models of survival. This index underpins different stratification strategies including k-means clustering, regression trees and recursive application of the log-rank test. It was obtained with multiple imputation applied to a neural network model of survival fitted to a substantial data set for breast cancer (n=931) and was evaluated with a large out of sample data set (n=4,083). It was found that the constraint imposed by regression trees on the form of the permitted rules makes it less specific than stratifying directly from the prognostic index and deriving unconstrained low-order rules with Orthogonal Search Rule Extraction.

[1]  Douglas G Altman,et al.  Developing a prognostic model in the presence of missing data: an ovarian cancer case study. , 2003, Journal of clinical epidemiology.

[2]  Andrew Gelman,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models: Missing-data imputation , 2006 .

[3]  Lakhmi C. Jain,et al.  Knowledge-Based Intelligent Information and Engineering Systems , 2004, Lecture Notes in Computer Science.

[4]  Paulo J. G. Lisboa,et al.  A Bayesian neural network approach for modelling censored data with an application to prognosis after surgery for breast cancer , 2003, Artif. Intell. Medicine.

[5]  Elia Biganzoli,et al.  Estimates of clinically useful measures in competing risks survival analysis , 2008, Statistics in medicine.

[6]  Paulo J. G. Lisboa,et al.  Stratification of Severity of Illness Indices: A Case Study for Breast Cancer Prognosis , 2008, KES.

[7]  Paulo J. G. Lisboa,et al.  Orthogonal search-based rule extraction (OSRE) for trained neural networks: a practical and efficient approach , 2006, IEEE Transactions on Neural Networks.

[8]  Paulo J. G. Lisboa,et al.  Missing Data Imputation in Longitudinal Cohort Studies: Application of PLANN-ARD in Breast Cancer Survival , 2008, 2008 Seventh International Conference on Machine Learning and Applications.

[9]  H. Boshuizen,et al.  Multiple imputation of missing blood pressure covariates in survival analysis. , 1999, Statistics in medicine.

[10]  Karen A Gelmon,et al.  Population-based validation of the prognostic model ADJUVANT! for early breast cancer. , 2005, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[11]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[12]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[13]  Paulo J. G. Lisboa,et al.  A review of evidence of health benefit from artificial neural networks in medical intervention , 2002, Neural Networks.