Different Methodologies for Patient Stratification Using Survival Data

Clinical characterization of breast cancer patients related to their risk and profiles is an important part for making their correct prognostic assessments. This paper first proposes a prognostic index obtained when it is applied a flexible non-linear time-to-event model and compares it to a widely used linear survival estimator. This index underpins different stratification methodologies including informed clustering utilising the principle of learning metrics, regression trees and recursive application of the log-rank test. Missing data issue was overcome using multiple imputation, which was applied to a neural network model of survival fitted to a data set for breast cancer (n=743). It was found the three methodologies broadly agree, having however important differences.

[1]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[2]  Samuel Kaski,et al.  Bankruptcy analysis with self-organizing maps in learning metrics , 2001, IEEE Trans. Neural Networks.

[3]  Lakhmi C. Jain,et al.  Knowledge-Based Intelligent Information and Engineering Systems , 2004, Lecture Notes in Computer Science.

[4]  H. Boshuizen,et al.  Multiple imputation of missing blood pressure covariates in survival analysis. , 1999, Statistics in medicine.

[5]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[6]  Elia Biganzoli,et al.  Estimates of clinically useful measures in competing risks survival analysis , 2008, Statistics in medicine.

[7]  Paulo J. G. Lisboa,et al.  A review of evidence of health benefit from artificial neural networks in medical intervention , 2002, Neural Networks.

[8]  Douglas G Altman,et al.  Developing a prognostic model in the presence of missing data: an ovarian cancer case study. , 2003, Journal of clinical epidemiology.

[9]  Andrew Gelman,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models: Missing-data imputation , 2006 .

[10]  Paulo J. G. Lisboa,et al.  A Bayesian neural network approach for modelling censored data with an application to prognosis after surgery for breast cancer , 2003, Artif. Intell. Medicine.

[11]  Shun-ichi Amari,et al.  Natural Gradient Works Efficiently in Learning , 1998, Neural Computation.

[12]  Davide Bacciu,et al.  Competitive Repetition Suppression (CoRe) Clustering: A Biologically Inspired Learning Model With Application to Robust Clustering , 2008, IEEE Transactions on Neural Networks.

[13]  Paulo J. G. Lisboa,et al.  Patient stratification with competing risks by multivariate Fisher distance , 2009, 2009 International Joint Conference on Neural Networks.

[14]  Paulo J. G. Lisboa,et al.  Stratification of Severity of Illness Indices: A Case Study for Breast Cancer Prognosis , 2008, KES.

[15]  Paulo J. G. Lisboa,et al.  Missing Data Imputation in Longitudinal Cohort Studies: Application of PLANN-ARD in Breast Cancer Survival , 2008, 2008 Seventh International Conference on Machine Learning and Applications.