A framework for automatic modelling of survival using fuzzy inference

Survival analysis describes the analysis of data that corresponds to the time from when an individual enters a study until the occurrence of some particular event or end-point. It is most commonly used in the context of modelling survival (or disease-free interval time) in medical contexts, often concerned with the comparison of survival for different combinations of risk factors and/or treatments. Analytical methods which are transparent to the clinicians in understanding and explaining individual inference need to be considered when dealing with such medical data. In this paper, we present a framework for modelling survival utilising the application of the ANFIS fuzzy inference system. In this framework, alternative methods of partitioning the input space can be selected to define the membership functions, for example by using expert knowledge, equalizer partitioning, fuzzy c-means clustering, or the combination of these techniques. Further, the rule base can be established by enumerating all possible combinations of membership functions of all inputs. After the initialisation of the fuzzy inference structure, the replication data (until time to event) will be subject to be trained using the gradient descent and nonnegative least square algorithm to estimate the conditional event probability. This framework is validated over a novel dataset of patients following operative surgery for ovarian cancer. We demonstrate that the proposed framework can be successfully applied to estimate the hazard and survival curves between different prognostic factors, and model survival times, while providing models with explicit explanation capabilities.

[1]  Paulo J. G. Lisboa,et al.  Neural Networks and Other Machine Learning Methods in Cancer Research , 2007, IWANN.

[2]  T. Halvorsen,et al.  Survival and Prognostic Factors in Patients With Ovarian Cancer , 2003, Obstetrics and gynecology.

[3]  Paulo J. G. Lisboa,et al.  A Bayesian neural network approach for modelling censored data with an application to prognosis after surgery for breast cancer , 2003, Artif. Intell. Medicine.

[4]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[5]  John F Smyth,et al.  Validation of a new prognostic index for advanced epithelial ovarian cancer: results from its application to a UK-based cohort. , 2007, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[6]  T. Stijnen,et al.  Review: a gentle introduction to imputation of missing values. , 2006, Journal of clinical epidemiology.

[7]  Douglas G Altman,et al.  Developing a prognostic model in the presence of missing data: an ovarian cancer case study. , 2003, Journal of clinical epidemiology.

[8]  Brian D. Ripley,et al.  Clinical applications of artificial neural networks: Neural networks as statistical methods in survival analysis , 2001 .

[9]  Michio Sugeno,et al.  Industrial Applications of Fuzzy Control , 1985 .

[10]  David McLean,et al.  Rule extraction from neural networks for medical domains , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[11]  T G Clark,et al.  Survival Analysis Part I: Basic concepts and first analyses , 2003, British Journal of Cancer.

[12]  Jonathan M. Garibaldi,et al.  Adaptive neuro-fuzzy inference system (ANFIS) in modelling breast cancer survival , 2010, International Conference on Fuzzy Systems.

[13]  Mu-Song Chen,et al.  Fuzzy clustering analysis for optimizing fuzzy membership functions , 1999, Fuzzy Sets Syst..

[14]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[15]  E. McFadden,et al.  Toxicity and response criteria of the Eastern Cooperative Oncology Group , 1982, American journal of clinical oncology.

[16]  Xihong Lin,et al.  New Developments in Biostatistics and Bioinformatics , 2009 .

[17]  Paulo J. G. Lisboa,et al.  The Use of Artificial Neural Networks in Decision Support in Cancer: a Systematic Review , 2005 .

[18]  Heloisa A. Camargo,et al.  Optimising the Fuzzy Granulation of Attribute Domains , 2009, IFSA/EUSFLAT Conf..

[19]  Paulo J. G. Lisboa,et al.  Data Mining in Cancer Research [Application Notes] , 2010, IEEE Computational Intelligence Magazine.

[20]  Joachim Diederich,et al.  Survey and critique of techniques for extracting rules from trained artificial neural networks , 1995, Knowl. Based Syst..

[21]  Michael Negnevitsky,et al.  Artificial Intelligence: A Guide to Intelligent Systems , 2001 .

[22]  Lucila Ohno-Machado,et al.  Logistic regression and artificial neural network classification models: a methodology review , 2002, J. Biomed. Informatics.

[23]  D G Altman,et al.  A prognostic model for ovarian cancer , 2001, British Journal of Cancer.

[24]  Jonathan M. Garibaldi,et al.  An Investigation of the Effect of Input Representation in ANFIS Modelling of Breast Cancer Survival , 2010, IJCCI.

[25]  Satoshi Teramukai,et al.  PIEPOC: a new prognostic index for advanced epithelial ovarian cancer--Japan Multinational Trial Organization OC01-01. , 2007, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[26]  E Biganzoli,et al.  Feed forward neural networks for the analysis of censored survival data: a partial logistic regression approach. , 1998, Statistics in medicine.

[27]  Dursun Delen,et al.  Predicting breast cancer survivability: a comparison of three data mining methods , 2005, Artif. Intell. Medicine.