Data Driven Generation of Fuzzy Systems: An Application to Breast Cancer Detection

The detection of diseases often can be formalized as a decision problem that typically has to be solved merging uncertain information; diagnostic tools, intended to aid the physician in interpreting the data, besides attaining the best possible correct classification rate, should furnish some insight into how the problem has been decided. Fuzzy logic is a well known successful attempt to automatize the human capability to reason with imperfect information; fuzzy systems are rule-based so that they can easily provide motivations for their decisions, after having verified some additional conditions. In this paper we describe a six-steps data driven methodology to automatically build fuzzy systems with a user defined number of rules; almost each step can be approached using several strategies and we thus describe an implementation of the proposed solution. Then, we test our systems on a well known and widely used data set of features of images of breast masses and, having the number of rules varying, we show results both in terms of correct classification rates and in terms of systems' confidence in the obtained decisions. Finally, we select the number of rules that produces the most interpretable and trustworthy system; such a system is described in details and tested.

[1]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[2]  Dimitrios I. Fotiadis,et al.  A methodology for automated fuzzy model generation , 2008, Fuzzy Sets Syst..

[3]  Serge Guillaume,et al.  Designing fuzzy inference systems from data: An interpretability-oriented review , 2001, IEEE Trans. Fuzzy Syst..

[4]  Lotfi A. Zadeh,et al.  Is there a need for fuzzy logic? , 2008, NAFIPS 2008 - 2008 Annual Meeting of the North American Fuzzy Information Processing Society.

[5]  Mohammad Hossein Fazel Zarandi,et al.  Data-driven fuzzy modeling for Takagi-Sugeno-Kang fuzzy system , 2010, Inf. Sci..

[6]  Bo Yang,et al.  Automatic Design of Hierarchical Takagi–Sugeno Type Fuzzy Systems Using Evolutionary Algorithms , 2007, IEEE Transactions on Fuzzy Systems.

[7]  George J. Klir,et al.  Fuzzy sets and fuzzy logic - theory and applications , 1995 .

[8]  T. Warren Liao,et al.  Medical data mining by fuzzy modeling with selected features , 2008, Artif. Intell. Medicine.

[9]  W. N. Street,et al.  Image analysis and machine learning applied to breast cancer diagnosis and prognosis. , 1995, Analytical and quantitative cytology and histology.

[10]  E. H. Mamdani,et al.  An Experiment in Linguistic Synthesis with a Fuzzy Logic Controller , 1999, Int. J. Man Mach. Stud..

[11]  Dimitrios I. Fotiadis,et al.  Automated Diagnosis of Coronary Artery Disease Based on Data Mining and Fuzzy Modeling , 2008, IEEE Transactions on Information Technology in Biomedicine.

[12]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[13]  Moshe Sipper,et al.  A fuzzy-genetic approach to breast cancer diagnosis , 1999, Artif. Intell. Medicine.

[14]  Derek A. Linkens,et al.  Rule-base self-generation and simplification for data-driven fuzzy models , 2004, Fuzzy Sets Syst..

[15]  Michio Sugeno,et al.  Fuzzy identification of systems and its applications to modeling and control , 1985, IEEE Transactions on Systems, Man, and Cybernetics.