A New Nonlinear Fuzzy Robust PCA Algorithm and Similarity Classifier in Classification of Medical Data Sets

In this article a classification method is proposed where data is first preprocessed using new nonlinear fuzzy robust principal component analysis (NFRPCA) algorithm to get data into more feasible form. After this preprocessing step the similarity classifier is then used for the actual classification. The procedure was tested for dermatology, hepatitis and liver-disorder data. Results were quite promising and better classification accuracy was achieved than using classical PCA and similarity classifier. This new nonlinear fuzzy robust principal component analysis algorithm seems to have the effect that it project the data sets into a more feasible form and when used together with the similarity classifier a classification accuracy of 72.27 % was achieved with liver-disorder data, 88.94 % with hepatitis, and 97.09 % accuracy was achieved with dermatology data. Compared to results with classical PCA and the similarity classifier, higher accuracies were achieved with the approach using nonlinear fuzzy robust principal component analysis and the similarity classifier.

[1]  Lotfi A. Zadeh,et al.  Similarity relations and fuzzy orderings , 1971, Inf. Sci..

[2]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[3]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[4]  E. Oja,et al.  On stochastic approximation of the eigenvectors and eigenvalues of the expectation of a random matrix , 1985 .

[5]  L. Valverde On the structure of F-indistinguishability operators , 1985 .

[6]  O. Mangasarian,et al.  Multisurface method of pattern separation for medical diagnosis applied to breast cytology. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Lawrence Sirovich,et al.  Application of the Karhunen-Loeve Procedure for the Characterization of Human Faces , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Vilém Novák On the syntactico-semantical completeness of first-order fuzzy logic. I. Syntax and semantics , 1990, Kybernetika.

[9]  Vilém Novák On the syntactico-semantical completeness of first-order fuzzy logic. II. Main results , 1990, Kybernetika.

[10]  E. Oja The Nonlinear PCA Learning Rule and Signal Separation - Mathematical Analysis , 1995 .

[11]  Alan L. Yuille,et al.  Robust principal component analysis by self-organizing rules based on statistical physics approach , 1995, IEEE Trans. Neural Networks.

[12]  Frank Klawonn,et al.  Similarity in fuzzy reasoning , 1995 .

[13]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[14]  H. Altay Güvenir,et al.  Learning differential diagnosis of erythemato-squamous diseases using voting feature intervals , 1998, Artif. Intell. Medicine.

[15]  Sheng-De Wang,et al.  Robust algorithms for principal component analysis , 1999, Pattern Recognit. Lett..

[16]  Giangiacomo Gerla,et al.  Fuzzy subgroups and similarities , 1999, Soft Comput..

[17]  E. Turunen Mathematics Behind Fuzzy Logic , 1999 .

[18]  Esko Turunen Survey of Theory and Applications of Łukasiewicz-Pavelka Fuzzy Logic , 2001 .

[19]  Pasi Luukka,et al.  A classifier based on the maximal fuzzy similarity in the generalized Lukasiewicz-structure , 2001, 10th IEEE International Conference on Fuzzy Systems. (Cat. No.01CH37297).

[20]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[21]  Thierry Denoeux,et al.  Principal component analysis of fuzzy data using autoassociative neural networks , 2004, IEEE Transactions on Fuzzy Systems.

[22]  Elif Derya Übeyli,et al.  Automatic detection of erythemato-squamous diseases using adaptive neuro-fuzzy inference systems , 2004, Comput. Biol. Medicine.

[23]  Pasi Luukka,et al.  Similarity classifier with generalized mean applied to medical data , 2006, Comput. Biol. Medicine.

[24]  David J. Hewson,et al.  Classifying NIR spectra of textile products with kernel methods , 2007, Eng. Appl. Artif. Intell..

[25]  Yun-Chi Yeh,et al.  Heartbeat Case Determination Using Fuzzy Logic Method on ECG Signals , 2009 .

[26]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[27]  Pasi Luukka PCA for fuzzy data and similarity classifier in building recognition system for post-operative patient data , 2009, Expert Syst. Appl..

[28]  Pasi Luukka,et al.  Classification based on fuzzy robust PCA algorithms and similarity classifier , 2009, Expert Syst. Appl..

[29]  Chen-Chia Chuang,et al.  Two-Stages Support Vector Regression for Fuzzy Neural Networks with Outliers , 2009 .

[30]  Pasi Luukka,et al.  Nonlinear fuzzy robust PCA algorithms and similarity classifier in bankruptcy analysis , 2010, Expert Syst. Appl..

[31]  Pao-Ta Yu,et al.  Nonparametric Fuzzy Feature Extraction for Hyperspectral Image Classification , 2010 .

[32]  Kuo-Lan Su,et al.  ARFNNs under Different Types SVR for Identification of Nonlinear Magneto-Rheological Damper Systems with Outliers , 2010 .

[33]  Pasi Luukka,et al.  Feature selection using fuzzy entropy measures with similarity classifier , 2011, Expert Syst. Appl..

[34]  V. Kshirsagar,et al.  Face recognition using Eigenfaces , 2011, 2011 3rd International Conference on Computer Research and Development.