Strongly robust feature-voting higher dimensional fuzzy clustering

Supervised learning systems learn well on input-output vector pairs with low noise, but often the output vectors are not available so a clustering form of self-organizing learning on the input feature vectors is required. But noise on the features can cause the distance similarity measure to misclassify. To solve this problem we let the features vote, which tolerates unbounded noise on a minority of features. But the classes may not be linearly separable, as is the case for the famous iris data, so the usual clustering is inaccurate. We embed the feature vectors nonlinearly in higher dimensions, an idea from support vector machines. We next apply a feature separability criterion to eliminate extra features that merely add extraneous noise. Then we apply our method of weighted fuzzy expected value clustering in the higher dimensions. The significance is that we often can accomplish self-organizing learning on noisy data in difficult and linearly nonseparable cases. We use the algorithm on a simple test case and then on the iris data.

[1]  Carl G. Looney,et al.  Pattern recognition using neural networks , 1997 .

[2]  Carl G. Looney,et al.  Interactive clustering and merging with a new fuzzy expected value , 2002, Pattern Recognit..

[3]  Chin-Teng Lin,et al.  Neural fuzzy systems , 1994 .

[4]  Yoshiki Uchikawa,et al.  A Fuzzy Classifier System for evolutionary learning of robot behaviors , 1998 .

[5]  Abraham Kandel,et al.  Feature-based fuzzy classification for interpretation of mammograms , 2000, Fuzzy Sets Syst..

[6]  M. Ait Kbir,et al.  Hierarchical fuzzy partition for pattern classification with fuzzy if-then rules , 2000, Pattern Recognit. Lett..

[7]  Ching-Chang Wong,et al.  K-means-based fuzzy classifier design , 2000, Ninth IEEE International Conference on Fuzzy Systems. FUZZ- IEEE 2000 (Cat. No.00CH37063).

[8]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[9]  John Moody,et al.  Fast Learning in Networks of Locally-Tuned Processing Units , 1989, Neural Computation.

[10]  Krzysztof J. Cios,et al.  Certainty factors versus Parzen windows as reliability measures in RBF networks , 1998, Neurocomputing.

[11]  E. Forgy,et al.  Cluster analysis of multivariate data : efficiency versus interpretability of classifications , 1965 .

[12]  James M. Keller,et al.  A fuzzy K-nearest neighbor algorithm , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[13]  Donald F. Specht,et al.  Probabilistic neural networks , 1990, Neural Networks.

[14]  David G. Stork,et al.  Pattern classification, 2nd Edition , 2000 .

[15]  Hisao Ishibuchi,et al.  Selecting fuzzy if-then rules for classification problems using genetic algorithms , 1995, IEEE Trans. Fuzzy Syst..

[16]  Ioannis Anagnostopoulos,et al.  A neural network and fuzzy logic system for face detection on RGB images , 2001, Computers and Their Applications.

[17]  David G. Stork,et al.  Pattern Classification , 1973 .

[18]  Carl G. Looney,et al.  Radial basis functional link nets and fuzzy reasoning , 2002, Neurocomputing.

[19]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..