Fuzzy c-Means Classifier for Incomplete Data Sets with Outliers and Missing Values

A novel membership function and a fuzzy clustering approach derived from a viewpoint of iteratively reweighted least square (IRLS) techniques resolve the problem of singularity in the regular fuzzy c-means (FCM) clustering. An FCM classifier using the membership function and Mahalanobis distances makes class memberships of outliers less clear-cut, which thus resolve the problem of classification based on normal populations or normal mixtures. The ways of handling singular covariance matrices and missing values are also furnished, which improve the generalization capability of the classifier. Computational experiments show high classification performance on several well-known benchmark data sets

[1]  Michael E. Tipping,et al.  Mixtures of Principal Component Analysers , 1997 .

[2]  Hidetomo Ichihashi,et al.  Linear fuzzy clustering techniques with missing values and their application to local principal component analysis , 2004, IEEE Transactions on Fuzzy Systems.

[3]  Teuvo Kohonen,et al.  Self-organization and associative memory: 3rd edition , 1989 .

[4]  P. Holland,et al.  Robust regression using iteratively reweighted least-squares , 1977 .

[5]  Hidetomo Ichihashi,et al.  Component-wise robust linear fuzzy clustering for collaborative filtering , 2004, Int. J. Approx. Reason..

[6]  T. Kunii,et al.  Soft Computing and Human-Centered Machines , 2013, Computer Science Workbench.

[7]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.

[8]  Hidetomo Ichihashi,et al.  Regularized linear fuzzy clustering and probabilistic PCA mixture models , 2005, IEEE Transactions on Fuzzy Systems.

[9]  Peter J. Huber,et al.  Robust Statistics , 2005, Wiley Series in Probability and Statistics.

[11]  Teuvo Kohonen,et al.  Self-Organization and Associative Memory , 1988 .

[12]  Walter A. Kosters,et al.  Genetic Programming for data classification: partitioning the search space , 2004, SAC '04.

[13]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[14]  Christopher M. Bishop,et al.  Mixtures of Probabilistic Principal Component Analyzers , 1999, Neural Computation.

[15]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[16]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[17]  Cor J. Veenman,et al.  The nearest subclass classifier: a compromise between the nearest mean and nearest neighbor classifier , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  K. Rose Deterministic annealing for clustering, compression, classification, regression, and related optimization problems , 1998, Proc. IEEE.

[19]  Roy L. Streit,et al.  Maximum likelihood training of probabilistic neural networks , 1994, IEEE Trans. Neural Networks.

[20]  Hidetomo Ichihashi,et al.  FCM Classifier Based on IRLS and Fuzzy Mahalanobis Distances , 2005 .

[21]  Hidetomo Ichihashi,et al.  FCM Clustering from the View Point of Iteratively Reweighted Least Squares , 2005, The 14th IEEE International Conference on Fuzzy Systems, 2005. FUZZ '05..