Fuzzy C-means clustering based uncertainty measure for sample weighting boosts pattern classification efficiency

The paper presents a fuzzy c-means clustering based fuzzy measure for weighting samples in a dataset for pattern classification. The method improves classification efficiency. The fuzzy c-means generated membership grades of a sample for belonging to different clusters are interpreted as measures of uncertainty for assigning specific crisp class label to this sample. The fuzzy measure of total uncertainty for a sample is defined as U = -Σk=1c Mk log2 Mk where Mk denotes the membership grade in k-th cluster, and the summation extends is over all the c clusters. The data samples in feature space are then transformed according to X → (1 + U)X. By using a radial basis function neural network classifier the classification efficiency is compared based on the original and the transformed feature vectors. Several data sets collected from open sources were used for validation.

[1]  Jose C. Principe,et al.  Handbook of Neural Network Signal Processing , 2018 .

[2]  R. D. S. Yadava,et al.  Genetic Algorithm Assisted Enhancement in Pattern Recognition Efficiency of Radial Basis Neural Network , 2011, SEMCCO.

[3]  Michael G. Madden Evaluation of the Performance of the Markov Blanket Bayesian Classifier Algorithm , 2002, ArXiv.

[4]  R. D. S. Yadava,et al.  Boosting Principal Component Analysis by Genetic Algorithm , 2010 .

[5]  P. Jurs,et al.  Detection of hazardous vapors including mixtures using pattern recognition analysis of responses from surface acoustic wave devices. , 1988, Analytical chemistry.

[6]  Tülay Yildirim,et al.  Improving classification performance of sonar targets by applying general regression neural network with PCA , 2008, Expert Syst. Appl..

[7]  R. Yadava,et al.  On Genetic Algorithm and Multiple Preprocessors Assisted Feature Boosting for Electronic Nose Signal Processing , 2011 .

[8]  Matteo Pardo,et al.  Coffee analysis with an electronic nose , 2002, IEEE Trans. Instrum. Meas..

[9]  Kewen Wang,et al.  Artificial Intelligence Advances in China: Guest-editorial , 2003 .

[10]  Evor L. Hines,et al.  Pattern analysis for electronic noses , 2003 .

[11]  P. K. Chaturvedi,et al.  Communication Systems , 2002, IFIP — The International Federation for Information Processing.

[12]  George J. Klir,et al.  Fuzzy sets and fuzzy logic - theory and applications , 1995 .

[13]  F. Moura-Pires,et al.  A fuzzy clustering model of data and fuzzy c-means , 2000, Ninth IEEE International Conference on Fuzzy Systems. FUZZ- IEEE 2000 (Cat. No.00CH37063).

[14]  Zhi-Hua Zhou,et al.  Editing Training Data for kNN Classifiers with Neural Network Ensemble , 2004, ISNN.

[15]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[16]  Stephen D. Bay Combining Nearest Neighbor Classifiers Through Multiple Feature Subsets , 1998, ICML.

[17]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Zhi-Hua Zhou,et al.  Extracting symbolic rules from trained neural network ensembles , 2003, AI Commun..

[19]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .