An Enhanced K-Nearest Neighbor Classification Method Based on Maximal Coherence and Validity Ratings

Traditional k-nearest neighbor methods couldn’t be able to correctly classify objects when their k nearest neighbors are dominated by other classes. This paper formulates a two-class classification problem, and applies a modified k-nearest neighbors (KNN) classifier algorithm based on maximal coherence, validity ratings, and k-fold cross validation to classify the test samples. We build a validity score for the pairs of sample and their surroundings according to their labels. The k nearest neighbors (including the unknown test object) of each sample in the training set as well as the unknown test object itself will be determined. The unknown test object will be tentatively assigned to a class membership. Then we use the validity scores to quantify the degree to which a pre-determined group of samples resemble their k nearest neighbors. A classifier is designed which take into account the coherence and validity ratings. A numerical example demonstrates the effectiveness of the algorithm in detail. The enhanced KNN method is compared with the conventional KNN and the modified KNN method on both real world wine data and photo-thermal infrared imaging spectroscopy (PT-IRIS) data for up to 20 different k values. Classification accuracy of KNN method and our method in terms of various combinations of k-value and k-fold cross validation are compared. The experimental results show that the proposed enhanced KNN method outperforms the conventional KNN and the modified KNN method on both real world wine data and PT-IRIS data. In addition, the classification accuracy of both the conventional KNN and our method increase drastically when k = 5. The average classification accuracy of our method on the PT-IRIS data featuring small sample size and high overlap is 97.87%.

[1]  Nian Zhang,et al.  Design of adaptive feature extraction algorithm based on fuzzy classifier in hyperspectral imagery classification for big data analysis , 2016, 2016 12th World Congress on Intelligent Control and Automation (WCICA).

[2]  Juan F. Ramirez Rochac,et al.  Feature extraction in hyperspectral imaging using adaptive feature selection approach , 2016, 2016 Eighth International Conference on Advanced Computational Intelligence (ICACI).

[3]  Hamid Parvin,et al.  MKNN: Modified K-Nearest Neighbor , 2008 .

[4]  Nian Zhang,et al.  Cost-sensitive spectral clustering for photo-thermal infrared imaging data , 2016, 2016 Sixth International Conference on Information Science and Technology (ICIST).

[5]  Bo Tang,et al.  ENN: Extended Nearest Neighbor Method for Pattern Recognition [Research Frontier] , 2015, IEEE Computational Intelligence Magazine.

[6]  Jeff Byers,et al.  Infrared photothermal imaging of trace explosives on relevant substrates , 2013, Defense, Security, and Sensing.

[7]  Juan F. Ramirez Rochac,et al.  Reference clusters based feature extraction approach for mixed spectral signatures with dimensionality disparity , 2016, 2016 Annual IEEE Systems Conference (SysCon).

[8]  J. L. Hodges,et al.  Discriminatory Analysis - Nonparametric Discrimination: Consistency Properties , 1989 .

[9]  Leon N. Cooper,et al.  Improving nearest neighbor rule with a simple adaptive distance measure , 2007, Pattern Recognit. Lett..

[10]  Leslie S. Smith,et al.  Feature subset selection in large dimensionality domains , 2010, Pattern Recognit..

[11]  Hamid Parvin,et al.  Validation Based Modified K‐Nearest Neighbor , 2009 .

[12]  Shweta Taneja,et al.  MFZ-KNN — A modified fuzzy based K nearest neighbor algorithm , 2015, 2015 International Conference on Cognitive Computing and Information Processing(CCIP).