A generalized mean distance-based k-nearest neighbor classifier

Abstract K-nearest neighbor (KNN) rule is a well-known non-parametric classifier that is widely used in pattern recognition. However, the sensitivity of the neighborhood size k always seriously degrades the KNN-based classification performance, especially in the case of the small sample size with the existing outliers. To overcome this issue, in this article we propose a generalized mean distance-based k-nearest neighbor classifier (GMDKNN) by introducing multi-generalized mean distances and the nested generalized mean distance that are based on the characteristic of the generalized mean. In the proposed method, multi-local mean vectors of the given query sample in each class are calculated by adopting its class-specific k nearest neighbors. Using the achieved k local mean vectors per class, the corresponding k generalized mean distances are calculated and then used to design the categorical nested generalized mean distance. In the classification phase, the categorical nested generalized mean distance is used as the classification decision rule and the query sample is classified into the class with the minimum nested generalized mean distance among all the classes. Extensive experiments on the UCI and KEEL data sets, synthetic data sets, the KEEL noise data sets and the UCR time series data sets are conducted by comparing the proposed method to the state-of-art KNN-based methods. The experimental results demonstrate that the proposed GMDKNN performs better and has the less sensitiveness to k. Thus, our proposed GMDKNN with the robust and effective classification performance could be a promising method for pattern recognition in some expert and intelligence systems.

[1]  Jesús Alcalá-Fdez,et al.  KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework , 2011, J. Multiple Valued Log. Soft Comput..

[2]  Jianping Gou,et al.  A new distance-weighted k-nearest neighbor classifier , 2012 .

[3]  Yong Zeng,et al.  Pseudo nearest neighbor rule for pattern classification , 2009, Expert Syst. Appl..

[4]  Yongzhao Zhan,et al.  Improved pseudo nearest neighbor classification , 2014, Knowl. Based Syst..

[5]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[6]  Yong Zeng,et al.  Nonparametric classification based on local mean and class statistics , 2009, Expert Syst. Appl..

[7]  Nojun Kwak,et al.  Generalized mean for robust principal component analysis , 2016, Pattern Recognit..

[8]  Pasi Luukka,et al.  Nonlinear fuzzy robust PCA algorithms and similarity classifier in bankruptcy analysis , 2010, Expert Syst. Appl..

[9]  Qian Du,et al.  Collaborative-Representation-Based Nearest Neighbor Classifier for Hyperspectral Imagery , 2015, IEEE Geoscience and Remote Sensing Letters.

[10]  Jian Yang,et al.  From classifiers to discriminators: A nearest neighbor rule induced discriminant analysis , 2011, Pattern Recognit..

[11]  Sahibsingh A. Dudani The Distance-Weighted k-Nearest-Neighbor Rule , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[12]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[13]  Nicolás García-Pedrajas,et al.  A Proposal for Local $k$ Values for $k$ -Nearest Neighbor Rule , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[14]  Hong Liu,et al.  Coarse to fine K nearest neighbor classifier , 2013, Pattern Recognit. Lett..

[15]  Yidi Wang,et al.  A new k-harmonic nearest neighbor classifier based on the multi-local means , 2017, Expert Syst. Appl..

[16]  Jianping Gou,et al.  A Novel Weighted Voting for K-Nearest Neighbor Rule , 2011, J. Comput..

[17]  Jianping Gou,et al.  A new nearest neighbor classifier based on multi-harmonic mean distances , 2017, 2017 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC).

[18]  Dimitrios Gunopulos,et al.  Locally Adaptive Metric Nearest-Neighbor Classification , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Pasi Luukka,et al.  Classification based on fuzzy robust PCA algorithms and similarity classifier , 2009, Expert Syst. Appl..

[20]  S.-G. Lee,et al.  An Arrhythmia Classification Method in Utilizing the Weighted KNN and the Fitness Rule , 2017 .

[21]  Yan Qiu Chen,et al.  The Nearest Neighbor Algorithm of Local Probability Centers , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[22]  Jian Yang,et al.  Linear reconstruction measure steered nearest neighbor classification framework , 2014, Pattern Recognit..

[23]  Manuele Bicego,et al.  Weighted K-Nearest Neighbor revisited , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[24]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[25]  Nan Zhang,et al.  Component-based global k-NN classifier for small sample size problems , 2012, Pattern Recognit. Lett..

[26]  Yoshihiko Hamamoto,et al.  A local mean-based nonparametric classifier , 2006, Pattern Recognit. Lett..

[27]  Lei Yu,et al.  Kernel representation-based nearest neighbor classifier , 2014 .

[28]  Hien T. Nguyen,et al.  A Weighted Local Mean-Based k-Nearest Neighbors Classifier for Time Series , 2017, ICMLC.

[29]  Namita Mittal,et al.  AVNM: A Voting based Novel Mathematical Rule for Image Classification , 2016, Comput. Methods Programs Biomed..

[30]  Ping Li,et al.  The Distance-Weighted K-nearest Centroid Neighbor Classification , 2017, J. Inf. Hiding Multim. Signal Process..

[31]  Jianping Gou,et al.  Sparse Coefficient-Based ${k}$ -Nearest Neighbor Classification , 2017, IEEE Access.

[32]  Huchuan Lu,et al.  Weighted Generalized Nearest Neighbor for Hyperspectral Image Classification , 2017, IEEE Access.

[33]  Jianping Gou,et al.  A Local Mean-Based k-Nearest Centroid Neighbor Classifier , 2012, Comput. J..

[34]  Xiaofeng Zhu,et al.  Efficient kNN Classification With Different Numbers of Nearest Neighbors , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[35]  Xuelong Li,et al.  Learning k for kNN Classification , 2017, ACM Trans. Intell. Syst. Technol..

[36]  Mehmet Fatih Amasyali,et al.  Locally adaptive k parameter selection for nearest neighbor classifier: one nearest cluster , 2017, Pattern Analysis and Applications.