A New Weighted k-Nearest Neighbor Algorithm Based on Newton's Gravitational Force

The kNN algorithm has three main advantages that make it appealing to the community: it is easy to understand, it regularly offers competitive performance and its structure can be easily tuning to adapting to the needs of researchers to achieve better results. One of the variations is weighting the instances based on their distance. In this paper we propose a weighting based on the Newton’s gravitational force, so that a mass (or relevance) has to be assigned to each instance. We evaluated this idea in the kNN context over 13 benchmark data sets used for binary and multi-class classification experiments. Results in \(\mathrm {F}_1\) score, statistically validated, suggest that our proposal outperforms the original version of kNN and is statistically competitive with the distance weighted kNN version as well.

[1]  Francisco Herrera,et al.  rNPBST: An R Package Covering Non-parametric and Bayesian Statistical Tests , 2017, HAIS.

[2]  Gautam Bhattacharya,et al.  An affinity-based new local distance function and similarity measure for kNN algorithm , 2012, Pattern Recognit. Lett..

[3]  José Cristóbal Riquelme Santos,et al.  An evolutionary voting for k-nearest neighbours , 2016, Expert Syst. Appl..

[4]  Ji Feng,et al.  Natural neighbor: A self-adaptive neighborhood method without parameter K , 2016, Pattern Recognit. Lett..

[5]  Pedro M. Domingos,et al.  The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World , 2015 .

[6]  John Riedl,et al.  Shilling recommender systems for fun and profit , 2004, WWW '04.

[7]  Songbo Tan,et al.  Neighbor-weighted K-nearest neighbor for unbalanced text corpus , 2005, Expert Syst. Appl..

[8]  Elham Parvinnia,et al.  Classification of EEG Signals using adaptive weighted distance nearest neighbor algorithm , 2014, J. King Saud Univ. Comput. Inf. Sci..

[9]  Dennis L. Wilson,et al.  Asymptotic Properties of Nearest Neighbor Rules Using Edited Data , 1972, IEEE Trans. Syst. Man Cybern..

[10]  Sahibsingh A. Dudani The Distance-Weighted k-Nearest-Neighbor Rule , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[11]  Hong Liu,et al.  Coarse to fine K nearest neighbor classifier , 2013, Pattern Recognit. Lett..

[12]  Shichao Zhang,et al.  A novel kNN algorithm with data-driven k parameter computation , 2017, Pattern Recognit. Lett..

[13]  Marco Zaffalon,et al.  A Bayesian Wilcoxon signed-rank test based on the Dirichlet process , 2014, ICML.

[14]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[15]  Julio López,et al.  Redefining nearest neighbor classification in high-dimensional settings , 2018, Pattern Recognit. Lett..

[16]  Guy W. Mineau,et al.  A simple KNN algorithm for text categorization , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[17]  D. S. Guru,et al.  Texture Features and KNN in Classification of Flower Images , 2010 .