A vector quantization based k-NN approach for large-scale image classification

The k-nearest-neighbour (k-NN) classifier has been one of the simplest yet most effective approaches to the instance-based learning problem in image classification. However, as image datasets have grown in size and image descriptors in dimensionality, the popularity of k-NN classifiers has declined because of their significant storage requirements and computational costs. In this paper we propose a vector quantization (VQ) based k-NN classifier that reduces both storage requirements and computational costs. We test the proposed method on publicly available large-scale image datasets and show that it performs comparably to traditional k-NN while requiring significantly less computation and storage.
