Boosting k-Nearest Neighbors Classification

A major drawback of the k-nearest neighbors (k-NN) rule is its high variance when the prototype dataset is sparse in a high-dimensional space. Most techniques proposed for improving k-NN classification rely either on deforming the k-NN relationship by learning a distance function or on modifying the input space by means of subspace selection. Here we propose a novel boosting approach for generalizing the k-NN rule: we redefine the voting rule as a strong classifier that linearly combines the predictions of the k closest prototypes. Our algorithm, called UNN (Universal Nearest Neighbors), treats the k nearest neighbor examples as weak classifiers and learns their weights so as to minimize a surrogate risk. These weights, called leveraging coefficients, allow us to distinguish the most relevant prototypes for a given class. Results on several scene categorization datasets demonstrate that UNN competes with or beats state-of-the-art methods, while achieving comparatively small training and testing times.
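
To make the voting rule concrete: UNN replaces the uniform k-NN vote with a leveraged vote h(x) = sign( sum over j in NN_k(x) of alpha_j * y_j ), where alpha_j is the leveraging coefficient of prototype (x_j, y_j). Below is a minimal Python sketch of this idea for binary labels, not the authors' reference implementation: it assumes an exponential surrogate loss with a confidence-rated update in the style of Schapire and Singer, whereas the paper's UNN covers a broader family of calibrated surrogates and the multiclass case. All names here (fit_unn, predict_unn, n_rounds) are illustrative.

```python
# Minimal UNN-style leveraged k-NN sketch for binary labels y in {-1, +1}.
# Assumption: exponential surrogate loss; brute-force neighbor search.
import numpy as np

def knn_indices(X, k):
    """For each row of X, indices of its k nearest neighbors (excluding itself)."""
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    np.fill_diagonal(d, np.inf)
    return np.argsort(d, axis=1)[:, :k]

def fit_unn(X, y, k, n_rounds=5, eps=1e-12):
    """Learn one leveraging coefficient alpha_j per training prototype."""
    n = len(X)
    nn = knn_indices(X, k)
    # Reciprocal neighborhood: R[j] = examples that have prototype j among their k-NN;
    # these are exactly the examples on which weak classifier j does not abstain.
    R = [np.where((nn == j).any(axis=1))[0] for j in range(n)]
    w = np.ones(n)        # boosting weights over training examples
    alpha = np.zeros(n)   # leveraging coefficients, one per prototype
    for _ in range(n_rounds):
        for j in range(n):
            idx = R[j]
            if len(idx) == 0:
                continue
            agree = idx[y[idx] == y[j]]
            disagree = idx[y[idx] != y[j]]
            wp, wm = w[agree].sum() + eps, w[disagree].sum() + eps
            delta = 0.5 * np.log(wp / wm)   # confidence-rated coefficient update
            alpha[j] += delta
            # Exponential-loss reweighting of the examples prototype j votes on.
            w[idx] *= np.exp(-delta * y[idx] * y[j])
    return alpha

def predict_unn(Xtr, ytr, alpha, Xte, k):
    """Strong classifier: sign of the leveraged vote of the k closest prototypes."""
    d = np.linalg.norm(Xte[:, None, :] - Xtr[None, :, :], axis=2)
    nn = np.argsort(d, axis=1)[:, :k]
    return np.sign((alpha[nn] * ytr[nn]).sum(axis=1))

if __name__ == "__main__":
    # Toy sanity check on a linearly separable problem.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(100, 2))
    y = np.where(X[:, 0] + X[:, 1] > 0, 1, -1)
    alpha = fit_unn(X, y, k=5)
    print("train accuracy:", (predict_unn(X, y, alpha, X, k=5) == y).mean())
```

Fitting one coefficient per prototype is what gives UNN its pruning effect: a prototype whose reciprocal neighborhood votes mostly against its own label receives a small or negative alpha_j, and thus contributes little to the leveraged vote at test time.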
