Fast k-Nearest Neighbor Search for Face Identification Using Bounds of Residual Score

A novel fast k-nearest neighbor (k-NN) search method is proposed for the face identification task. It is well suited for this task because (1) it works well with high dimensionality, (2) it can be used with various similarity scores such as inner product, Euclidean distance, and correlation coefficient, (3) it can achieve not only fast exact k-NN search but much faster approximate search, and (4) it does not require any training or special data structure, resulting in low maintenance cost for the target database. Similarity scores between query and target samples are aggregated sequentially along with their dimensions, and target samples with no possibility of being included in k-NNs are rejected. The possibility is evaluated on the basis of the upper and lower bounds of the score for residual dimensions. Experimental results for a face database demonstrated that the proposed method achieves equal or better accuracy than other methods and is ten times faster than an exhaustive search with no degradation in the rank-k identification rate.

[1]  K. Kise,et al.  What Is the Most Efficient Way to Select Nearest Neighbor Candidates for Fast Approximate Nearest Neighbor Search ? , 2013 .

[2]  Jian Sun,et al.  Bayesian Face Revisited: A Joint Formulation , 2012, ECCV.

[3]  Bohyung Han,et al.  A fast nearest neighbor search algorithm by nonlinear embedding , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Sameer A. Nene,et al.  A simple algorithm for nearest neighbor search in high dimensions , 1997 .

[5]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Marwan Mattar,et al.  Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments , 2008 .

[7]  Yi-Ching Liaw,et al.  Fast exact k nearest neighbors search using an orthogonal search tree , 2010, Pattern Recognit..

[8]  Alexandr Andoni,et al.  Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[9]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[10]  Nicole Immorlica,et al.  Locality-sensitive hashing scheme based on p-stable distributions , 2004, SCG '04.

[11]  Moni Naor,et al.  Optimal aggregation algorithms for middleware , 2001, PODS.

[12]  Gerhard Weikum,et al.  IO-Top-k: index-access optimized top-k query processing , 2006, VLDB.

[13]  Tomokazu Sato,et al.  What is the Most EfficientWay to Select Nearest Neighbor Candidates for Fast Approximate Nearest Neighbor Search? , 2013, 2013 IEEE International Conference on Computer Vision.

[14]  Svetlana Lazebnik,et al.  Iterative quantization: A procrustean approach to learning binary codes , 2011, CVPR 2011.

[15]  Wendy Hui Wang,et al.  The Threshold Algorithm: From Middleware Systems to the Relational Engine , 2007, IEEE Transactions on Knowledge and Data Engineering.