Deep neural networks and maximum likelihood search for approximate nearest neighbor in video-based image recognition

We analyzed how to increase the computational efficiency of video-based image recognition methods that match high-dimensional feature vectors extracted by deep convolutional neural networks. We proposed an algorithm for approximate nearest neighbor search. At the first step, for a given video frame, the algorithm verifies the reference image obtained when recognizing the previous frame. After that, the frame is compared with a small number of reference images. Each next reference image to examine is chosen so as to maximize the conditional probability density of the distances to the reference instances tested at previous steps. To reduce the required memory, we precompute only the distances from all images to a small number of instances (pivots). In experiments with face photos from the Labeled Faces in the Wild and PubFig83 datasets and with video data from YouTube Faces, we showed that our algorithm accelerates the recognition procedure by a factor of 1.4–4 compared with known approximate nearest neighbor methods.
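The search steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`pivot_table`, `ann_search`), the Euclidean distance, the parameters `accept_dist`, `max_checks` and `sigma`, and in particular the Gaussian form of the likelihood used to rank candidates are all assumptions made for illustration. The abstract does not state which probability density the method actually uses.

```python
import numpy as np


def pivot_table(refs, pivot_idx):
    """Precompute distances from every reference vector to each pivot."""
    diff = refs[:, None, :] - refs[pivot_idx][None, :, :]
    return np.linalg.norm(diff, axis=2)  # shape: (n_refs, n_pivots)


def ann_search(query, refs, pivot_idx, table, prev_idx=None,
               accept_dist=0.5, max_checks=10, sigma=1.0):
    """Return (best index, its distance, number of distances computed).

    All parameters are hypothetical; `sigma` is the spread of the
    assumed Gaussian model, not a value taken from the paper.
    """
    def dist(i):
        return float(np.linalg.norm(query - refs[i]))

    checked = {}  # reference index -> distance to the query

    # Step 1: verify the reference image found for the previous frame.
    if prev_idx is not None:
        checked[prev_idx] = dist(prev_idx)
        if checked[prev_idx] <= accept_dist:
            return prev_idx, checked[prev_idx], len(checked)

    # Step 2: measure the distances from the query to the pivots.
    for p in pivot_idx:
        if p not in checked:
            checked[p] = dist(p)
    q_to_pivots = np.array([checked[p] for p in pivot_idx])

    # Step 3: rank references by an assumed Gaussian log-likelihood of the
    # observed query-to-pivot distances, then examine the most likely ones.
    log_lik = -np.sum((table - q_to_pivots) ** 2, axis=1) / (2 * sigma ** 2)
    for i in np.argsort(-log_lik):
        if len(checked) >= max_checks:
            break
        i = int(i)
        if i not in checked:
            checked[i] = dist(i)
            if checked[i] <= accept_dist:
                break  # close enough, stop early

    best = min(checked, key=checked.get)
    return best, checked[best], len(checked)
```

In this sketch only the `n_refs × n_pivots` table is stored, rather than the full matrix of pairwise distances between reference images, which mirrors the memory saving mentioned in the abstract. For a video, `prev_idx` would be set to the index returned for the preceding frame, so that consecutive frames showing the same subject can terminate after a single distance computation.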
