Beyond Mahalanobis distance: Learning second-order discriminant function for people verification

People verification is a challenging and important task which finds many applications in modern surveillance and video retrieval systems. In this problem, metric learning approaches have played an important role by trying to bridge the semantic gap between image features and people's identities. However, we believe that the traditional Mahalanobis distance is limited in capturing the diversity of visual phenomenon, and hence insufficient for complicated tasks such as people verification. In this paper, we introduce a novel discriminant function which generalizes the classical Mahalanobis distance. Our approach considers a quadratic function directly on the space of image pairs. The resulting decision boundary is therefore in a general shape and not limited to ellipsoids enforced by Mahalanobis distance. To achieve computational efficiency, we develop a generalized SVM-type solver in dual space. Experimental results on the “Labeled Faces in the Wild” dataset show that our method outperforms the classical Mahalanobis distance in the people verification problem.

[1]  Hai Tao,et al.  Evaluating Appearance Models for Recognition, Reacquisition, and Tracking , 2007 .

[2]  Peng Li,et al.  Distance Metric Learning with Eigenvalue Optimization , 2012, J. Mach. Learn. Res..

[3]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[4]  Hai Tao,et al.  Viewpoint Invariant Pedestrian Recognition with an Ensemble of Localized Features , 2008, ECCV.

[5]  Frédéric Jurie,et al.  Learning Visual Similarity Measures for Comparing Never Seen Objects , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Erik G. Learned-Miller,et al.  Discriminative Training of Hyper-feature Models for Object Identification , 2006, BMVC.

[7]  Jitendra Malik,et al.  Learning Globally-Consistent Local Distance Functions for Shape-Based Image Retrieval and Classification , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[8]  Robert Tibshirani,et al.  Discriminant Adaptive Nearest Neighbor Classification , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  P. Mahalanobis On the generalized distance in statistics , 1936 .

[10]  Chih-Jen Lin,et al.  Working Set Selection Using Second Order Information for Training Support Vector Machines , 2005, J. Mach. Learn. Res..

[11]  Satoshi Ito,et al.  Random ensemble metrics for object recognition , 2011, 2011 International Conference on Computer Vision.

[12]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[13]  Alex Pentland,et al.  Bayesian face recognition , 2000, Pattern Recognit..

[14]  Geoffrey E. Hinton,et al.  Neighbourhood Components Analysis , 2004, NIPS.

[15]  Erik G. Learned-Miller,et al.  Unsupervised Joint Alignment of Complex Images , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[16]  Amir Globerson,et al.  Metric Learning by Collapsing Classes , 2005, NIPS.

[17]  Samy Bengio,et al.  An Online Algorithm for Large Scale Image Similarity Learning , 2009, NIPS.

[18]  Cordelia Schmid,et al.  Is that you? Metric learning approaches for face identification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[19]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[20]  Thorsten Joachims,et al.  Learning a Distance Metric from Relative Comparisons , 2003, NIPS.

[21]  Marwan Mattar,et al.  Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments , 2008 .

[22]  Rong Jin,et al.  Distance Metric Learning: A Comprehensive Survey , 2006 .

[23]  Yaniv Taigman,et al.  Descriptor Based Methods in the Wild , 2008 .

[24]  Yoram Singer,et al.  Online and batch learning of pseudo-metrics , 2004, ICML.

[25]  Nicolas Pinto,et al.  How far can you get with a modern face recognition test set using only simple features? , 2009, CVPR.

[26]  Richard Szeliski,et al.  Finding People in Repeated Shots of the Same Scene , 2006, BMVC.

[27]  Inderjit S. Dhillon,et al.  Information-theoretic metric learning , 2006, ICML '07.