Coupled metric learning for single-shot versus single-shot person reidentification

Abstract. Person reidentification tackles the problem of building a correspondence between different images of the same person captured by distributed cameras. To date, attempts to solve this problem have focused on either feature representation or learning methods. Usually, the greater the number of the samples for each person, the better the reidentification performance is. However, in the real world, we may not be able to acquire enough samples to give acceptable performance. Here, we focus on the so-called “single-shot versus single-shot” problem: matching one image of a person to another. Because of the extremely small sample class size, there is limited scope to statistically weaken the empirical risk for hand-crafted feature representation. Therefore, we resort to metric learning methods, such as the ranking-specialized metric learning to rank (MLR) and the classification-based maximally collapsing metric learning (MCML). Taking advantage of the complementarity between them, we propose a novel “coupled metric learning” approach. This searches for the optimal linear projection for the original feature space using MCML before minimizing the ranking loss via MLR. Experiments on widely used benchmark datasets show encouraging results.

[1]  Luc Van Gool,et al.  Depth and Appearance for Mobile Scene Analysis , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[2]  Li Bai,et al.  Cosine Similarity Metric Learning for Face Verification , 2010, ACCV.

[3]  Slawomir Bak,et al.  Learning to Match Appearances by Correlations in a Covariance Metric Space , 2012, ECCV.

[4]  Shaogang Gong,et al.  Associating Groups of People , 2009, BMVC.

[5]  Narendra Ahuja,et al.  Pedestrian Recognition with a Learned Metric , 2010, ACCV.

[6]  Amir Globerson,et al.  Metric Learning by Collapsing Classes , 2005, NIPS.

[7]  Shaogang Gong,et al.  Reidentification by Relative Distance Comparison , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Gert R. G. Lanckriet,et al.  Metric Learning to Rank , 2010, ICML.

[9]  Slawomir Bak,et al.  Boosted human re-identification using Riemannian manifolds , 2012, Image Vis. Comput..

[10]  Duc Phu Chau,et al.  Multi-target tracking by discriminative analysis on Riemannian manifold , 2012, 2012 19th IEEE International Conference on Image Processing.

[11]  Alessandro Perina,et al.  Person re-identification by symmetry-driven accumulation of local features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12]  Hai Tao,et al.  Evaluating Appearance Models for Recognition, Reacquisition, and Tracking , 2007 .

[13]  Masayuki Mukunoki,et al.  Set Based Discriminative Ranking for Recognition , 2012, ECCV.

[14]  Peter H. Tu,et al.  Appearance-based person reidentification in camera networks: problem overview and current approaches , 2011, J. Ambient Intell. Humaniz. Comput..

[15]  Thorsten Joachims,et al.  Cutting-plane training of structural SVMs , 2009, Machine Learning.

[16]  Cordelia Schmid,et al.  Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[17]  Cordelia Schmid,et al.  Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, CVPR Workshops.

[18]  Slawomir Bak,et al.  Person Re-identification Using Haar-based and DCD-based Signature , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[19]  Inderjit S. Dhillon,et al.  Information-theoretic metric learning , 2006, ICML '07.

[20]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[21]  Shaogang Gong,et al.  Person Re-Identification by Support Vector Ranking , 2010, BMVC.

[22]  Alessandro Perina,et al.  Multiple-Shot Person Re-identification by HPE Signature , 2010, 2010 20th International Conference on Pattern Recognition.

[23]  Masayuki Mukunoki,et al.  Optimizing Mean Reciprocal Rank for person re-identification , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[24]  Larry S. Davis,et al.  Learning Discriminative Appearance-Based Models Using Partial Least Squares , 2009, 2009 XXII Brazilian Symposium on Computer Graphics and Image Processing.

[25]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[26]  Slawomir Bak,et al.  Multiple-shot human re-identification by Mean Riemannian Covariance Grid , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[27]  Ramakant Nevatia,et al.  Multi-target tracking by on-line learned discriminative appearance models , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[28]  Vittorio Murino,et al.  Custom Pictorial Structures for Re-identification , 2011, BMVC.