Person Re-identification by Smooth Metric Learning

Person re-identification, which is to match people across areas covered by multiple non-overlapping surveillance cameras, has drawn great research interests in the video surveillance domain. Previous research mainly focus on feature extraction and distance measure, where the former aims to find robust feature representation while the latter seeks to learn an optimal metric space. Because of the introduction of supervised information, metric learning methods can usually achieve better performance over feature based methods. As one of the most representative metric learning method, the Large Margin Nearest Neighbor (LMNN) algorithm was recently applied in person re-identification task and achieved satisfactory results [1]. However, LMNN uses a standard hinge loss function, which is neither differentiable everywhere nor time-efficiency due to the usage of all training samples. In this paper, we propose to replace hinge loss function with the logistic loss function, which transforms LMNN to a smooth unconstrained convex optimization problem easily solved with gradient descent algorithm. Thereafter, we further design a stochastic sampling scheme to accelerate the optimization process of the above problem with randomly selected training samples. Extensive comparative experiments conducted on two standard datasets have shown the effectiveness and efficiency of the proposed algorithm over a series of standard baseline methods.

[1]  Shaogang Gong,et al.  Person re-identification by probabilistic relative distance comparison , 2011, CVPR 2011.

[2]  Inderjit S. Dhillon,et al.  Information-theoretic metric learning , 2006, ICML '07.

[3]  Hai Tao,et al.  Viewpoint Invariant Pedestrian Recognition with an Ensemble of Localized Features , 2008, ECCV.

[4]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[5]  Horst Bischof,et al.  Large scale metric learning from equivalence constraints , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Shaogang Gong,et al.  Person Re-Identification by Support Vector Ranking , 2010, BMVC.

[7]  Xiaogang Wang,et al.  Shape and Appearance Context Modeling , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[8]  Xuelong Li,et al.  Person Re-Identification by Regularized Smoothing KISS Metric Learning , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Tong Zhang,et al.  Solving large scale linear prediction problems using stochastic gradient descent algorithms , 2004, ICML.

[10]  Alessandro Perina,et al.  Person re-identification by symmetry-driven accumulation of local features , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Andrew McCallum,et al.  Dynamic conditional random fields: factorized probabilistic models for labeling and segmenting sequence data , 2004, J. Mach. Learn. Res..

[12]  Hai Tao,et al.  Evaluating Appearance Models for Recognition, Reacquisition, and Tracking , 2007 .

[13]  Narendra Ahuja,et al.  Pedestrian Recognition with a Learned Metric , 2010, ACCV.

[14]  Horst Bischof,et al.  Dense appearance modeling and efficient learning of camera transitions for person re-identification , 2012, 2012 19th IEEE International Conference on Image Processing.