Person Re-Identification Over Camera Networks Using Multi-Task Distance Metric Learning

Person reidentification in a camera network is a valuable yet challenging problem. Existing methods learn a single, common Mahalanobis distance metric from data collected across different cameras and then use the learned metric to identify people in the images. However, the cameras in a network have different settings, and the recorded images are strongly affected by variability in illumination conditions, camera viewing angles, and background clutter. Using a common metric for person reidentification on different camera pairs overlooks these differences in camera settings. At the same time, manually labeling people in images from surveillance videos is very time-consuming, so labeled data are scarce. For example, in most existing person reidentification data sets, only one image of a person is collected from each of only two cameras; directly learning a unique Mahalanobis distance metric for each camera pair from such insufficient labeled data is therefore susceptible to over-fitting. In this paper, we reformulate person reidentification in a camera network as a multitask distance metric learning problem. The proposed method learns multiple Mahalanobis distance metrics to cope with the complicated conditions that exist in typical camera networks. These metrics are different but related, so we learn them jointly and add a joint regularization term to alleviate over-fitting. Furthermore, by extending maximally collapsing metric learning (MCML), we present a novel multitask maximally collapsing metric learning (MtMCML) model for person reidentification in a camera network. Experimental results demonstrate that formulating person reidentification over camera networks as a multitask distance metric learning problem can improve performance, and that our proposed MtMCML works substantially better than other current state-of-the-art person reidentification methods.
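The idea of several related Mahalanobis metrics tied together by a joint regularizer can be illustrated with a minimal sketch. It is not the MtMCML model itself: the abstract does not give the form of the regularizer, so the pairwise squared Frobenius penalty used here is an assumption, and the names `mahalanobis_sq`, `joint_penalty`, `M_pair_a`, and `M_pair_b` are hypothetical.

```python
def mahalanobis_sq(x, y, M):
    """Squared Mahalanobis distance (x - y)^T M (x - y) for a PSD matrix M."""
    d = [a - b for a, b in zip(x, y)]
    n = len(d)
    return sum(d[i] * M[i][j] * d[j] for i in range(n) for j in range(n))


def joint_penalty(metrics):
    """Assumed joint regularizer: sum of squared Frobenius distances
    between every pair of task-specific metrics. It is zero when all
    metrics coincide and grows as they drift apart."""
    total = 0.0
    for s in range(len(metrics)):
        for t in range(s + 1, len(metrics)):
            total += sum(
                (a - b) ** 2
                for row_s, row_t in zip(metrics[s], metrics[t])
                for a, b in zip(row_s, row_t)
            )
    return total


# Two hypothetical camera-pair tasks, each with its own 2x2 metric.
M_pair_a = [[1.0, 0.0], [0.0, 1.0]]
M_pair_b = [[2.0, 0.0], [0.0, 1.0]]

# The same feature pair is scored differently under each task's metric.
x, y = [1.0, 2.0], [0.0, 0.0]
print(mahalanobis_sq(x, y, M_pair_a))        # 5.0
print(mahalanobis_sq(x, y, M_pair_b))        # 6.0
print(joint_penalty([M_pair_a, M_pair_b]))   # 1.0
```

In a multitask objective of this kind, each task's data-fitting loss would be summed with a weighted penalty of this sort, so that camera pairs with few labeled images borrow strength from the others.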
