Paper information - Deep adaptive feature embedding with local sample distributions for person re-identification

Person re-identification (re-id) aims to match pedestrians observed across disjoint camera views. It has attracted increasing attention in computer vision because of its importance to surveillance systems. To combat the major challenge of cross-view visual variation, deep embedding approaches learn a compact feature space from images in which Euclidean distances correspond to the cross-view similarity metric. However, a global Euclidean distance cannot faithfully characterize the ideal similarity in a complex visual feature space, because features of pedestrian images follow unknown distributions owing to large variations in pose, illumination and occlusion. Moreover, a global Euclidean distance cannot capture intra-personal training samples within a local range, which are robust guides for deep embedding against uncontrolled variations. In this paper, we study person re-id by proposing a novel sampling strategy that mines suitable positives (i.e., intra-class samples) within a local range to improve the deep embedding under large intra-class variations. Our method learns a deep similarity metric adaptive to the local sample structure by minimizing each sample's local distances while propagating through the relationships between samples to attain minimization over the whole class. To this end, a novel objective function is proposed to jointly optimize similarity metric learning, local positive mining and robust deep feature embedding. This attains local discrimination by selecting local-ranged positive samples, and the learned features are robust to dramatic intra-class variations. Experiments on benchmarks show that our method achieves state-of-the-art results. (C) 2017 Elsevier Ltd. All rights reserved.
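
The idea of mining positives within a local range can be illustrated with a small NumPy sketch. This is not the paper's objective function: it omits the deep network, the negative samples and the joint optimization, and it assumes that "local range" means the k nearest same-identity samples under squared Euclidean distance. The names `mine_local_positives`, `local_positive_loss` and the parameter `k` are hypothetical and chosen only for illustration.

```python
import numpy as np


def mine_local_positives(features, labels, k=2):
    """For each sample, pick its k nearest same-identity samples.

    Assumption: "local range" is approximated by k nearest neighbours
    within the same identity; the paper may define it differently.
    """
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    n = len(labels)
    # Pairwise squared Euclidean distances, shape (n, n).
    diff = features[:, None, :] - features[None, :, :]
    dist = np.sum(diff ** 2, axis=-1)
    positives = []
    for i in range(n):
        # Candidates: same identity as sample i, excluding i itself.
        same = np.flatnonzero((labels == labels[i]) & (np.arange(n) != i))
        # Keep only the k closest candidates (the "local" positives).
        nearest = same[np.argsort(dist[i, same])][:k]
        positives.append(nearest)
    return positives, dist


def local_positive_loss(features, labels, k=2):
    """Mean squared distance from each sample to its local positives."""
    positives, dist = mine_local_positives(features, labels, k)
    per_sample = [dist[i, p].mean() for i, p in enumerate(positives) if len(p) > 0]
    return float(np.mean(per_sample)) if per_sample else 0.0
```

With a small `k`, a sample is pulled only toward its closest same-identity neighbours, so an identity whose images spread widely is tightened through chains of neighbours rather than by forcing every pair together at once. Setting `k` to the class size minus one recovers the global intra-class distance that the abstract argues against.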
