Saliency-Weighted Global-Local Fusion for Person Re-identification

Many features have been proposed to improve the accuracy of person re-identification. Due to the illumination and viewpoint changes between different cameras, individual feature is less discriminative to separate different persons. In this paper, we propose a saliency-weighted feature descriptor and global-local fusion optimization for person re-identification. Firstly, the weights on pixels are calculated via saliency detection method, then the computed weights are integrated into local maximal occurrence (LOMO) feature descriptor. Secondly, the saliency weights are used to update the metric learning distance in training so that we can learn a new metric matrix for testing. And then, the whole person image is divided into upper and lower halves. A novel global-local fusion method is proposed to combine local and global regions together in the most appropriate way. After that an optimization algorithm is proposed to learn the weights among upper half, lower half and the whole image. According to those weights, a final fused distance is obtained. Experimental results show that the proposed method outperforms many state-of-the-art person re-identification methods.

[1]  Xuelong Li,et al.  Unsupervised image saliency detection with Gestalt-laws guided optimization and visual attention based refinement , 2018, Pattern Recognit..

[2]  Li Hou,et al.  Distance aggregation for person re-identification using simulated annealing algorithm , 2016, 2016 International Conference on Audio, Language and Image Processing (ICALIP).

[3]  Shengcai Liao,et al.  Large Scale Similarity Learning Using Similar Pairs for Person Verification , 2016, AAAI.

[4]  Shaogang Gong,et al.  Multi-camera activity correlation analysis , 2009, CVPR.

[5]  Jinchang Ren,et al.  Special issue on multimodal data fusion for multidimensional signal processing , 2016, Multidimens. Syst. Signal Process..

[6]  Yijun Yan,et al.  Adaptive fusion of color and spatial features for noise-robust retrieval of colored logo and trademark images , 2016, Multidimens. Syst. Signal Process..

[7]  Zheng Wang,et al.  A deep-learning based feature hybrid framework for spatiotemporal saliency detection inside videos , 2018, Neurocomputing.

[8]  Yijun Yan,et al.  Fusion of Dominant Colour and Spatial Layout Features for Effective Image Retrieval of Coloured Logos and Trademarks , 2015, 2015 IEEE International Conference on Multimedia Big Data.

[9]  Xiang Li,et al.  An enhanced deep feature representation for person re-identification , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[10]  Ron Sun,et al.  Anatomy of the Mind: a Quick Overview , 2017, Cognitive Computation.

[11]  Hai Tao,et al.  Evaluating Appearance Models for Recognition, Reacquisition, and Tracking , 2007 .

[12]  Shengcai Liao,et al.  Person re-identification by Local Maximal Occurrence representation and metric learning , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Yang Li,et al.  Hierarchical and multi-featured fusion for effective gait recognition under variable scenarios , 2015, Pattern Analysis and Applications.

[14]  Yijun Yan,et al.  Fusion of block and keypoints based approaches for effective copy-move image forgery detection , 2016, Multidimens. Syst. Signal Process..

[15]  Simone Frintrop,et al.  Traditional saliency reloaded: A good old model in new shape , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Stephen Marshall,et al.  Cognitive Fusion of Thermal and Visible Imagery for Effective Detection and Tracking of Pedestrians in Videos , 2018, Cognitive Computation.