论文信息 - Specific Person Retrieval via Incomplete Text Description

Specific Person Retrieval via Incomplete Text Description

Searching for specific persons from surveillance videos captured by different cameras, is a key yet under-addressed challenge in multimedia system. Related person retrieval works mainly focus on searching person by visual appearance, known as person re-identification. However, the initial visual image may not be available in some practical applications. For example, the criminal is described by a text description indirectly, "A young woman wearing a red casual with a backpack", the traditional methods can not conquer this issue. Based on a set of pre-defined attributes that the text description query can be transformed to an attribute vector, thus can be used to retrieval in the gallery set. And yet, the user-provided attributes are sometimes incomplete. This new issue is defined as Specific Person Retrieval via Incomplete Text Description. In this paper, we conduct a specific attribute completion to enrich the original text query and generate a more expressive attribute vector. Then, a pairwise-based metric learning is introduced for completed attribute vectors. Extensive experiments conducted on two benchmark datasets have shown our superior performance.

[1] Yang Yu,et al. Automatic image annotation using group sparsity , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2] Horst Bischof,et al. Large scale metric learning from equivalence constraints , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[3] Lei Wu,et al. Tag Completion for Image Retrieval , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4] Zheng Wang,et al. Coupled-View Based Ranking Optimization for Person Re-identification , 2015, MMM.

[5] Xiaogang Wang,et al. Shape and Appearance Context Modeling , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[6] Shai Shalev-Shwartz,et al. Stochastic dual coordinate ascent methods for regularized loss , 2012, J. Mach. Learn. Res..

[7] Richard I. Hartley,et al. Person Reidentification Using Spatiotemporal Appearance , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[8] Horst Bischof,et al. Person Re-identification by Descriptive and Discriminative Classification , 2011, SCIA.

[9] Xiaochun Cao,et al. Image Retrieval and Ranking via Consistently Reconstructing Multi-attribute Queries , 2014, ECCV.

[10] Shaogang Gong,et al. Towards Person Identification and Re-identification with Attributes , 2012, ECCV Workshops.

[11] Xiaogang Wang,et al. Unsupervised Salience Learning for Person Re-identification , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[12] Hai Tao,et al. Evaluating Appearance Models for Recognition, Reacquisition, and Tracking , 2007 .

[13] Jianmin Wang,et al. Image Tag Completion via Image-Specific and Tag-Specific Linear Sparse Reconstructions , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[14] Xiao Liu,et al. Attribute-restricted latent topic model for person re-identification , 2012, Pattern Recognit..

[15] Duy-Dinh Le,et al. AttRel: An Approach to Person Re-Identification by Exploiting Attribute Relationships , 2015, MMM.

[16] Xiaoou Tang,et al. Pedestrian Attribute Recognition At Far Distance , 2014, ACM Multimedia.

[17] Shaogang Gong,et al. Person Re-identification by Attributes , 2012, BMVC.