Tracked Instance Search

In this work we propose tracking as a generic addition to the instance search task. From video data perspective, much information that can be used is not taken into account in the traditional instance search approach. This work aims to provide insights on exploiting such existing information by means of tracking and the proper combination of the results, independently of the instance search system. We also present a study on the improvement of the system when using multiple independent instances (up to 4) of the same person. Experimental results show that our system improves substantially its performance when using tracking. Best configuration improves from mAP = 0.447 to mAP = 0.511for a single example, and from mAP = 0.647 to mAP = 0.704for multiple (4) given examples.

[1]  Shin'ichi Satoh,et al.  Region-Based Image Retrieval Revisited , 2017, ACM Multimedia.

[2]  Hussein Hussein,et al.  Technische Universität Chemnitz at TRECVID Instance Search 2015 , 2014, TRECVID.

[3]  Andrew Zisserman,et al.  Deep Face Recognition , 2015, BMVC.

[4]  Zhang Wen,et al.  PKU_ICST at TRECVID 2018: Instance Search Task , 2013, TRECVID.

[5]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Jonathan G. Fiscus,et al.  TRECVID 2016: Evaluating Video Search, Video Event Detection, Localization, and Hyperlinking , 2016, TRECVID.

[7]  Andrew Zisserman,et al.  Multiple queries for large scale specific object retrieval , 2012, BMVC.

[8]  Shin'ichi Satoh,et al.  Multi-image aggregation for better visual object retrieval , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Linjie Xing,et al.  Shenzhen Institutes of Advanced Technology, CAS, China at TRECVID INS 2016 , 2016, TRECVID.

[10]  Yu Qiao,et al.  Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks , 2016, IEEE Signal Processing Letters.

[11]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Shin'ichi Satoh,et al.  Large-Scale R-CNN with Classifier Adaptive Quantization , 2016, ECCV.

[13]  James Philbin,et al.  FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Georges Quénot,et al.  IRIM at TRECVID 2016: Instance Search , 2016, TRECVID.

[15]  Yiannis Kompatsiaris,et al.  ITI-CERTH participation in TRECVID 2018 , 2017, TRECVID.

[16]  Yu Qiao,et al.  A Discriminative Feature Learning Approach for Deep Face Recognition , 2016, ECCV.

[17]  Cordelia Schmid,et al.  Product Quantization for Nearest Neighbor Search , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Alan F. Smeaton,et al.  Dublin City University and Partners' Participation in the INS and VTT Tracks at TRECVid 2016 , 2016, TRECVID.