EURECOM at TrecVid 2011: The Light Semantic Indexing Task

This year EURECOM participated in the TRECVID light Semantic Indexing (SIN) Task for the submission of four different runs for 50 concepts. Our submission builds on the runs submitted last year at the 2011 SIN task with the first two runs following the same pattern as those of last year. The details of 2011 system can be found in [8]. One of our run adds uploaders bias to the pool of visual features while another run is prepared in collaboration with ECNU. Our basic run adds visual features based on larger vectors to the pool of features of last year’s base run. Larger dictionaries provide a finer representation of the visual/clustering space and increase the precision of the retrieval task. Like in last year’s submission we add two global descriptors to visual features with one capturing temporal statistics along each shot and the other capturing salient details or gist of a keyframe. Then we add textual metadata based information that has been provided with the 2012 video database to the visual features. We further benefit from the metadata by including uploaders bias to increase scores of videos uploaded by same users. The runs are composed as follows:

[1]  大恵 俊一郎,et al.  拡張Local Binary Patternを用いたテクスチャー分割 , 2000 .

[2]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[3]  Chong-Wah Ngo,et al.  Semantic Indexing and Multimedia Event Detection: ECNU at TRECVID 2012 , 2012, TRECVID.

[4]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[6]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Bernard Mérialdo,et al.  Saliency moments for image categorization , 2011, ICMR.

[9]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[10]  Bernard Mérialdo,et al.  Eurecom and ECNU at TRECVID 2010 : The Semantic Indexing Task , 2010, TRECVID.

[11]  Antonio Torralba,et al.  LabelMe: A Database and Web-Based Tool for Image Annotation , 2008, International Journal of Computer Vision.

[12]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Bernard Mérialdo,et al.  Efficient Spatio-Temporal Edge Descriptor , 2012, MMM.

[14]  Andrea Vedaldi,et al.  Vlfeat: an open and portable library of computer vision algorithms , 2010, ACM Multimedia.

[15]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[16]  Yun Fu,et al.  Is gender recognition affected by age? , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.