Participation at TRECVID 2011 Semantic Indexing & Content-based Copy Detection Tasks

Semantic Indexing Task (SIN) Run No. Run ID Run Description infMAP (%) 1 F A IUPR-DFKI 1 Fisher Kernel + SVMs 2.86 2 F A IUPR-DFKI 2 Color Correlogram + SVMs 5.38 3 F A IUPR-DFKI 3 Fisher Kernel fused with Color Correlograms + SVMs 5.0 4 F A IUPR-DFKI 4 Fisher Kernel + kNN 0.71 Content-based Copy Detection (CCD) Run No. Run ID Run Description Opt.NDCR 1 *iupr-dfki.fsift F-SIFT+BoW+HE+EWGC 0.776 2 *iupr-dfki.fsift2 F-SIFT+BoW+HE+EWGC 0.923 3 SIFT SIFT+BoW+HE+EWGC 0.884 4 SIFT+PV SIFT+BoW+HE+EWGC+PV 0.501 5 F-SIFT+PV F-SIFT+BoW+HE+EWGC+PV 0.446 *: officially submitted run. This paper describes the TRECVID 2011 participation of the IUPR-DFKI team in the semantic indexing task (SIN) and content based copy detection task (CCD) task. For SIN, this years participation was dominated by an significant increase of vocabulary concept size from 130 to 346 concepts. In particular the system setup has been changed to last year’s participation [6] with respect to computational demands employing less computational costly features for classification and no usage of external training sources like YouTube. For CCD, this years participation is aimed at testing the flip invariant SIFT applied in video-only CCD. At the same time, we investigated how well we could achieve by relying on one keypoint feature alone.

[1]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[2]  Stéphane Ayache,et al.  Video Corpus Annotation Using Active Learning , 2008, ECIR.

[3]  Jing Huang,et al.  Image indexing using color correlograms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[5]  Chong-Wah Ngo,et al.  Scale-Rotation Invariant Pattern Entropy for Keypoint-Based Near-Duplicate Detection , 2009, IEEE Transactions on Image Processing.

[6]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[7]  Stephen Kwek,et al.  Applying Support Vector Machines to Imbalanced Datasets , 2004, ECML.

[8]  Paul Over,et al.  TRECVID-2008 content-based copy detection task overview (slides) , 2008 .

[9]  Marcel Worring,et al.  Concept-Based Video Retrieval , 2009, Found. Trends Inf. Retr..

[10]  Cordelia Schmid,et al.  Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search , 2008, ECCV.

[11]  B. S. Manjunath,et al.  Color and texture descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[12]  Paul Over,et al.  High-level feature detection from video in TRECVid: a 5-year retrospective of achievements , 2009 .

[13]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[14]  Chong-Wah Ngo,et al.  On the Annotation of Web Videos by Efficient Near-Duplicate Search , 2010, IEEE Transactions on Multimedia.

[15]  Adrian Ulges,et al.  Keyframe Extraction for Video Tagging & Summarization , 2008, Informatiktage.

[16]  Cordelia Schmid,et al.  Aggregating local descriptors into a compact image representation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[17]  Cordelia Schmid,et al.  INRIA-LEAR'S Video Copy Detection System , 2008, TRECVID.

[18]  Markus Koch,et al.  DFKI and University of Kaiserslautern participation at TRECVID 2010 - Semantic Indexing Task , 2010, TRECVID.

[19]  Cordelia Schmid,et al.  Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).