TRECVID 2009 of MCG-ICT-CAS

This paper describes the highlights of our interactive search system VideoMap for TRECVID 2009. To enhance the efficiency, the system has a map based displaying interface, which gives the user a global view about the similarity relationships among the whole video collection, and provides an active annotating manner to quickly localize the potential positive samples. Meanwhile, the system has powerful multiple modality feedback strategies, including the visual-based feedback, concept-based feedback and community-based feedback. These feedback algorithms can flexibly transform between each other by the automatic optimizing strategy. Finally the multi-strategies feedback achieves the best performance with MAP of 0.186.

[1]  Bernhard Schölkopf,et al.  Ranking on Data Manifolds , 2003, NIPS.

[2]  Luc Van Gool,et al.  Fast scale invariant feature detection and matching on programmable graphics hardware , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[3]  C. Lee Giles,et al.  Efficient identification of Web communities , 2000, KDD '00.

[4]  Cordelia Schmid,et al.  Hamming Embedding and Weak Geometric Consistency for Large Scale Image Search , 2008, ECCV.

[5]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[7]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[8]  Li Chen,et al.  Video copy detection: a comparative study , 2007, CIVR '07.

[9]  Yongdong Zhang,et al.  Distribution-based concept selection for concept-based video retrieval , 2009, ACM Multimedia.

[10]  Yakup Genc,et al.  GPU-based Video Feature Tracking And Matching , 2006 .

[11]  King-Ip Lin,et al.  The ANN-tree: an index for efficient approximate nearest neighbor search , 2001, Proceedings Seventh International Conference on Database Systems for Advanced Applications. DASFAA 2001.

[12]  Chong-Wah Ngo,et al.  Towards optimal bag-of-features for object categorization and semantic video retrieval , 2007, CIVR '07.

[13]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Vasudev Bhaskaran,et al.  Spatiotemporal sequence matching for efficient video copy detection , 2005, IEEE Trans. Circuits Syst. Video Technol..

[15]  Thomas Wiegand,et al.  SIFT Implementation and Optimization for General-Purpose GPU , 2007 .

[16]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..