论文信息 - VIREO @ TRECVID 2011: Instance Search, Semantic Indexing, Multimedia Event Detection and Known-Item Search

VIREO @ TRECVID 2011: Instance Search, Semantic Indexing, Multimedia Event Detection and Known-Item Search

The vireo group participated in four tasks: instance search, semantic indexing, multimedia event detection and known-item search. In this paper,we will present our approaches and discuss the evaluation results. Instance Search (INS): We experimented four runs to contrast the following for instance search: full matching (vireo b) versus partial matching (vireo m); use of weak geometric information (vireo b) versus stronger spatial configuration (vireo s); use of face matching (vireo f).

[1] Christiane Fellbaum,et al. Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[2] Stephen E. Robertson,et al. Okapi/Keenbow at TREC-8 , 1999, TREC.

[3] Cordelia Schmid,et al. Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[4] Martha Palmer,et al. Verb Semantics and Lexical Selection , 1994, ACL.

[5] Hung-Khoon Tan,et al. VIREO at TRECVID 2010: Semantic Indexing, Known-Item Search, and Content-Based Copy Detection , 2010, TRECVID.

[6] Chong-Wah Ngo,et al. Ontology-enriched semantic space for video search , 2007, ACM Multimedia.

[7] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[8] Chong-Wah Ngo,et al. VIREO/DVMM at TRECVID 2009: High-Level Feature Extraction, Automatic Video Search, and Content-Based Copy Detection , 2009, TRECVID.

[9] Andrew Zisserman,et al. Hello! My name is... Buffy'' -- Automatic Naming of Characters in TV Video , 2006, BMVC.

[10] Qiang Yang,et al. Boosting for transfer learning , 2007, ICML '07.

[11] Chong-Wah Ngo,et al. Semantic context transfer across heterogeneous sources for domain adaptive video search , 2009, ACM Multimedia.

[12] Tsuhan Chen,et al. Image retrieval with geometry-preserving visual phrases , 2011, CVPR 2011.

[13] Gang Wang,et al. On the sampling of web images for learning visual concept classifiers , 2010, CIVR '10.

[14] Paul M. B. Vitányi,et al. The Google Similarity Distance , 2004, IEEE Transactions on Knowledge and Data Engineering.

[15] Mubarak Shah,et al. Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching , 2010, TRECVID.

[16] Chong-Wah Ngo,et al. On the Annotation of Web Videos by Efficient Near-Duplicate Search , 2010, IEEE Transactions on Multimedia.

[17] Michael Isard,et al. Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[18] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[19] Rong Yan,et al. Cross-domain video concept detection using adaptive svms , 2007, ACM Multimedia.

[20] Jean-Luc Gauvain,et al. The LIMSI Broadcast News transcription system , 2002, Speech Commun..

[21] Cordelia Schmid,et al. Improving Bag-of-Features for Large Scale Image Search , 2010, International Journal of Computer Vision.