论文信息 - Fudan University at TRECVID 2008

Fudan University at TRECVID 2008

FD IMI LK: This run is only based on the text from the English ASR/MT output provided by NIST and on the text of the topics. FD IMI ZYB: This run is based on the text search and the visual expand from the text search results. FD IMI HXS: This run is based on the concept mapping method. FD IMI ZJ: This run is based on the fusion of concept mapping and visual search. FD IMI ZW: This run is based on average fusion method. FD IMI SZC: This run is only based on multi-model fusion method.

[1] Ahmed K. Elmagarmid,et al. InsightVideo: toward hierarchical video content organization for efficient browsing, summarization and retrieval , 2005, IEEE Transactions on Multimedia.

[2] Paul A. Viola,et al. Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[3] Emilio L. Zapata,et al. A Clustering Technique for Video Copy Detection , 2007, IbPRIA.

[4] Roeland Ordelman,et al. Speech-based Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition , 2007 .

[5] Jianming Hu,et al. Automatic Detection and Verification of Text Regions in News Video Frames , 2002, Int. J. Pattern Recognit. Artif. Intell..

[6] Stephen E. Robertson,et al. Okapi at TREC-7: Automatic Ad Hoc, Filtering, VLC and Interactive , 1998, TREC.

[7] Franciska de Jong,et al. Annotation of Heterogeneous Multimedia Content Using Automatic Speech Recognition , 2007, SAMT.

[8] Shih-Fu Chang,et al. Columbia University’s Baseline Detectors for 374 LSCOM Semantic Visual Concepts , 2007 .

[9] Ruud M. Bolle,et al. Comparison of sequence matching techniques for video copy detection , 2001, IS&T/SPIE Electronic Imaging.

[10] C. Frankel,et al. Distinguishing photographs and graphics on the World Wide Web , 1997, 1997 Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries.

[11] Dong Wang,et al. AP-Based Borda Voting Method for Feature Extraction in TRECVID-2004 , 2005, ECIR.

[12] Paul A. Viola,et al. Robust Real-time Object Detection , 2001 .

[13] Yuxiao Hu,et al. Learning a locality preserving subspace for visual recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[14] Lide Wu,et al. Audio classification based on maximum entropy model , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[15] Jing-Yu Yang,et al. A generalized Foley-Sammon transform based on generalized fisher discriminant criterion and its application to face recognition , 2003, Pattern Recognit. Lett..

[16] Shih-Fu Chang,et al. CU-VIREO 374 : Fusing Columbia 374 and VIREO 374 for Large Scale Semantic Concept Detection , 2008 .

[17] Sanjeev R. Kulkarni,et al. Rapid estimation of camera motion from compressed video with application to video annotation , 2000, IEEE Trans. Circuits Syst. Video Technol..

[18] Justus H. Piater. Mixture Models and Expectation-Maximization , 2005 .

[19] John R. Smith,et al. Large-scale concept ontology for multimedia , 2006, IEEE MultiMedia.

[20] Hui Yu,et al. Fudan University: hierarchical video retrieval with adaptive multi-modal fusion , 2008, CIVR '08.

[21] Yoav Freund,et al. Experiments with a New Boosting Algorithm , 1996, ICML.

[22] Hideyuki Tamura,et al. Textural Features Corresponding to Visual Perception , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[23] Tao Mei,et al. Building a comprehensive ontology to refine video concept detection , 2007, MIR '07.

[24] Junyu Niu,et al. FDU at TREC-10: Filtering, QA, Web and Video Tasks , 2001, TREC.

[25] B. S. Manjunath,et al. Unsupervised Segmentation of Color-Texture Regions in Images and Video , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[26] John R. Smith,et al. Multimedia semantic indexing using model vectors , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[27] Hwann-Tzong Chen,et al. Local discriminant embedding and its variants , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[28] Olivier Buisson,et al. Robust voting algorithm based on labels of behavior for video copy detection , 2006, MM '06.

[29] Xiaofei He,et al. Locality Preserving Projections , 2003, NIPS.

[30] Xuanjing Huang,et al. Language Independent Text Categorization , 2001, NLPRS.

[31] Jean-Luc Gauvain,et al. The LIMSI Broadcast News transcription system , 2002, Speech Commun..

[32] Paul Over,et al. Evaluation campaigns and TRECVid , 2006, MIR '06.

[33] Takeo Kanade,et al. Object Detection Using the Statistics of Parts , 2004, International Journal of Computer Vision.

[34] Ofer Melnik,et al. Mixed group ranks: preference and confidence in classifier combination , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35] John R. Smith,et al. IBM Research TRECVID-2009 Video Retrieval System , 2009, TRECVID.

[36] Ramin Zabih,et al. Comparing images using color coherence vectors , 1997, MULTIMEDIA '96.

[37] Anton Schwaighofer,et al. Learning Gaussian processes from multiple tasks , 2005, ICML.

[38] Matti Pietikäinen,et al. Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[39] Robert M. Haralick,et al. Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[40] William D. Penny,et al. Bayesian Approaches to Gaussian Mixture Modeling , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[41] Rich Caruana,et al. Multitask Learning: A Knowledge-Based Source of Inductive Bias , 1993, ICML.

[42] Yoav Freund,et al. Boosting the margin: A new explanation for the effectiveness of voting methods , 1997, ICML.

[43] Jun Yang,et al. CMU Informedia's TRECVID 2005 Skirmishes , 2005, TRECVID.

[44] George A. Miller,et al. Introduction to WordNet: An On-line Lexical Database , 1990 .

[45] Juyang Weng,et al. Using Discriminant Eigenfeatures for Image Retrieval , 1996, IEEE Trans. Pattern Anal. Mach. Intell..