论文信息 - Eurécom at TRECVid 2006: Extraction of High-level Features and BBC Rushes Exploitation

Eurécom at TRECVid 2006: Extraction of High-level Features and BBC Rushes Exploitation

For the four year we have participated to the high-level feature extraction task and we pursued our e ort on the fusion of classifier outputs. Unfortunatly a single run was submitted for evaluation this year, due to lack of computationnal ressources during the limited time available for training and tuning the entire system. This year’s run is based on a SVM classification scheme. Localised color and texture features were extracted from shot key-frames. Then, SVM classifiers were build per concept on the training data set. The fusion of classifier outputs is finally provided by a multilayer neural network. In BBC rushes exploitation, we explore the description of rushes through a visual dictionary. A set of non-redundant images are segmented into blocks. These blocks are clustered in a small number of classes to create a visual dictionary. Then, we can describe each image by the number of blocks of each class. After, we evaluate the power of this visual dictionary for retrieving images from rushes: if we use one or more blocks from an image as a query, are we able to retrieve the original image, and in which position in the result list. And finally, we organize and present video using this visual dictionary.

Bernard Mérialdo | Benoit Huet | Rachid Benmokhtar | Emilie Dumont

[1] Hugh E. Williams,et al. RMIT University at TRECVID 2004 , 2004, TRECVID.

[2] Andrew Zisserman,et al. Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[3] Zhongfei Zhang,et al. Hidden semantic concept discovery in region based image retrieval , 2004, CVPR 2004.

[4] Nozha Boujemaa,et al. New image retrieval paradigm: logical composition of region categories , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[5] George Cybenko,et al. Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..

[6] Nicu Sebe,et al. Salient Points for Content-Based Retrieval , 2001, BMVC.

[7] Joo-Hwee Lim,et al. Categorizing Visual Contents by Matching Visual "Keywords" , 1999, VISUAL.

[8] Rosalind W. Picard. Toward a Visual Thesaurus , 1995, MIRO.

[9] Hugh E. Williams,et al. RMIT University at TREC 2004 , 2004, TREC.

[10] Jitendra Malik,et al. Blobworld: A System for Region-Based Image Indexing and Retrieval , 1999, VISUAL.

[12] Rosalind W. Picard. A Society of Models for Video and Image Libraries , 1996, IBM Syst. J..

[13] Lei Zhu,et al. Keyblock: an approach for content-based image retrieval , 2000, ACM Multimedia.

[14] Thorsten Joachims,et al. Making large scale SVM learning practical , 1998 .

[15] Pedro F. Felzenszwalb,et al. Efficiently computing a good segmentation , 1998 .

[16] T. Joachims,et al. 1 Making Large-scale Svm Learning Practical , 1999 .