Eurecom and ECNU at TRECVID 2010: The Semantic Indexing Task

in the TRECVID context for spatially independent concepts such as “Nighttime”. We then experiment with multi-modal analysis, combining the visual features with the textual metadata provided with the 2010 video database. As a final run, we try a new system based on Hamming Embedding and weighted visual words. The runs are composed as follows:

1. EURECOM Fusebase: This run fuses a pool of visual features, namely the SIFT descriptor, the Color Moments global descriptor, the Wavelet Feature and the Edge Histogram. On top of this, a face detector and a re-ranking method based on the video knowledge are applied, as in the fourth run that EURECOM submitted to the 2009 edition.
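The text above does not specify how the pool of visual features is fused. As a purely illustrative sketch, the following Python code assumes a late-fusion scheme: each feature yields one detection score per shot for a given concept, scores are min-max normalized per feature, and the normalized scores are combined by a weighted average. The function names, the feature keys, the uniform weights and the example scores are all hypothetical and are not taken from the EURECOM system.

```python
from typing import Dict, List, Optional


def min_max_normalize(scores: List[float]) -> List[float]:
    """Rescale scores to [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def fuse_scores(
    per_feature_scores: Dict[str, List[float]],
    weights: Optional[Dict[str, float]] = None,
) -> List[float]:
    """Weighted average of per-feature normalized scores (assumed late fusion)."""
    names = list(per_feature_scores)
    n_shots = len(per_feature_scores[names[0]])
    if any(len(per_feature_scores[n]) != n_shots for n in names):
        raise ValueError("every feature must score the same number of shots")
    if weights is None:
        # Assumption: uniform weights when none are supplied.
        weights = {n: 1.0 for n in names}
    total = sum(weights[n] for n in names)
    normalized = {n: min_max_normalize(per_feature_scores[n]) for n in names}
    return [
        sum(weights[n] * normalized[n][i] for n in names) / total
        for i in range(n_shots)
    ]


def rank_shots(fused: List[float]) -> List[int]:
    """Return shot indices sorted by decreasing fused score."""
    return sorted(range(len(fused)), key=lambda i: fused[i], reverse=True)


# Hypothetical scores for three shots and one concept.
example_scores = {
    "sift": [0.2, 0.8, 0.5],
    "color_moments": [0.1, 0.9, 0.3],
    "wavelet": [0.4, 0.6, 0.5],
    "edge_histogram": [0.3, 0.7, 0.2],
}
example_ranking = rank_shots(fuse_scores(example_scores))
```

In such a scheme, the face detector and the re-ranking step mentioned above would operate on the fused ranking; they are not modeled in this sketch.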