NII-ISM, Japan at TRECVID 2007: High Level Feature Extraction

This paper reports our experiments on the concept detection task of TRECVID 2007. In these experiments, we have addressed two ap- proaches which are selecting and fusing features and kernel-based learn- ing method. As for the former one, we investigate the following issues: (i) which features are more appropriate for the concept detection task?, (ii) whether the fusion of features can help to improve the final detection per- formance? and (iii) how does the correlation between training and testing sets affect the final performance?. As for the latter one, a combination of global alignment (GA) kernel and penalized logistic regression ma- chine (PLRM) is studied. The experimental results on TRECVID 2007 have shown that the former approach that fuses simple features such as color moments, local binary patterns and edge orientation histogram can achieve high performance. Furthermore, the correlation between the training and testing also plays an important role in generalization of concept detectors.

[1]  Jitendra Malik,et al.  Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons , 2001, International Journal of Computer Vision.

[2]  B. S. Manjunath,et al.  Unsupervised Segmentation of Color-Texture Regions in Images and Video , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Shih-Fu Chang,et al.  Columbia University’s Baseline Detectors for 374 LSCOM Semantic Visual Concepts , 2007 .

[4]  Tomoko Matsui,et al.  A Kernel for Time Series Based on Global Alignments , 2006, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.