Region-based video content indexing and retrieval

In this paper we propose to compare two region-based approaches to content-based video indexing and retrieval. Namely a comparison of a system using the Earth Mover’s Distance and a system using the Latent Semantic Indexing is provided. Region-based methods allow to keep the local information in a way that reflects the human perception of the content. Thus, they are very attractive to design efficient Content Based Video Retrieval systems. We presented a region based approach using Latent Semantic Indexing (LSI) in previous work. And now we compare performances of our system with a method using the Earth Mover’s Distance that have the property to keep the original features describing regions. This paper shows that LSA performs better on the task of object retrieval despite the quantification process implied.

[1]  Pedro F. Felzenszwalb,et al.  Efficiently computing a good segmentation , 1998 .

[2]  Mikko Kurimo,et al.  Indexing Audio Documents by using Latent Semantic Analysis and SOM , 1999 .

[3]  KanadeTakeo,et al.  Intelligent Access to Digital Video , 1996 .

[4]  Jitendra Malik,et al.  Blobworld: A System for Region-Based Image Indexing and Retrieval , 1999, VISUAL.

[5]  Fabrice Souvannavong,et al.  Enhancing latent semantic analysis video object retrieval with structural information , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[6]  Christos Faloutsos,et al.  Efficient and effective Querying by Image Content , 1994, Journal of Intelligent Information Systems.

[7]  Alex Pentland,et al.  Photobook: Content-based manipulation of image databases , 1996, International Journal of Computer Vision.

[8]  Edoardo Ardizzone,et al.  Automatic Video Database Indexing and Retrieval , 2004, Multimedia Tools and Applications.

[9]  Fabrice Souvannavong,et al.  Latent semantic analysis for an effective region-based video shot retrieval system , 2004, MIR '04.

[10]  Leonidas J. Guibas,et al.  A metric for distributions with applications to image databases , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[11]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[12]  William I. Grosky,et al.  From features to semantics: some preliminary results , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[13]  James Ze Wang,et al.  SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Brendan J. Frey,et al.  Probabilistic multimedia objects (multijects): a novel approach to video indexing and retrieval in multimedia systems , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[15]  Bo Zhang,et al.  An effective region-based image retrieval framework , 2002, MULTIMEDIA '02.

[16]  Wei-Ying Ma,et al.  Benchmarking of image features for content-based retrieval , 1998, Conference Record of Thirty-Second Asilomar Conference on Signals, Systems and Computers (Cat. No.98CH36284).

[17]  Fabrice Souvannavong,et al.  Video content modeling with latent semantic analysis , 2003 .

[18]  Takeo Kanade,et al.  elligent Access Video: formedia Project , 1996 .

[19]  Shih-Fu Chang,et al.  A fully automated content-based video search engine supporting spatiotemporal queries , 1998, IEEE Trans. Circuits Syst. Video Technol..

[20]  Joo-Hwee Lim Learning visual keywords for content-based retrieval , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.