Content Based Video Retrieval Using Spatiotemporal Salient Objects

In this paper, we propose a spatiotemporal salient objects-based approach for video retrieval. The spatiotemporal salient object is defined as the region sequence which is spatial salient and temporal continuous at the same time. As attention analysis is an effective mechanism for salient information selection, it provides a practical approach to narrow the semantic gap. Most existing methods extract salient region from static image, while little work has been done on the temporal consistency of salient region in dynamic image sequences, which is very important for content based video retrieval. We propose a fusion method for motion and spatial saliency integration to detect spatiotemporal salient objects, based on both attention analysis and interest point trajectory tracking. A novel description of trajectories’ spatial context is also presented in this paper. Experiment results demonstrate the advantages of the proposed method in video content analysis and retrieval.

[1]  Mubarak Shah,et al.  Content based video matching using spatiotemporal volumes , 2008, Comput. Vis. Image Underst..

[2]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[3]  Fred Stentiford,et al.  Video sequence matching based on temporal ordinal measurement , 2008, Pattern Recognit. Lett..

[4]  King Ngi Ngan,et al.  Unsupervised extraction of visual attention objects in color images , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Michael Isard,et al.  Bundling features for large scale partial-duplicate web image search , 2009, CVPR.

[6]  HongJiang Zhang,et al.  Contrast-based image attention analysis by using fuzzy growing , 2003, MULTIMEDIA '03.

[7]  Keiichiro Hoashi,et al.  Shot Boundary Determination on MPEC Compressed Domain and Story Segmentation Experiments for TRECVID 2003 , 2003, TRECVID.

[8]  Tom Drummond,et al.  Robust feature matching in 2.3µs , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[9]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[10]  Stella X. Yu,et al.  Linear solution to scale and rotation invariant object matching , 2009, CVPR.

[11]  Matthijs C. Dorst,et al.  Abstract: Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[12]  Christof Koch,et al.  Visual attention and target detection in cluttered natural scenes , 2001 .