Using video objects and relevance feedback in video retrieval

Video retrieval is mostly based on using text from dialogue and this remains the most significant component, despite progress in other aspects. One problem with this is when a searcher wants to locate video based on what is appearing in the video rather than what is being spoken about. Alternatives such as automatically-detected features and image-based keyframe matching can be used, though these still need further improvement in quality. One other modality for video retrieval is based on segmenting objects from video and allowing endusers to use these as part of querying. This uses similarity between query objects and objects from video, and in theory allows retrieval based on what is actually appearing on-screen. The main hurdles to greater use of this are the overhead of object segmentation on large amounts of video and the issue of whether we can actually achieve effective object-based retrieval. We describe a system to support object-based video retrieval where a user selects example video objects as part of the query. During a search a user builds up a set of these which are matched against objects previously segmented from a video library. This match is based on MPEG-7 Dominant Colour, Shape Compaction and Texture Browsing descriptors. We use a user-driven semi-automated segmentation process to segment the video archive which is very accurate and is faster than conventional video annotation.

[1]  Noel E. O'Connor,et al.  QIMERA: A SOFTWARE PLATFORM FOR VIDEO OBJECT SEGMENTATION AND TRACKING , 2003 .

[2]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[3]  Andrew Zisserman,et al.  Efficient object retrieval from videos , 2004, 2004 12th European Signal Processing Conference.

[4]  John R. Smith,et al.  MPEG-7 multimedia description schemes , 2001, IEEE Trans. Circuits Syst. Video Technol..

[5]  B. S. Manjunath,et al.  Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[6]  Faouzi Kossentini,et al.  Shape-based retrieval of video objects , 2005, IEEE Transactions on Multimedia.

[7]  Katsumi Tanaka,et al.  OVID: Design and Implementation of a Video-Object Database System , 1993, IEEE Trans. Knowl. Data Eng..

[8]  Mark Smith,et al.  An object-based approach for digital video retrieval , 2004, International Conference on Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004..

[9]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Myungcheol Lee,et al.  Graph theory for image analysis: an approach based on the shortest spanning tree , 1986 .

[11]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[12]  Narendra Ahuja,et al.  Motion based retrieval of dynamic objects in videos , 2004, MULTIMEDIA '04.

[13]  Fabrice Souvannavong,et al.  Enhancing latent semantic analysis video object retrieval with structural information , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[14]  Konstantinos N. Plataniotis,et al.  Query feedback for interactive image retrieval , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Patrice Aknin,et al.  Comparison of Supervised Self-Organizing Maps Using Euclidian or Mahalanobis Distance in Classification Context , 2001, IWANN.