Object Re-detection Using SIFT and MPEG-7 Color Descriptors

Information about the occurrence of objects in videos and their interactions conveys an important part of the semantics of audio-visual content and can be used to narrow the semantic gap in video analysis, retrieval and summarization. Object re-detection aims at finding occurrences of specific objects in a single video or in a collection of still images and videos. Because it is an object identification problem, it can be solved more satisfactorily than the general object recognition problem. As structural information and color information are often complementary, we propose a combined object re-detection approach that uses SIFT and MPEG-7 color descriptors extracted around the same interest points. We evaluate the approach on two different data sets and show that the MPEG-7 ColorLayout descriptor performs best among the tested color descriptors, and that the joint approach yields better results than either SIFT or color descriptors alone.
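To illustrate why pairing a structural and a color descriptor at the same interest point can help, the following is a minimal sketch, not the method evaluated in the paper. It assumes each interest point carries two precomputed vectors (a SIFT-like descriptor and a color descriptor) that have already been scaled to comparable ranges. The weighted-sum fusion, the names `joint_distance` and `match_points`, and the parameters `w_color` and `ratio` are all hypothetical choices made for illustration; the nearest-neighbor ratio test follows the matching heuristic commonly used with SIFT.

```python
import math


def l2(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def joint_distance(p, q, w_color=0.5):
    """Weighted sum of structural and color distances (assumed fusion rule).

    p and q are (structure_vec, color_vec) pairs describing one interest
    point each; w_color in [0, 1] sets the influence of the color part.
    """
    return (1.0 - w_color) * l2(p[0], q[0]) + w_color * l2(p[1], q[1])


def match_points(query, candidates, w_color=0.5, ratio=0.8):
    """Match each query point to its nearest candidate under joint_distance.

    A match is kept only if the best distance is clearly smaller than the
    second best (ratio test), so ambiguous points are discarded.
    Returns a list of (query_index, candidate_index) pairs.
    """
    matches = []
    for i, p in enumerate(query):
        dists = sorted(
            (joint_distance(p, q, w_color), j) for j, q in enumerate(candidates)
        )
        if len(dists) >= 2 and dists[0][0] < ratio * dists[1][0]:
            matches.append((i, dists[0][1]))
    return matches


# Toy data: both candidates have the same structure vector, only color differs.
query = [([1.0, 0.0], [0.1, 0.1])]
candidates = [([1.0, 0.0], [0.9, 0.9]), ([1.0, 0.0], [0.1, 0.1])]

with_color = match_points(query, candidates, w_color=0.5)
structure_only = match_points(query, candidates, w_color=0.0)
```

In this toy setup the structure-only matcher cannot separate the two candidates and rejects the point as ambiguous, while the joint distance selects the candidate with the matching color. Real MPEG-7 color descriptors typically come with their own distance measures, so the plain Euclidean distance used here is a simplification.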
