Combined Segmentation and Visual Attention for Object Categorization and Video Semantic Concepts Detection

Recent researches show that the benefits of image segmentation have been exploited in object categorization and recognition approaches. In most of these works, objects are segmented from the background around to increase recognition accuracy. However, it is generally hard to find a segmentation that captures all correct object boundaries in images of real world scene. So some researches begin to choose several segmentations for representing the objects and performing object categorization. In this paper, we take advantage of an efficient graph-based algorithm for image segmentation, and combine a visual attention model to locate the salient and effective segmentations in a real world image. We propose a model which extends the bag-of-features method for modeling the semantic objects. We evaluate our approach on two experiments: multiclass categorization in Caltech 101 datasets and high-level features extraction in video datasets of TRECVID2007. The results show that combining segmentation and visual attention makes our model achieve competitive performance.

[1]  Ralph Gross,et al.  Concurrent Object Recognition and Segmentation by Graph Partitioning , 2002, NIPS.

[2]  Ming-Hsuan Yang,et al.  Gender classification with support vector machines , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[3]  Andrea Vedaldi,et al.  Objects in Context , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[4]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[5]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[6]  Alexei A. Efros,et al.  Using Multiple Segmentations to Discover Objects and their Extent in Image Collections , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[7]  Thomas Oskam Exploiting Low Level Image Segmentation for Object Recognition , 2007 .

[8]  Joachim M. Buhmann,et al.  Model Order Selection and Cue Combination for Image Segmentation , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[9]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[10]  Andrew Zisserman,et al.  A Visual Vocabulary for Flower Classification , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[11]  Pietro Perona,et al.  One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.