Visual object categorization based on the fusion of region and local features

This paper presents a novel approach for visual object categorization using region based features and statistical measures based image modeling. Our region-based features are extracted from coarse regions obtained by the Gestalt theory inspired region segmentation algorithm and they capture visually significant information such as segments and colors. The modeling of the visual content of an image relies upon some statistical measures over sparse region-based features, thus avoiding the major difficulty of the popular “bag-of-local features” approach which needs to fix a visual vocabulary size. Several classification schemes, including feature selection techniques (e.g. PCA or Adaboost) and fusion strategies, are also implemented and compared. Experimented on a subset of Pascal VOC dataset, we show that by separating features extracted from different sources in different “channels”, and then to combine them using an early fusion, we can actually improve classification performance. Moreover, experimental results demonstrate that our region-based features can be combined with SIFT features to reinforce performance, suggesting that our features managed to extract information which is complementary to the one of SIFT features.

[1]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Max Wertheimer,et al.  Untersuchungen zur Lehre von der Gestalt , .

[3]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[4]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[5]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[6]  L. Chen,et al.  Coarse adaptive color image segmentation for visual object classification , 2008, 2008 15th International Conference on Systems, Signals and Image Processing.

[7]  Pietro Perona,et al.  A Bayesian hierarchical model for learning natural scene categories , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[8]  Linlin Shen,et al.  AdaBoost Gabor Feature Selection for Classification , 2004 .

[9]  Jean-Michel Morel,et al.  From Gestalt Theory to Image Analysis: A Probabilistic Approach , 2007 .

[10]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[11]  Tony Lindeberg,et al.  Feature Detection with Automatic Scale Selection , 1998, International Journal of Computer Vision.

[12]  David A. Forsyth,et al.  The effects of segmentation and feature choice in a translation model of object recognition , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[13]  Karl Pearson F.R.S. LIII. On lines and planes of closest fit to systems of points in space , 1901 .

[14]  Luc Van Gool,et al.  The 2005 PASCAL Visual Object Classes Challenge , 2005, MLCW.

[15]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[16]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[17]  Hsuan-Tien Lin A Study on Sigmoid Kernels for SVM and the Training of non-PSD Kernels by SMO-type Methods , 2005 .

[18]  Markus A. Stricker,et al.  Similarity of color images , 1995, Electronic Imaging.

[19]  D. Navon Forest before trees: The precedence of global features in visual perception , 1977, Cognitive Psychology.

[20]  Richard Bellman,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[21]  Liming Chen,et al.  Line segment based edge feature using Hough transform , 2007 .

[22]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[23]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[24]  Robert E. Schapire,et al.  The Boosting Approach to Machine Learning An Overview , 2003 .

[25]  Alan L. Yuille,et al.  Region Competition: Unifying Snakes, Region Growing, and Bayes/MDL for Multiband Image Segmentation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  B. S. Manjunath,et al.  An efficient color representation for image retrieval , 2001, IEEE Trans. Image Process..

[27]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[28]  I. Jolliffe Principal Component Analysis , 2002 .

[29]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[30]  Gabriela Csurka,et al.  Adapted Vocabularies for Generic Visual Categorization , 2006, ECCV.

[31]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[32]  Chih-Jen Lin,et al.  Asymptotic Behaviors of Support Vector Machines with Gaussian Kernel , 2003, Neural Computation.