Hierarchy of visual features for object recognition

Most approaches for object recognition (OR) use a single feature descriptor to identify the object class from a query image. However, specifically in case of variations in appearance, scale and illumination, the performance of features not only vary depending on the class, but also on the query sample. We propose a biological inspired framework for OR using concepts from feature integration theory (FIT). Our model uses a hierarchy of visual features for OR. The key components in the proposed approach are: (i) SALCUT - unsupervised segmentation for salient object localization; (ii) optimal feature selection - identify appropriate features for each class, at each level of feature hierarchy, for a test instance; (iii) feature combination - which happens at higher levels of feature hierarchy, if features selected at the lower level are unable to classify a test instance. Our method outperforms several state-of-the-art techniques, when validated using two real-world datasets.

[1]  Joachim Denzler,et al.  Boosting colored local features for generic object recognition , 2008, Pattern Recognition and Image Analysis.

[2]  Peter Auer,et al.  Generic object recognition with boosting , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[4]  Sukhendu Das,et al.  A Fast Supervised Method of Feature Ranking and Selection for Pattern Classification , 2009, PReMI.

[5]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[6]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[7]  V. Lollo The feature-binding problem is an ill-posed problem , 2012, Trends in Cognitive Sciences.

[8]  Feiping Nie,et al.  Discriminative Least Squares Regression for Multiclass Classification and Feature Selection , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Joachim Denzler,et al.  Generic Object Recognition Using Boosted Combined Features , 2008, RobVis.

[10]  Gustavo Deco,et al.  Computational neuroscience of vision , 2002 .

[11]  Feiping Nie,et al.  Heterogeneous Visual Features Fusion via Sparse Multimodal Machine , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Shuicheng Yan,et al.  An HOG-LBP human detector with partial occlusion handling , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[13]  Yang Wang,et al.  Kernel Latent SVM for Visual Recognition , 2012, NIPS.

[14]  A Treisman,et al.  Feature binding, attention and object perception. , 1998, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[15]  Matti Pietikäinen,et al.  Face Description with Local Binary Patterns: Application to Face Recognition , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[17]  David G. Lowe,et al.  University of British Columbia. , 1945, Canadian Medical Association journal.

[18]  Tieniu Tan,et al.  Boosted local structured HOG-LBP for object localization , 2011, CVPR 2011.

[19]  Louis K. H. Chan,et al.  Feature integration theory revisited: dissociating feature detection and attentional guidance in visual search. , 2009, Journal of experimental psychology. Human perception and performance.

[20]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[21]  Ze-Nian Li,et al.  Learning image similarities via Probabilistic Feature Matching , 2010, 2010 IEEE International Conference on Image Processing.

[22]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[23]  Frédéric Jurie,et al.  Fast Discriminative Visual Codebooks using Randomized Clustering Forests , 2006, NIPS.

[24]  Semir Zeki,et al.  Feature binding in the feedback layers of area V2. , 2009, Cerebral cortex.

[25]  Antonio Criminisi,et al.  Object categorization by learned universal visual dictionary , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[26]  Sebastian Nowozin,et al.  On feature combination for multiclass object classification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[27]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.