A robust scene descriptor based on largest singular values for cortex-like mechanisms

Analysis and recognition of images observed in different situations are the most important tasks of the visual cortex. Although many studies have been done in the field of computer vision and neuroscience, the underlying processes in the visual cortex is not completely understood. An inspired model of the visual cortex that lately has gained attention for object recognition is HMAX, which describes a feed-forward hierarchical structure. This model shows a degree of scale and translation invariance. Other capabilities of the visual cortex are not that much sensitive against rotation and noise in color space. In this paper, we introduce a novel method to increase the degree of robustness against noise and rotation. We describe a hierarchical system that closely follows the organization of visual cortex and builds an increasingly complex and invariant feature representation with R, G, B and gray channels as inputs. Similar to the recently published methods, the phase of learning is done only in the S2 layer of HMAX structure. While in the proposed model instead of using directly the distance between the patches and the C1 units, the distance between largest singular values of the patches and the C1 units are used. These values behave indeed insensitive significantly with respect to rotation. To this end, we present the experimental results of the COREL datasets and shows that the proposed model have better performance than the previous HMAX models in complex visual scenes.

[1]  Ali Borji,et al.  Scene classification with a sparse set of salient regions , 2011, 2011 IEEE International Conference on Robotics and Automation.

[2]  Zhidong Deng,et al.  C-HMAX: Artificial cognitive model inspired by the color vision mechanism of the human brain , 2013 .

[3]  D H HUBEL,et al.  RECEPTIVE FIELDS AND FUNCTIONAL ARCHITECTURE IN TWO NONSTRIATE VISUAL AREAS (18 AND 19) OF THE CAT. , 1965, Journal of neurophysiology.

[4]  T. Poggio,et al.  How Visual Cortex Recognizes Objects: The Tale of the Standard Model , 2002 .

[5]  N. Logothetis,et al.  View-dependent object recognition by monkeys , 1994, Current Biology.

[6]  Christof Koch,et al.  Attentional Selection for Object Recognition - A Gentle Way , 2002, Biologically Motivated Computer Vision.

[7]  T. Poggio,et al.  Hierarchical models of object recognition in cortex , 1999, Nature Neuroscience.

[8]  Heiko Wersing,et al.  Learning Optimized Features for Hierarchical Models of Invariant Object Recognition , 2003, Neural Computation.

[9]  Thomas Serre,et al.  Robust Object Recognition with Cortex-Like Mechanisms , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[11]  W E Grimson,et al.  A computer implementation of a theory of human stereo vision. , 1981, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[12]  Tomaso Poggio,et al.  Intracellular measurements of spatial integration and the MAX operation in complex cells of the cat primary visual cortex. , 2004, Journal of neurophysiology.

[13]  Bartlett W. Mel SEEMORE: Combining Color, Shape, and Texture Histogramming in a Neurally Inspired Approach to Visual Object Recognition , 1997, Neural Computation.

[14]  J. P. Jones,et al.  An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. , 1987, Journal of neurophysiology.

[15]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[16]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[17]  Thomas Serre,et al.  A Theory of Object Recognition: Computations and Circuits in the Feedforward Path of the Ventral Stream in Primate Visual Cortex , 2005 .

[18]  T. Gawne,et al.  Responses of primate visual cortical neurons to stimuli presented by flash, saccade, blink, and external darkening. , 2002, Journal of neurophysiology.