Object recognition using a bio-inspired neuron model with bottom-up and top-down pathways

In this paper, a new artificial neural network model is proposed for visual object recognition, in which the bottom-up, sensory-driven pathway and top-down, expectation-driven pathway are fused in information processing and their corresponding weights are learned based on the fused neuron activities. During the supervised learning process, the target labels are applied to update the bottom-up synaptic weights of the neural network. Meanwhile, the hypotheses generated by the bottom-up pathway produce expectations on sensory inputs through the top-down pathway. The expectations are constrained by the real data from the sensory inputs, which can be used to update the top-down synaptic weights accordingly. To further improve the visual object recognition performance, the multi-scale histograms of oriented gradients (MS-HOG) method is proposed to extract local features of visual objects from images. Extensive experiments on different image datasets demonstrate the efficiency and robustness of the proposed neural network model with features extracted using the MS-HOG method on visual object recognition compared with other state-of-the-art methods.

[1]  J. Feldman,et al.  Connectionist models and their implications: readings from cognitive science , 1988 .

[2]  Nicu Sebe,et al.  Fast spatial pattern discovery integrating boosting with constellations of contextual descriptors , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[3]  Yan Meng,et al.  Fusing bottom-up and top-down pathways in neural networks for visual object recognition , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[4]  Zhi-Hua Zhou,et al.  Individual Stable Space: An Approach to Face Recognition Under Uncontrolled Conditions , 2008, IEEE Transactions on Neural Networks.

[5]  Andrew E. Johnson,et al.  Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Olac Fuentes,et al.  Object detection using image reconstruction with PCA , 2009, Image Vis. Comput..

[7]  T. Sejnowski,et al.  Book Review: Gain Modulation in the Central Nervous System: Where Behavior, Neurophysiology, and Computation Meet , 2001, The Neuroscientist : a review journal bringing neurobiology, neurology and psychiatry.

[8]  Michael W. Spratling,et al.  A feedback model of perceptual learning and categorization , 2006, Visual Cognition.

[9]  W. Singer,et al.  Dynamic predictions: Oscillations and synchrony in top–down processing , 2001, Nature Reviews Neuroscience.

[10]  S. Kastner,et al.  Interactions of Top-Down and Bottom-Up Mechanisms in Human Visual Cortex , 2011, The Journal of Neuroscience.

[11]  Ah Chung Tsoi,et al.  The Graph Neural Network Model , 2009, IEEE Transactions on Neural Networks.

[12]  Geoffrey E. Hinton,et al.  The Helmholtz Machine , 1995, Neural Computation.

[13]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[14]  Tom Minka,et al.  Principled Hybrids of Generative and Discriminative Models , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[15]  Mauro Ursino,et al.  Recognition of Abstract Objects Via Neural Oscillators: Interaction Among Topological Organization, Associative Memory and Gamma Band Synchronization , 2009, IEEE Transactions on Neural Networks.

[16]  Geraint Rees,et al.  Top–Down Modulation of Human Early Visual Cortex after Stimulus Offset Supports Successful Postcued Report , 2011, Journal of Cognitive Neuroscience.

[17]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[18]  Diane M. Beck,et al.  Top-down and bottom-up mechanisms in biasing competition in the human brain , 2009, Vision Research.

[19]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[20]  Cordelia Schmid,et al.  Human Detection Based on a Probabilistic Assembly of Robust Part Detectors , 2004, ECCV.

[21]  Hong Pan,et al.  Efficient Object Recognition Using Boundary Representation and Wavelet Neural Network , 2008, IEEE Transactions on Neural Networks.

[22]  E. Halgren,et al.  Top-down facilitation of visual recognition. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[24]  Joachim M. Buhmann,et al.  Learning the Compositional Nature of Visual Objects , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  S. Treue Visual attention: the where, what, how and why of saliency , 2003, Current Opinion in Neurobiology.

[26]  Horst Bischof,et al.  Eigenboosting: Combining Discriminative and Generative Information , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[27]  Kunihiko Fukushima,et al.  Neural network model restoring partly occluded patterns , 2003, Int. J. Knowl. Based Intell. Eng. Syst..

[28]  Trevor Darrell,et al.  Fast pose estimation with parameter-sensitive hashing , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[29]  Liming Zhang,et al.  Spatio–Temporal Adaptation in the Unsupervised Development of Networked Visual Neurons , 2009, IEEE Transactions on Neural Networks.

[30]  Jeremy M Wolfe,et al.  Visual Attention , 2020, Computational Models for Cognitive Vision.

[31]  Thai Hoang Le,et al.  An approach to combine AdaBoost and Artificial Neural Network for detecting human faces , 2008, 2008 IEEE International Conference on Systems, Man and Cybernetics.

[32]  J J Hopfield,et al.  Neural networks and physical systems with emergent collective computational abilities. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Jitendra Malik,et al.  SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[34]  Y. LeCun,et al.  Learning methods for generic object recognition with invariance to pose and lighting , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[35]  Xuelong Li,et al.  Enhanced biologically inspired model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Antoniya Georgieva,et al.  Intelligent Visual Recognition and Classification of Cork Tiles With Neural Networks , 2009, IEEE Transactions on Neural Networks.

[37]  Cordelia Schmid,et al.  Learning to Parse Pictures of People , 2002, ECCV.

[38]  Dacheng Tao,et al.  Biologically Inspired Feature Manifold for Scene Classification , 2010, IEEE Transactions on Image Processing.

[39]  J. Duncan,et al.  Competitive brain activity in visual attention , 1997, Current Opinion in Neurobiology.

[40]  Stephen Grossberg,et al.  Competitive Learning: From Interactive Activation to Adaptive Resonance , 1987, Cogn. Sci..

[41]  Thomas Serre,et al.  Object recognition with features inspired by visual cortex , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[42]  Matti Pietikäinen,et al.  Multiresolution Gray-Scale and Rotation Invariant Texture Classification with Local Binary Patterns , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  Pietro Perona,et al.  Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[44]  Gang Wang,et al.  Using Dependent Regions for Object Categorization in a Generative Framework , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[45]  Sergio Escalera,et al.  Boosted Landmarks of Contextual Descriptors and Forest-ECOC: A novel framework to detect and classify objects in cluttered scenes , 2007, Pattern Recognit. Lett..