Top-down attention based on object representation and incremental memory for knowledge building and inference

Humans can efficiently perceive arbitrary visual objects based on an incremental learning mechanism with selective attention. This paper proposes a new task specific top-down attention model to locate a target object based on its form and color representation along with a bottom-up saliency based on relativity of primitive visual features and some memory modules. In the proposed model top-down bias signals corresponding to the target form and color features are generated, which draw the preferential attention to the desired object by the proposed selective attention model in concomitance with the bottom-up saliency process. The object form and color representation and memory modules have an incremental learning mechanism together with a proper object feature representation scheme. The proposed model includes a Growing Fuzzy Topology Adaptive Resonance Theory (GFTART) network which plays two important roles in object color and form biased attention; one is to incrementally learn and memorize color and form features of various objects, and the other is to generate a top-down bias signal to localize a target object by focusing on the candidate local areas. Moreover, the GFTART network can be utilized for knowledge inference which enables the perception of new unknown objects on the basis of the object form and color features stored in the memory during training. Experimental results show that the proposed model is successful in focusing on the specified target objects, in addition to the incremental representation and memorization of various objects in natural scenes. In addition, the proposed model properly infers new unknown objects based on the form and color features of previously trained objects.

[1]  David M. Skapura,et al.  Neural networks - algorithms, applications, and programming techniques , 1991, Computation and neural systems series.

[2]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[3]  Timothée Masquelier,et al.  Unsupervised Learning of Visual Features through Spike Timing Dependent Plasticity , 2007, PLoS Comput. Biol..

[4]  B L McNaughton,et al.  Brain growth and the cognitive map. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Stephen Grossberg,et al.  Fuzzy ART: Fast stable learning and categorization of analog patterns by an adaptive resonance system , 1991, Neural Networks.

[6]  Yoshifumi Nishio,et al.  Fuzzy Adaptive Resonance Theory Combining Overlapped Category in consideration of connections , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[7]  Minho Lee,et al.  Growing fuzzy topology adaptive resonance theory models with a push-pull learning algorithm , 2011, Neurocomputing.

[8]  Stephen R. Marsland,et al.  A self-organising network that grows when required , 2002, Neural Networks.

[9]  T Allison,et al.  Contextual guidance of attention: human intracranial event-related potential evidence for feedback modulation in anatomically early temporally late stages of visual processing. , 2001, Brain : a journal of neurology.

[10]  Antonio Torralba,et al.  Contextual guidance of eye movements and attention in real-world scenes: the role of global features in object search. , 2006, Psychological review.

[11]  R. Desimone,et al.  Neural mechanisms of selective visual attention. , 1995, Annual review of neuroscience.

[12]  Fahad Shahbaz Khan,et al.  Top-down color attention for object recognition , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[13]  Michael Brady,et al.  Saliency, Scale and Image Description , 2001, International Journal of Computer Vision.

[14]  Laurent Itti,et al.  An Integrated Model of Top-Down and Bottom-Up Attention for Optimizing Detection Speed , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[15]  Minho Lee,et al.  Biologically Motivated Incremental Object Perception Based on Selective Attention , 2007, Int. J. Pattern Recognit. Artif. Intell..

[16]  Bala Srinivasan,et al.  Dynamic self-organizing maps with controlled growth for knowledge discovery , 2000, IEEE Trans. Neural Networks Learn. Syst..

[17]  Minho Lee,et al.  Stereo saliency map considering affective factors and selective motion analysis in a dynamic environment , 2008, Neural Networks.

[18]  J. Moran,et al.  Sensation and perception , 1980 .

[19]  Bernd Fritzke,et al.  A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[20]  ‫ﻣﺒﺘﻨ‬ ‫ﺍﻟﮕﻮﺭﻳﺘﻤﻲ‬ ‫ﻲ‬ ‫ﺍﺗﻮﻣﺎﺗ‬ ‫ﺑﺮ‬ ‫ﺎ‬ ‫ﻫﺎ‬ ‫ﻱ‬ ‫ﻳ‬ ‫ﺎﺩﮔ‬ ‫ﻴ‬ ‫ﺑﺮﺍ‬ ‫ﺮ‬ ‫ﻱ‬ ‫ﺗﻨﻈﻴﻢ‬ ‫ﺩﺭ‬ ‫ﻣﺮﺍﻗﺒﺖ‬ ‫ﭘﺎﺭﺍﻣﺘﺮ‬ ‫ﺷﺒﻜﻪ‬ Fuzzy Artmap ‫ﺍﻧﺠﻴﺪﻧﻲ‬ ‫ﻣﺠﻴﺪ‬ , 2022 .

[21]  J. F. Kalaska,et al.  Attention in hierarchical models of object recognition , 2007 .

[22]  Stephen Grossberg,et al.  Competitive Learning: From Interactive Activation to Adaptive Resonance , 1987, Cogn. Sci..

[23]  Sang-Woo Ban,et al.  Biologically Motivated Vergence Control System Based on Stereo Saliency Map Model , 2007 .

[24]  Minho Lee,et al.  Saliency map model with adaptive masking based on independent component analysis , 2002, Neurocomputing.

[25]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[26]  Zhaoping Li,et al.  Computational Design and Nonlinear Dynamics of a Recurrent Network Model of the Primary Visual Cortex , 2001, Neural Computation.

[27]  Kunihiko Fukushima,et al.  Use of non-uniform spatial blur for image comparison: symmetry axis extraction , 2005, Neural Networks.

[28]  Shaun P. Vecera,et al.  Toward a Biased Competition Account of Object-Based Segregation and Attention , 2000 .

[29]  T. Poggio,et al.  Hierarchical models of object recognition in cortex September 23 , 1999 , 1999 .

[30]  Minho Lee,et al.  Top-Down Object Color Biased Attention Using Growing Fuzzy Topology ART , 2008, IDEAL.

[31]  Minho Lee,et al.  Biologically motivated vergence control system using human-like selective attention model , 2006, Neurocomputing.

[32]  Thomas Serre,et al.  Object recognition with features inspired by visual cortex , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[33]  Christof Koch,et al.  Modeling attention to salient proto-objects , 2006, Neural Networks.

[34]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[35]  Bernd Fritzke,et al.  Growing cell structures--A self-organizing network for unsupervised and supervised learning , 1994, Neural Networks.