Where-What Network 5: Dealing with scales for objects in complex backgrounds

The biologically-inspired developmental Where-What Networks (WWN) are general purpose visuomotor networks for detecting and recognizing objects from complex backgrounds, modeling the dorsal and ventral streams of the biological visual cortex. The networks are designed for the attention and recognition problem. The architecture in previous versions were meant for a single scale of foreground. This paper focuses on Where-What Network-5 (WWN-5), the extension for multiple scales. WWN-5 can learn three concepts of an object: type, location and scale.

[1]  E. Rolls,et al.  A Neurodynamical cortical model of visual attention and invariant object recognition , 2004, Vision Research.

[2]  Leslie G. Ungerleider Two cortical visual systems , 1982 .

[3]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[4]  Juyang Weng,et al.  A 5-chunk developmental brain-mind network model for multiple events in complex backgrounds , 2010, The 2010 International Joint Conference on Neural Networks (IJCNN).

[5]  D. V. van Essen,et al.  A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[6]  R. Desimone,et al.  Neural mechanisms of selective visual attention. , 1995, Annual review of neuroscience.

[7]  D. J. Felleman,et al.  Distributed hierarchical processing in the primate cerebral cortex. , 1991, Cerebral cortex.

[8]  Juyang Weng,et al.  A Multilayer in-Place Learning Network for Development of General Invariances , 2007, Int. J. Humanoid Robotics.

[9]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[10]  Juyang Weng,et al.  Where-what network-4: The effect of multiple internal areas , 2010, 2010 IEEE 9th International Conference on Development and Learning.

[11]  Narendra Ahuja,et al.  Learning Recognition and Segmentation Using the Cresceptron , 1997, International Journal of Computer Vision.

[12]  Juyang Weng,et al.  Dually Optimal Neuronal Layers: Lobe Component Analysis , 2009, IEEE Transactions on Autonomous Mental Development.

[13]  I. Rybak,et al.  A model of attention-guided visual perception and recognition , 1998, Vision Research.

[14]  D C Van Essen,et al.  Shifter circuits: a computational strategy for dynamic aspects of visual processing. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[15]  G. Deco,et al.  A hierarchical neural system with attentional top–down enhancement of the spatial resolution for object recognition , 2000, Vision Research.

[16]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[17]  Juyang Weng,et al.  Where What Network 3 : Developmental Top-Down Attention with Multiple Meaningful Foregrounds , 2010 .

[18]  Juyang Weng,et al.  Where-what network 1: “Where” and “what” assist each other through top-down connections , 2008, 2008 7th IEEE International Conference on Development and Learning.

[19]  Juyang Weng,et al.  Laterally connected lobe component analysis: Precision and topography , 2009, 2009 IEEE 8th International Conference on Development and Learning.