Incremental Knowledge Representation Based on Visual Selective Attention

Knowledge-based clustering and autonomous mental development remains a high priority research topic, among which the learning techniques of neural networks are used to achieve optimal performance. In this paper, we present a new framework that can automatically generate a relevance map from sensory data that can represent knowledge regarding objects and infer new knowledge about novel objects. The proposed model is based on understating of the visual what pathway in our brain. A bottom-up attention model can selectively decide salient object areas. Color and form features for a selected object are generated by a sparse coding mechanism by a convolution neural network (CNN). Using the extracted features by the CNN as inputs, the incremental knowledge representation model, called the growing fuzzy topology adaptive resonant theory (TART) network, makes clusters for the construction of an ontology map in the color and form domains. The clustered information is relevant to describe specific objects, and the proposed model can automatically infer an unknown object by using the learned information. Experimental results with real data have demonstrated the validity of this approach.

[1]  George Wolberg,et al.  Robust image registration using log-polar transform , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[2]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[3]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[4]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[5]  Robert B. Fisher,et al.  Hierarchical Selectivity for Object-Based Visual Attention , 2002, Biologically Motivated Computer Vision.

[6]  Stephen Grossberg,et al.  Fuzzy ARTMAP: A neural network architecture for incremental supervised learning of analog multidimensional maps , 1992, IEEE Trans. Neural Networks.

[7]  Stephen R. Marsland,et al.  A self-organising network that grows when required , 2002, Neural Networks.

[8]  Li Fei-Fei Knowledge transfer in learning to recognize visual objects classes , 2006 .

[9]  A. Guyton,et al.  Textbook of Medical Physiology , 1961 .

[10]  James L. McClelland,et al.  Autonomous Mental Development by Robots and Animals , 2001, Science.

[11]  Giorgio Metta,et al.  Early integration of vision and manipulation , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[12]  Minho Lee,et al.  Biologically motivated vergence control system using human-like selective attention model , 2006, Neurocomputing.

[13]  Mitchell P. Marcus,et al.  Maximum entropy models for natural language ambiguity resolution , 1998 .

[14]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[15]  Seong-Whan Lee,et al.  Biologically Motivated Computer Vision , 2002, Lecture Notes in Computer Science.

[16]  Brian Scassellati,et al.  Investigating models of social development using a humanoid robot , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[17]  Ah Chung Tsoi,et al.  Face recognition: a convolutional neural-network approach , 1997, IEEE Trans. Neural Networks.

[18]  Laurent Itti,et al.  A Goal Oriented Attention Guidance Model , 2002, Biologically Motivated Computer Vision.

[19]  Christof Koch,et al.  Attentional Selection for Object Recognition - A Gentle Way , 2002, Biologically Motivated Computer Vision.

[20]  J. Nicholls From neuron to brain , 1976 .

[21]  Minho Lee,et al.  Saliency map model with adaptive masking based on independent component analysis , 2002, Neurocomputing.

[22]  Minho Lee,et al.  Incremental Knowledge Representation Model based on Visual Selective Attention , 2005 .