How human visual systems recognize objects - a novel computational model

This paper presents a novel computational model of 3D object recognition based on human visual system. Conventional schemes have feed forward structure based on the bottom-up process of human vision. However, psychological and physiological evidence suggests that top-down process and feature binding by visual attention are also important. Hence, we propose a method to integrate these facts under statistical framework, Markov chain Monte Carlo. In this scheme, object recognition is regarded as parameter optimization problem. The bottom-up process is used to initialize parameters and top-down process is used to optimize them. On both processes, feature map binding is performed by spatial attention mechanism. Experimental results show that the proposed computational model is feasible for 3D object recognition.

[1]  T. Poggio,et al.  Towards trainable man-machine interfaces: combining top-down constraints with bottom-up learning in facial analysis , 2002 .

[2]  Rong Zhang,et al.  Integrating bottom-up/top-down for object recognition by data driven Markov chain Monte Carlo , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[3]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[5]  Konrad P. Körding,et al.  Integrating Top-Down and Bottom-Up Sensory Processing by Somato-Dendritic Interactions , 2004, Journal of Computational Neuroscience.

[6]  William T. Newsome,et al.  erratum The neurobiology of cognition , 2000, Nature.

[7]  Thierry Pun,et al.  Integration of bottom-up and top-down cues for visual attention using non-linear relaxation , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Jean-Michel Morel,et al.  Gestalt Theory and Computer Vision , 2004 .

[9]  Takeo Kanade,et al.  Model-Based Vision By Cooperative Processing Of Evidence And Hypotheses Using Configuration Spaces , 1988, Defense, Security, and Sensing.

[10]  M. Goodale,et al.  The visual brain in action , 1995 .

[11]  A Treisman,et al.  Feature binding, attention and object perception. , 1998, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[12]  W. Newsome,et al.  The neurobiology of cognition , 1999, Nature.

[13]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.