Effects of generalized pooling on binocular disparity selectivity of neurons in the early visual cortex

The key problem of stereoscopic vision is traditionally defined as accurately finding the positional shifts of corresponding object features between left and right images. Here, we demonstrate that the problem must be considered in a four-dimensional parameter space; with respect not only to shifts in space (X, Y), but also spatial frequency (SF) and orientation (OR). The proposed model sums outputs of binocular energy units linearly over the multi-dimensional V1 parameter space (X, Y, SF, OR). Theoretical analyses and physiological experiments show that many binocular neurons achieve sharp binocular tuning properties by pooling the output of multiple neurons with relatively broad tuning. Pooling in the space domain sharpens disparity-selective responses in the SF domain so that the responses to combinations of unmatched left–right SFs are attenuated. Conversely, pooling in the SF domain sharpens disparity selectivity in the space domain, reducing the possibility of false matches. Analogous effects are observed for the OR domain in that the spatial pooling sharpens the binocular tuning in the OR domain. Such neurons become selective to relative OR disparity. Therefore, pooling allows the visual system to refine binocular information into a form more desirable for stereopsis. This article is part of the themed issue ‘Vision in our three-dimensional world’.

[1]  Ha Hong,et al.  Performance-optimized hierarchical models predict neural responses in higher visual cortex , 2014, Proceedings of the National Academy of Sciences.

[2]  P. O. Bishop,et al.  Responses to moving slits by single units in cat striate cortex , 2004, Experimental Brain Research.

[3]  I. Ohzawa,et al.  The binocular organization of simple cells in the cat's visual cortex. , 1986, Journal of neurophysiology.

[4]  A.V. Oppenheim,et al.  The importance of phase in signals , 1980, Proceedings of the IEEE.

[5]  Tai Sing Lee,et al.  Image Representation Using 2D Gabor Wavelets , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Shinji Nishimoto,et al.  Accuracy of subspace mapping of spatiotemporal frequency domain visual receptive fields. , 2005, Journal of neurophysiology.

[7]  C. Wheatstone XVIII. Contributions to the physiology of vision. —Part the first. On some remarkable, and hitherto unobserved, phenomena of binocular vision , 1962, Philosophical Transactions of the Royal Society of London.

[8]  G. Poggio,et al.  Binocular interaction and depth sensitivity in striate and prestriate cortex of behaving rhesus monkey. , 1977, Journal of neurophysiology.

[9]  P. O. Bishop,et al.  Discrimination of orientation and position disparities by binocularly activated neurons in cat straite cortex. , 1977, Journal of neurophysiology.

[10]  I. Ohzawa,et al.  On the neurophysiological organization of binocular vision , 1990, Vision Research.

[11]  Kunihiko Fukushima,et al.  Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position , 1980, Biological Cybernetics.

[12]  I. Ohzawa,et al.  Stereoscopic depth discrimination in the visual cortex: neurons ideally suited as disparity detectors. , 1990, Science.

[13]  耕太 佐々木,et al.  Internal spatial organization of receptive fields of complex cells in the early visual cortex , 2007 .

[14]  I. Ohzawa,et al.  Encoding of binocular disparity by complex cells in the cat's visual cortex. , 1996, Journal of neurophysiology.

[15]  C. Blakemore,et al.  A second neural mechanism of binocular depth discrimination , 1972, The Journal of physiology.

[16]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[17]  I. Ohzawa,et al.  Encoding of binocular disparity by simple cells in the cat's visual cortex. , 1996, Journal of neurophysiology.

[18]  Yann LeCun,et al.  Large-scale Learning with SVM and Convolutional for Generic Object Categorization , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[19]  J. Movshon,et al.  Spatial and temporal contrast sensitivity of neurones in areas 17 and 18 of the cat's visual cortex. , 1978, The Journal of physiology.

[20]  B. Cumming,et al.  Suppressive Mechanisms in Monkey V1 Help to Solve the Stereo Correspondence Problem , 2011, The Journal of Neuroscience.

[21]  Bruce G. Cumming,et al.  Delayed suppression shapes disparity selective responses in monkey V1. , 2014, Journal of Neurophysiology.

[22]  I. Ohzawa,et al.  Neural mechanisms for processing binocular information II. Complex cells. , 1999, Journal of neurophysiology.

[23]  D. Hubel,et al.  Receptive fields of single neurones in the cat's striate cortex , 1959, The Journal of physiology.

[24]  Ning Qian,et al.  Physiological computation of binocular disparity , 1997, Vision Research.

[25]  Izumi Ohzawa,et al.  Complex Cells in the Cat Striate Cortex Have Multiple Disparity Detectors in the Three-Dimensional Binocular Receptive Fields , 2010, The Journal of Neuroscience.

[26]  B. G. Cumming,et al.  Responses of primary visual cortical neurons to binocular disparity without depth perception , 1997, Nature.

[27]  B. Cumming,et al.  Testing quantitative models of binocular disparity selectivity in primary visual cortex. , 2003, Journal of neurophysiology.

[28]  A. Parker,et al.  Quantitative analysis of the responses of V1 neurons to horizontal disparity in dynamic random-dot stereograms. , 2002, Journal of neurophysiology.

[29]  A. Parker,et al.  Modeling V1 neuronal responses to orientation disparity , 2001, Visual Neuroscience.

[30]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[31]  I. Ohzawa,et al.  The binocular organization of complex cells in the cat's visual cortex. , 1986, Journal of neurophysiology.

[32]  Guillermo Sapiro,et al.  A subspace reverse-correlation technique for the study of visual neurons , 1997, Vision Research.

[33]  I. Ohzawa,et al.  Neural mechanisms for processing binocular information I. Simple cells. , 1999, Journal of neurophysiology.

[34]  B G Cumming,et al.  Responses of Macaque V1 Neurons to Binocular Orientation Differences , 2001, The Journal of Neuroscience.

[35]  Gregory C. DeAngelis,et al.  Depth is encoded in the visual cortex by a specialized receptive field structure , 1991, Nature.

[36]  Tomaso Poggio,et al.  Intracellular measurements of spatial integration and the MAX operation in complex cells of the cat primary visual cortex. , 2004, Journal of neurophysiology.

[37]  B. Julesz Binocular depth perception of computer-generated patterns , 1960 .

[38]  D. Ferster,et al.  Computational Diversity in Complex Cells of Cat Primary Visual Cortex , 2007, The Journal of Neuroscience.

[39]  Bruce G Cumming,et al.  A simple model accounts for the response of disparity-tuned V1 neurons to anticorrelated images , 2002, Visual Neuroscience.

[40]  I. Ohzawa,et al.  Integration of Multiple Spatial Frequency Channels in Disparity-Sensitive Neurons in the Primary Visual Cortex , 2015, The Journal of Neuroscience.

[41]  David J. Fleet,et al.  Neural encoding of binocular disparity: Energy models, position shifts and phase shifts , 1996, Vision Research.

[42]  A. B. Bonds,et al.  Classifying simple and complex cells on the basis of response modulation , 1991, Vision Research.

[43]  C. Blakemore,et al.  The neural mechanism of binocular depth discrimination , 1967, The Journal of physiology.

[44]  Takahisa M. Sanada,et al.  Encoding of three-dimensional surface slant in cat visual areas 17 and 18. , 2006, Journal of neurophysiology.