Self-Supervised Learning in Cooperative Stereo Vision Correspondence

This paper presents a neural network model of stereoscopic vision, in which a process of fusion seeks the correspondence between points of stereo inputs. Stereo fusion is obtained after a self-supervised learning phase, so called because the learning rule is a supervised-learning rule in which the supervisory information is autonomously extracted from the visual inputs by the model. This supervisory information arises from a global property of the potential matches between the points. The proposed neural network, which is of the cooperative type, and the learning procedure, are tested with random-dot stereograms (RDS) and feature points extracted from real-world images. Those feature points are extracted by a technique based on the use of sigma-pi units. The matching performance and the generalization ability of the model are quantified. The relationship between what have been learned by the network and the constraints used in previous cooperative models of stereo vision, is discussed.

[1]  Alireza Khotanzad,et al.  Stereopsis by constraint learning feed-forward neural networks , 1993, IEEE Trans. Neural Networks.

[2]  Jake K. Aggarwal,et al.  Structure from stereo-a review , 1989, IEEE Trans. Syst. Man Cybern..

[3]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[4]  P. O. Bishop,et al.  Binocular interaction fields of single units in the cat striate cortex , 1971, The Journal of physiology.

[5]  Stephen Grossberg,et al.  Stereo boundary fusion by cortical complex cells: A system of maps, filters, and feedback networks for multiplexing distributed data , 1989, Neural Networks.

[6]  T. Poggio,et al.  A computational theory of human stereo vision , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[7]  A J O'Toole Structure from Stereo by Associative Learning of the Constraints , 1989, Perception.

[8]  D Marr,et al.  Cooperative computation of stereo disparity. , 1976, Science.

[9]  J I Nelson,et al.  Globality and stereoscopic fusion in binocular vision. , 1975, Journal of theoretical biology.

[10]  Parvati Dev,et al.  Perception of Depth Surfaces in Random-Dot Stereograms: A Neural Model , 1975, Int. J. Man Mach. Stud..

[11]  Jae Chang Shim,et al.  Stereo correspondence using the Hopfield neural network of a new energy function , 1994, Pattern Recognit..

[12]  B. Julesz Early vision and focal attention , 1991 .

[13]  John E. W. Mayhew,et al.  Psychophysical and Computational Studies Towards a Theory of Human Stereopsis , 1981, Artif. Intell..

[14]  W. Eric L. Grimson,et al.  Computational Experiments with a Feature Based Stereo Algorithm , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  J P Frisby,et al.  PMF: A Stereo Correspondence Algorithm Using a Disparity Gradient Limit , 1985, Perception.

[16]  H. Wilson,et al.  Neural models of stereoscopic vision , 1991, Trends in Neurosciences.

[17]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[18]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[19]  Geoffrey E. Hinton,et al.  Self-organizing neural network that discovers surfaces in random-dot stereograms , 1992, Nature.

[20]  Bernard Widrow,et al.  Punish/Reward: Learning with a Critic in Adaptive Threshold Systems , 1973, IEEE Trans. Syst. Man Cybern..