ANN Implementation of Stereo Vision Using a Multi-Layer Feedback Architecture

An artificial neural network (ANN), consisting of three interacting neural modules, is developed for stereo vision. The first module locates sharp intensity changes in each of the images. The edge detection process is basically a bottom-up, one-to-one input-output mapping process with a network structure which is time-invariant. In the second module, a multilayered connectionist network is used to extract the features or primitives For disparity analysis (matching). A similarity measure is defined and computed for each pair of primitive matches and is passed to the third module. The third module solves the difficult correspondence problem by mapping it into a constraint satisfaction problem. Intra- and inter-scanline constraints are used in order to restrict possible feature matches. The inter-scanline constraints are implemented via interconnections of a three-dimensional neural network. The overall process is iterative. At the end of each network iteration, the output of the third constraint satisfaction module feeds back updated information on matching pairs as well as their corresponding location in the left and right images to the input of the second module. This iterative process continues until the output of the third module converges to a stable state. Once the matching process is completed, the disparity can be calculated, and camera calibration parameters can be used to find the three-dimensional location of object points. Results using this computational architecture are shown. >

[1]  T. Poggio,et al.  A computational theory of human stereo vision , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[2]  E. R. Haddow,et al.  A parallel binocular stereo algorithm utilizing dynamic programming and relaxation labelling , 1987 .

[3]  Takeo Kanade,et al.  Stereo by Intra- and Inter-Scanline Search Using Dynamic Programming , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  T. O. Binford,et al.  Geometric Constraints In Stereo Vision , 1980, Optics & Photonics.

[5]  Ernest L. Hall,et al.  Matching of featured objects using relational tables from stereo images , 1982, Comput. Graph. Image Process..

[6]  J. Mayhew,et al.  Vergence Eye Movements Made in Response to Spatial-Frequency-Filtered Random-Dot Stereograms , 1981, Perception.

[7]  William B. Thompson,et al.  Disparity Analysis of Images , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  R. J. Schalkoff,et al.  A parallel distributed algorithm for feature extraction and disparity analysis of computer images , 1990, Proceedings of the Second IEEE Symposium on Parallel and Distributed Processing 1990.

[9]  Thomas O. Binford,et al.  Depth from Edge and Intensity Based Stereo , 1981, IJCAI.

[10]  Nasser M. Nasrabadi,et al.  Hopfield network for stereo vision correspondence , 1992, IEEE Trans. Neural Networks.

[11]  John E. W. Mayhew,et al.  Psychophysical and Computational Studies Towards a Theory of Human Stereopsis , 1981, Artif. Intell..

[12]  W. Eric L. Grimson,et al.  Computational Experiments with a Feature Based Stereo Algorithm , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Martin A. Fischler,et al.  Computational Stereo , 1982, CSUR.

[14]  Rama Chellappa,et al.  Stereo matching using a neural network , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[15]  Robert J. Schalkoff,et al.  Digital Image Processing and Computer Vision , 1989 .

[16]  Tomaso Poggio,et al.  A Theory of Human Stereo Vision , 1977 .

[17]  Ramakant Nevatia,et al.  Segment-based stereo matching , 1985, Comput. Vis. Graph. Image Process..

[18]  J J Hopfield,et al.  Neurons with graded response have collective computational properties like those of two-state neurons. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[19]  R. J. Schalkoff,et al.  A neural network approach for stereo vision , 1990, IEEE Proceedings on Southeastcon.

[20]  Madjid S Mousavi COMPUTER STEREO VISION FOR 3-D OBJECT LOCATION USING A NEURAL NETWORK ARCHITECTURE , 1990 .

[21]  Robert J. Schalkoff,et al.  Image labelling: a neural network approach , 1988, Image Vis. Comput..

[22]  John Frisby,et al.  The role of spatial frequency tuned channels in vergence control , 1980, Vision Research.

[23]  Robert J. Schalkoff,et al.  Stereo vision: a neural network application to constraint satisfaction problem , 1991, Other Conferences.