Visual information processing: the structure and creation of visual representations.

For human vision to be explained by a computational theory, the first question is plain: What are the problems that the brain solves when we see? It is argued that vision is the construction of efficient symbolic descriptions from images of the world. An important aspect of vision is therefore the choice of representations for the different kinds of information in a visual scene. An overall framework is suggested for extracting shape information from images, in which the analysis proceeds through three representations: (1) the primal sketch, which makes explicit the intensity changes and local two-dimensional geometry of an image; (2) the 2½-D sketch, which is a viewer-centred representation of the depth, orientation and discontinuities of the visible surfaces; and (3) the 3-D model representation, which allows an object-centred description of the three-dimensional structure and organization of a viewed shape. The critical act in formulating computational theories for processes capable of constructing these representations is the discovery of valid constraints on the way the world behaves that provide sufficient additional information to allow recovery of the desired characteristic. Finally, once a computational theory for a process has been formulated, algorithms for implementing it may be designed, and their performance compared with that of the human visual processor.
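As a rough illustration of what "making intensity changes explicit" can mean, the following Python sketch turns a one-dimensional intensity profile into a short symbolic list of changes. It is not taken from the abstract, which specifies no algorithm. The smoothing scale `sigma`, the threshold `min_slope`, the function names, and the choice of marking changes at sign changes of the smoothed second difference are all assumptions made for this example.

```python
import math


def gaussian_kernel(sigma, radius):
    """Normalized discrete Gaussian weights for offsets -radius..radius."""
    weights = [math.exp(-(x * x) / (2.0 * sigma * sigma))
               for x in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def smooth(signal, kernel):
    """Convolve signal with kernel, clamping indices at the borders."""
    radius = len(kernel) // 2
    n = len(signal)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in enumerate(kernel):
            j = min(max(i + k - radius, 0), n - 1)
            acc += w * signal[j]
        out.append(acc)
    return out


def intensity_changes(signal, sigma=1.0, min_slope=1.0):
    """Return a symbolic description of intensity changes in a 1-D profile.

    Hypothetical helper: a change is reported wherever the second
    difference of the smoothed profile changes sign and the local slope
    is at least min_slope in magnitude.
    """
    kernel = gaussian_kernel(sigma, radius=int(3 * sigma) + 1)
    s = smooth(signal, kernel)
    # second[k] is the second difference centred on sample k + 1.
    second = [s[i - 1] - 2.0 * s[i] + s[i + 1] for i in range(1, len(s) - 1)]
    changes = []
    for k in range(len(second) - 1):
        if second[k] * second[k + 1] < 0:
            # Slope of the smoothed profile between samples k+1 and k+2.
            slope = s[k + 2] - s[k + 1]
            if abs(slope) >= min_slope:
                changes.append({"position": k + 1.5, "contrast": slope})
    return changes


# A dark-to-bright step between samples 9 and 10.
profile = [10.0] * 10 + [50.0] * 10
description = intensity_changes(profile)
```

For the step profile above, `description` holds a single entry whose `position` lies between samples 9 and 10 and whose `contrast` is positive (dark to bright). Twenty raw intensity values are thereby reduced to one symbolic assertion about where the image changes and in which direction.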
