Visual perception an information-based approach to understanding biological and artificial vision

The central issues of this dissertation are (a) what should we be doing — what problems should we be trying to solve — in order to build computer vision systems, and (b) what relevance biological vision has to the solution of these problems. The approach taken to tackle these issues centres mostly on the clarification and use of information-based ideas, and an investigation into the nature of the processes underlying perception. The primary objective is to demonstrate that information theory and extensions of it, and measurement theory are powerful tools in helping to find solutions to these problems. The quantitative meaning of information is examined, from its origins in physical theories, through Shannon information theory, Gabor representations and codes towards semantic interpretations of the term. Also the application of information theory to the understanding of the developmental and functional properties of biological visual systems is discussed. This includes a review of the current state of knowledge of the architecture and function of the early visual pathways, particularly the retina, and a discussion of the possible coding functions of cortical neurons. The nature of perception is discussed from a number of points of view: the types and function of explanation of perceptual systems and how these relate to the operation of the system; the role of the observer in describing perceptual functions in other systems or organisms; the status and role of objectivist and representational viewpoints in understanding vision; the philosophical basis of perception; the relationship between pattern recognition and perception, and the interpretation of perception in terms of a theory of measurement These two threads of research, information theory and measurement theory are brought together in an overview and reinterpretation of the cortical role in mammalian vision. Finally the application of some of the coding and recognition concepts to industrial inspection problems are described. The nature of the coding processes used are unusual in that coded images are used as the input for a simple neural network classifier, rather than a heuristic feature set The relationship between the Karhunen-Loeve transform and the singular value decomposition is clarified as background the coding technique used to code the images. This coding technique has also been used to code long sequences of moving images to investigate the possibilities of recognition of people on the basis of their gait or posture and this application is briefly described.

[1]  Paul G. Roetling Visual Performance And Image Coding , 1976, Other Conferences.

[2]  H. Barlow Critical limiting factors in the design of the eye and visual cortex , 1981 .

[3]  Robert F. Miller Are single retinal neurons both excitatory and inhibitory? , 1988, Nature.

[4]  D. Perkel Logical neurons: the enigmatic legacy of Warren McCulloch , 1988, Trends in Neurosciences.

[5]  D. Williams,et al.  Consequences of spatial sampling by a human photoreceptor mosaic. , 1983, Science.

[6]  D. Slepian,et al.  Prolate spheroidal wave functions, fourier analysis and uncertainty — II , 1961 .

[7]  H. Nyquist,et al.  Certain factors affecting telegraph speed , 1924, Journal of the A.I.E.E..

[8]  T. Wiesel,et al.  Functional architecture of macaque monkey visual cortex , 1977 .

[9]  Peter Földiák,et al.  Adaptation and decorrelation in the cortex , 1989 .

[10]  Ralph Linsker,et al.  Self-organization in a perceptual network , 1988, Computer.

[11]  D Kersten,et al.  Predictability and redundancy of natural images. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[12]  Peter Sterling,et al.  Microcircuitry and functional architecture of the cat retina , 1986, Trends in Neurosciences.

[13]  O. Faugeras Digital color image processing within the framework of a human visual model , 1979 .

[14]  P Bourgine,et al.  Towards a Practice of Autonomous Systems , 1992 .

[15]  R. Linsker From basic network principles to neural architecture (series) , 1986 .

[16]  D. M. MacKay,et al.  Strife over visual cortical function , 1981, Nature.

[17]  F. Varela Whence Perceptual Meaning? A Cartography of Current Ideas , 1992 .

[18]  Ralph Linsker,et al.  How to Generate Ordered Maps by Maximizing the Mutual Information between Input and Output Signals , 1989, Neural Computation.

[19]  W. H. Miller,et al.  Photoreceptor diameter and spacing for highest resolving power. , 1977, Journal of the Optical Society of America.

[20]  D. Hubel,et al.  Segregation of form, color, movement, and depth: anatomy, physiology, and perception. , 1988, Science.

[21]  R. Rosen 3 – Organisms as Causal Systems Which Are Not Mechanisms: An Essay into the Nature of Complexity , 1985 .

[22]  R H Masland,et al.  The functional architecture of the retina. , 1986, Scientific American.

[23]  Cajal and the retina: a 100-year retrospective , 1988, Trends in Neurosciences.

[24]  C. Enroth-Cugell,et al.  The contrast sensitivity of retinal ganglion cells of the cat , 1966, The Journal of physiology.

[25]  D J Field,et al.  Relations between the statistics of natural images and the response properties of cortical cells. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[26]  J. Neumann,et al.  The Logic of Quantum Mechanics , 1936 .

[27]  J. T. Massey,et al.  Mental rotation of the neuronal population vector. , 1989, Science.

[28]  Ralph Linsker,et al.  An Application of the Principle of Maximum Information Preservation to Linear Systems , 1988, NIPS.

[29]  M. Livingstone Art, illusion and the visual system. , 1988, Scientific American.

[30]  Gershon Buchsbaum,et al.  An Analytical Derivation of Visual Nonlinearity , 1980, IEEE Transactions on Biomedical Engineering.

[31]  D. Field,et al.  The structure and symmetry of simple-cell receptive-field profiles in the cat’s visual cortex , 1986, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[32]  H. Wechsler,et al.  Joint spatial/spatial-frequency representation , 1988 .

[33]  Jonathan A. Marshall,et al.  Self-organizing neural networks for perception of visual motion , 1990, Neural Networks.

[34]  J. Mayer,et al.  On the Quantum Correction for Thermodynamic Equilibrium , 1947 .

[35]  Horace Barlow,et al.  Understanding Natural Vision , 1983 .

[36]  H. Barlow The absolute efficiency of perceptual decisions. , 1980, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[37]  H. Wilson,et al.  Modified line-element theory for spatial-frequency and width discrimination. , 1984, Journal of the Optical Society of America. A, Optics and image science.

[38]  L. Cohen,et al.  Time-frequency distributions-a review , 1989, Proc. IEEE.

[39]  Yehoshua Y. Zeevi,et al.  The Generalized Gabor Scheme of Image Representation in Biological and Machine Vision , 1988, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Ernest L. Hall,et al.  A Nonlinear Model for the Spatial Characteristics of the Human Visual System , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[41]  L. Cohen Generalized Phase-Space Distribution Functions , 1966 .

[42]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[43]  T. Sejnowski Neural populations revealed , 1988, Nature.

[44]  Dennis Gabor,et al.  Theory of communication , 1946 .

[45]  Terry Bossomaier,et al.  Why spatial frequency processing in the visual cortex? , 1986, Vision Research.

[46]  S. W. Kuffler Discharge patterns and functional organization of mammalian retina. , 1953, Journal of neurophysiology.

[47]  Jr. Thomas G. Stockham,et al.  Image processing in the context of a visual model , 1972 .

[48]  Tomaso A. Poggio,et al.  On Edge Detection , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49]  C.E. Shannon,et al.  Communication in the Presence of Noise , 1949, Proceedings of the IRE.

[50]  Leonard J. Y. Schulman,et al.  Communication in the presence of noise , 1992 .

[51]  M. Davidson Perturbation approach to spatial brightness interaction in human vision. , 1968, Journal of the Optical Society of America.

[52]  R. Lemon Cognitive control of movement , 1989, Nature.

[53]  H. Nyquist,et al.  Certain Topics in Telegraph Transmission Theory , 1928, Transactions of the American Institute of Electrical Engineers.

[54]  C. Page Instantaneous Power Spectra , 1952 .

[55]  Harry Wechsler,et al.  Segmentation of Textured Images and Gestalt Organization Using Spatial/Spatial-Frequency Representations , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[56]  H. Barlow Three Theories of Cortical Function , 1979 .

[57]  H B Barlow,et al.  Single units and sensation: a neuron doctrine for perceptual psychology? , 1972, Perception.