Distributed representation and analysis of visual motion

The central theme of the thesis is that the failure of image motion algorithms is due primarily to the use of vector fields as a representation for visual motion. We argue that the translational vector field representation is inherently impoverished and error-prone. Furthermore, there is evidence that a direct optical flow representation scheme is not used by biological systems for motion analysis. Instead, we advocate distributed representations of motion, in which the encoding of image plane velocity is implicit. As a simple example of this idea, and in consideration of the errors in the flow vectors, we re-cast the traditional optical flow problem as a probabilistic one, modeling the measurement and constraint errors as random variables. The resulting framework produces probability distributions of optical flow, allowing proper handling of the uncertainties inherent in the optical flow computation, and facilitating the combination with information from other sources. We demonstrate the advantages of this probabilistic approach on a set of examples. In order to overcome the temporal aliasing commonly found in time-sampled imagery (eg, video), we develop a probabilistic "coarse-to-fine" algorithm that functions much like a Kalman filter over scale. We implement an efficient version of this algorithm and show its success in computing Gaussian distributions of optical flow of both synthetic and real image sequences. We then extend the notion of distributed representation to a generalized framework that is capable of representing multiple motions at a point. We develop an example representation through a series of modifications of the differential approach to optical flow estimation. We show that this example is capable of representing multiple motions at a single image location and we demonstrate its use near occlusion boundaries and on simple synthetic examples containing transparent objects. Finally, we show that these distributed representation are effective as models for biological motion representation. We show qualitative comparisons of stages of the algorithm with neurons found in mammalian visual systems, suggesting experiments to test the validity of the model. We demonstrate that such a model can account quantitatively for a set of psychophysical data on the perception of moving sinusoidal plaid patterns. (Copies available exclusively from MIT Libraries, Rm. 14-0551, Cambridge, MA 02139-4307. Ph. 617-253-5668; Fax 617-253-1690.) (Abstract shortened by UMI.)

[1]  D Marr,et al.  Directional selectivity and its use in early visual processing , 1981, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[2]  J. Limb,et al.  Estimating the Velocity of Moving Images in Television Signals , 1975 .

[3]  Eero P. Simoncelli,et al.  A computational model for perception of two-dimensional pattern ve-locities , 1992 .

[4]  Vincent P. Ferrera,et al.  Perceived speed of moving two-dimensional patterns , 1991, Vision Research.

[5]  Mandyam V. Srinivasan,et al.  Measurement of optical flow by a generalized gradient scheme , 1991 .

[6]  D J Heeger,et al.  Model for the extraction of image flow. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[7]  Edward H. Adelson,et al.  Probability distributions of optical flow , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[8]  W. Reichardt,et al.  Autocorrelation, a principle for the evaluation of sensory information by the central nervous system , 1961 .

[9]  J. van Santen,et al.  Temporal covariance model of human motion perception. , 1984, Journal of the Optical Society of America. A, Optics and image science.

[10]  D C Van Essen,et al.  Functional properties of neurons in middle temporal visual area of the macaque monkey. I. Selectivity for stimulus direction, speed, and orientation. , 1983, Journal of neurophysiology.

[11]  T. Albright Direction and orientation selectivity of neurons in visual area MT of the macaque. , 1984, Journal of neurophysiology.

[12]  Ehud Weinstein,et al.  Parameter estimation of superimposed signals using the EM algorithm , 1988, IEEE Trans. Acoust. Speech Signal Process..

[13]  Kenji Mase,et al.  Simultaneous multiple optical flow estimation , 1990, [1990] Proceedings. 10th International Conference on Pattern Recognition.

[14]  Allen M. Waxman,et al.  Contour Evolution, Neighborhood Deformation, and Global Image Flow: Planar Surfaces in Motion , 1985 .

[15]  Michèle Basseville,et al.  Modeling and estimation of multiresolution stochastic processes , 1992, IEEE Trans. Inf. Theory.

[16]  L. Quam Hierarchical warp stereo , 1987 .

[17]  A. Yuille,et al.  A model for the estimate of local image velocity by cells in the visual cortex , 1990, Proceedings of the Royal Society of London. B. Biological Sciences.

[18]  D. G. Albrecht,et al.  Motion selectivity and the contrast-response function of simple cells in the visual cortex , 1991, Visual Neuroscience.

[19]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[20]  Kenji Mase,et al.  Unified computational theory for motion transparency and motion boundaries based on eigenenergy analysis , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[21]  Yuan-Fang Wang,et al.  Experiments in computing optical flow with the gradient-based, multiconstraint method , 1987, Pattern Recognit..

[22]  A. Pentland,et al.  Robust estimation of a multi-layered motion representation , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[23]  R. Hetherington The Perception of the Visual World , 1952 .

[24]  Jake K. Aggarwal,et al.  On the computation of motion from sequences of images-A review , 1988, Proc. IEEE.

[25]  Daniel A. Pollen,et al.  Visual cortical neurons as localized spatial frequency filters , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[26]  Ellen C. Hildreth,et al.  Computations Underlying the Measurement of Visual Motion , 1984, Artif. Intell..

[27]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Ajit Singh,et al.  Incremental estimation of image flow using a Kalman filter , 1992, J. Vis. Commun. Image Represent..

[29]  P. Thompson,et al.  Human speed perception is contrast dependent , 1992, Vision Research.

[30]  Ciro Cafforio,et al.  Methods for measuring small displacements of television images , 1976, IEEE Trans. Inf. Theory.

[31]  S. Negahdaripour,et al.  Relaxing the Brightness Constancy Assumption in Computing Optical Flow , 1987 .

[32]  Y. J. Tejwani,et al.  Robot vision , 1989, IEEE International Symposium on Circuits and Systems,.

[33]  Andrew B. Watson,et al.  A look at motion in the frequency domain , 1983 .

[34]  Roger W. Brockett Gramians, generalized inverses, and the least squares approximation of optical flow , 1987, Proceedings. 1987 IEEE International Conference on Robotics and Automation.

[35]  Hans Knutsson,et al.  Texture Analysis Using Two-Dimensional Quadrature Filters , 1983 .

[36]  Donald Geman,et al.  Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images , 1984 .

[37]  Joseph K. Kearney,et al.  Optical Flow Estimation: An Error Analysis of Gradient-Based Methods with Local Optimization , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[39]  H. Wilson,et al.  A psychophysically motivated model for two-dimensional motion perception , 1992, Visual Neuroscience.

[40]  Hans-Hellmut Nagel,et al.  An Investigation of Smoothness Constraints for the Estimation of Displacement Vector Fields from Image Sequences , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  David J. Heeger,et al.  Model of visual motion sensing , 1994 .

[42]  Claude L. Fennema,et al.  Velocity determination in scenes containing several moving objects , 1979 .

[43]  K. C. Chou,et al.  Recursive and iterative estimation algorithms for multiresolution stochastic processes , 1989, Proceedings of the 28th IEEE Conference on Decision and Control,.

[44]  A. Willsky,et al.  An estimation-based approach to the reconstruction of optical flow , 1987 .

[45]  P. J. Burt,et al.  Fast Filter Transforms for Image Processing , 1981 .

[46]  A J Ahumada,et al.  Model of human visual-motion sensing. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[47]  Hans-Hellmut Nagel,et al.  Displacement vectors derived from second-order intensity variations in image sequences , 1983, Comput. Vis. Graph. Image Process..

[48]  G. F. Cooper,et al.  The angular selectivity of visual cortical cells to moving gratings , 1968, The Journal of physiology.

[49]  R. Haralick,et al.  The Facet Approach to Optic Flow , 1983 .

[50]  A.H. Haddad,et al.  Applied optimal estimation , 1976, Proceedings of the IEEE.

[51]  E. Adelson,et al.  Directionally selective complex cells and the computation of motion energy in cat visual cortex , 1992, Vision Research.

[52]  H. Wilson,et al.  Perceived direction of moving two-dimensional patterns , 1990, Vision Research.

[53]  Leslie Welch,et al.  The perception of moving plaids reveals two motion-processing stages , 1989, Nature.

[54]  D. Heeger Normalization of cell responses in cat striate cortex , 1992, Visual Neuroscience.

[55]  Demetri Terzopoulos,et al.  Image Analysis Using Multigrid Relaxation Methods , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[56]  E. Adelson,et al.  Phenomenal coherence of moving visual patterns , 1982, Nature.

[57]  Tomaso Poggio,et al.  Optical flow: computational properties and networks, biological and analog , 1989 .

[58]  J. Daugman Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[59]  David W. Murray,et al.  Scene Segmentation from Visual Motion Using Global Optimization , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[60]  P. Thompson Perceived rate of movement depends on contrast , 1982, Vision Research.

[61]  Gilad Adiv,et al.  Determining Three-Dimensional Motion and Structure from Optical Flow Generated by Several Moving Objects , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[62]  Michael J. Black,et al.  Robust dynamic motion estimation over time , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[63]  T. Poggio,et al.  Visual hyperacuity: spatiotemporal interpolation in human vision , 1981, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[64]  W. C. Karl,et al.  Eecient Multiscale Regularization with Applications to the Computation of Optical Flow 1 , 1993 .

[65]  R. Weale Vision. A Computational Investigation Into the Human Representation and Processing of Visual Information. David Marr , 1983 .

[66]  D.H. Johnson,et al.  The application of spectral estimation methods to bearing estimation problems , 1982, Proceedings of the IEEE.

[67]  J. B. Mulligan,et al.  Effect of contrast on the perceived direction of a moving plaid , 1990, Vision Research.