Scene Segmentation from Visual Motion Using Global Optimization

This paper presents results from computer experiments with an algorithm to perform scene disposition and motion segmentation from visual motion or optic flow. The maximum a posteriori (MAP) criterion is used to formulate what the best segmentation or interpretation of the scene should be, where the scene is assumed to be made up of some fixed number of moving planar surface patches. The Bayesian approach requires, first, specification of prior expectations for the optic flow field, which here is modeled as spatial and temporal Markov random fields; and, secondly, a way of measuring how well the segmentation predicts the measured flow field. The Markov random fields incorporate the physical constraints that objects and their images are probably spatially continuous, and that their images are likely to move quite smoothly across the image plane. To compute the flow predicted by the segmentation, a recent method for reconstructing the motion and orientation of planar surface facets is used. The search for the globally optimal segmentation is performed using simulated annealing.

[1]  B. Buxton,et al.  Monocular depth perception from optical flow by space time signal processing , 1983, Proceedings of the Royal Society of London. A. Mathematical and Physical Sciences.

[2]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[3]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[4]  Gilad Adiv,et al.  Determining Three-Dimensional Motion and Structure from Optical Flow Generated by Several Moving Objects , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Allen M. Waxman,et al.  Contour Evolution, Neighborhood Deformation, and Global Image Flow: Planar Surfaces in Motion , 1985 .

[6]  Tomaso Poggio,et al.  Computational vision and regularization theory , 1985, Nature.

[7]  David W. Murray,et al.  Structure-from-motion algorithms for computer vision on an SIMD architecture , 1985 .

[8]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[9]  H. Wallach,et al.  The kinetic depth effect. , 1953, Journal of experimental psychology.

[10]  Thomas S. Huang,et al.  Uniqueness and Estimation of Three-Dimensional Motion Parameters of Rigid Objects with Curved Surfaces , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Daryl T. Lawton,et al.  Processing translational motion sequences , 1983, Comput. Vis. Graph. Image Process..

[12]  David W. Murray,et al.  3D Solutions to the Aperture Problem , 1984, ECAI.

[13]  Berthold K. P. Horn,et al.  Passive navigation , 1982, Computer Vision Graphics and Image Processing.

[14]  W. B. Thompson,et al.  Combining motion and contrast for segmentation , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  David W. Murray,et al.  Optic flow segmentation as an ill-posed and maximum likelihood problem , 1985, Image Vis. Comput..

[16]  Joseph K. Kearney,et al.  GRADIENT-BASED ESTIMATION OF OPTICAL FLOW WITH GLOBAL OPTIMIZATION. , 1984 .

[17]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[18]  Hans-Hellmut Nagel,et al.  Displacement vectors derived from second-order intensity variations in image sequences , 1983, Comput. Vis. Graph. Image Process..

[19]  Hilary Buxton,et al.  Computation of optic flow from the motion of edge features in image sequences , 1984, Image Vis. Comput..

[20]  S. Ullman,et al.  The interpretation of visual motion , 1977 .

[21]  David W. Murray,et al.  Detecting the image boundaries between optical flow fields from several moving planar facets , 1986, Pattern Recognit. Lett..

[22]  Ellen C. Hildreth,et al.  Computations Underlying the Measurement of Visual Motion , 1984, Artif. Intell..

[23]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  William B. Thompson,et al.  Lower-Level Estimation and Interpretation of Visual Motion , 1981, Computer.

[25]  Claude L. Fennema,et al.  Velocity determination in scenes containing several moving objects , 1979 .

[26]  Bennett I. Bertenthal,et al.  Recovering Connectivity from Moving Point-Light Displays , 1988 .

[27]  G. Johansson Visual perception of biological motion and a model for its analysis , 1973 .

[28]  J. Marroquín Surface Reconstruction Preserving Discontinuities , 1984 .

[29]  G Ishai,et al.  Visual stability and space perception in monocular vision: mathematical model. , 1980, Journal of the Optical Society of America.

[30]  David W. Murray,et al.  A parallel approach to the picture restoration algorithm of Geman and Geman on an SIMD machine , 1986, Image Vis. Comput..

[31]  H C Longuet-Higgins,et al.  The visual ambiguity of a moving plane , 1984, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[32]  Allen M. Waxman,et al.  Surface Structure and Three-Dimensional Motion from Image Flow Kinematics , 1985 .

[33]  T. Poggio,et al.  Ill-Posed Problems and Regularization Analysis in Early Vision , 1984 .

[34]  R. Paquin,et al.  A spatio-temporal gradient method for estimating the displacement field in time-varying imagery , 1982, Computer Graphics and Image Processing.

[35]  W F Clocksin,et al.  Perception of Surface Slant and Edge Labels from Optical Flow: A Computational Approach , 1980, Perception.