Monocular optical flow for real-time vision systems

This paper introduces a monocular optical flow algorithm that has been shown to perform well at nearly real-time frame rates (4 FPS on a 100 MHz SGI Indy workstation), using natural image sequences. The system is completely bottom-up, using pixel region-matching techniques. A coordinated gradient descent method is broken down into two stages: pixel region matching error measures are locally minimized and flow field consistency constraints apply nonlinear adaptive diffusion, causing confident measurements to influence their less confident neighbors. Convergence is usually accomplished with one iteration for an image frame pair. Temporal integration predicts upcoming flow fields. The algorithm is designed for flexibility: large displacements are tracked as easily as sub-pixel displacements, and higher-level information can feed flow field predictions into the measurement process.

[1]  Steven W. Zucker,et al.  Early orientation selection: Tangent fields and the dimensionality of their support , 1985, Comput. Vis. Graph. Image Process..

[2]  T. Poggio Vision by man and machine. , 1984, Scientific American.

[3]  Arun K. Sood,et al.  Analysis of long image sequence for structure and motion estimation , 1989, IEEE Trans. Syst. Man Cybern..

[4]  Ishwar K. Sethi,et al.  Finding Trajectories of Feature Points in a Monocular Image Sequence , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Rama Chellappa,et al.  Estimation of Object Motion Parameters from Noisy Images , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Olivier D. Faugeras,et al.  Motion analysis with a camera with unknown, and possibly varying intrinsic parameters , 1995, Proceedings of IEEE International Conference on Computer Vision.

[7]  Ajit Singh,et al.  Optic flow computation : a unified perspective , 1991 .

[8]  Thomas S. Huang,et al.  Estimating three-dimensional motion parameters of a rigid planar patch, II: Singular value decomposition , 1982 .

[9]  D Marr,et al.  Early processing of visual information. , 1976, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[10]  Shimon Ullman,et al.  Analysis of Visual Motion by Biological and Computer Systems , 1981, Computer.

[11]  Narendra Ahuja,et al.  Motion and Structure From Two Perspective Views: Algorithms, Error Analysis, and Error Estimation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Ajit Singh,et al.  An estimation-theoretic framework for image-flow computation , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[13]  Jake K. Aggarwal,et al.  Structure from Motion of Rigid and Jointed Objects , 1981, Artif. Intell..

[14]  Rama Chellappa,et al.  Monocular motion estimation using a long sequence of noisy images , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[15]  P. Anandan Measuring Visual Motion From Image Sequences , 1987 .

[16]  Harry G. Barrow,et al.  Interpreting Line Drawings as Three-Dimensional Surfaces , 1980, Artif. Intell..

[17]  Peter Allen,et al.  Image-flow computation: An estimation-theoretic framework and a unified perspective , 1992, CVGIP Image Underst..

[18]  R. W. Ditchburn Seeing is Deceiving: The Psychology of Visual Illusions , 1979 .

[19]  L L Kontsevich,et al.  Pairwise comparison technique: a simple solution for depth reconstruction. , 1993, Journal of the Optical Society of America. A, Optics and image science.

[20]  Joachim Heel,et al.  Temporally integrated surface reconstruction , 1990, [1990] Proceedings Third International Conference on Computer Vision.

[21]  Richard I. Hartley,et al.  In defence of the 8-point algorithm , 1995, Proceedings of IEEE International Conference on Computer Vision.

[22]  H.-H. Nagel,et al.  Representation of Moving Rigid Objects Based on Visual Observations , 1981, Computer.

[23]  Jitendra Malik,et al.  Rigid body segmentation and shape description from dense optical flow under weak perspective , 1995, Proceedings of IEEE International Conference on Computer Vision.

[24]  D Marr,et al.  Cooperative computation of stereo disparity. , 1976, Science.

[25]  Amnon Shashua,et al.  Projective Structure from Uncalibrated Images: Structure From Motion and Recognition , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  David J. Fleet Measurement of image velocity , 1992 .

[27]  Andrea Giachetti,et al.  Optical flow and deformable objects , 1995, Proceedings of IEEE International Conference on Computer Vision.

[28]  H. C. Longuet-Higgins,et al.  A computer algorithm for reconstructing a scene from two projections , 1981, Nature.

[29]  J. Allman,et al.  Cytochrome oxidase and functional coding in primate striate cortex: a hypothesis. , 1990, Cold Spring Harbor symposia on quantitative biology.

[30]  Steven W. Zucker,et al.  Trace Inference, Curvature Consistency, and Curve Detection , 1989, IEEE Trans. Pattern Anal. Mach. Intell..