Active Tracking Strategy for Monocular Depth Inference over Multiple Frames

The extraction of depth information from a sequence of images is investigated. An algorithm that exploits the constraint imposed by active motion of the camera is described. Within this framework, in order to facilitate measurement of the navigation parameters, a constrained egomotion strategy was adopted in which the position of the fixation point is stabilized during the navigation (in an anthropomorphic fashion). This constraint reduces the dimensionality of the parameter space without increasing the complexity of the equations. A further distinctive point is the use of two sampling rates: the faster (related to the computation of the instantaneous optical flow) is fast enough to allow the local operator to sense the passing edge (or, in other words, to allow the tracking of moving contour points), while the slower (used to perform the triangulation procedure necessary to derive depth) is slow enough to provide a sufficiently large baseline for triangulation. Experimental results on real image sequences are presented. >

[1]  D Marr,et al.  Theory of edge detection , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[2]  Dana H. Ballard,et al.  ACTIVE NAVIGATION: TRACKING AN ENVIRONMENTAL POINT CONSIDERED BENEFICIAL. , 1986 .

[3]  Larry S. Davis,et al.  Contour-based motion estimation , 1982, Comput. Vis. Graph. Image Process..

[4]  Giulio Sandini,et al.  Analysis of object motion and camera motion in real scenes , 1986, Proceedings. 1986 IEEE International Conference on Robotics and Automation.

[5]  Giulio Sandini,et al.  Control Strategies in the Eye-Head Coordination System , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[6]  Larry S. Davis,et al.  Contour-based motion estimation , 1982, Comput. Vis. Graph. Image Process..

[7]  Hans-Hellmut Nagel,et al.  Displacement vectors derived from second-order intensity variations in image sequences , 1983, Comput. Vis. Graph. Image Process..

[8]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[9]  H. K. Nishihara,et al.  PRISM: A Practical Mealtime Imaging Stereo Matcher , 1984, Other Conferences.

[10]  Thomas S. Huang,et al.  A discrete spatial representation for lateral motion stereo , 1982, Comput. Graph. Image Process..

[11]  Giulio Sandini,et al.  Motor and spatial aspects in artificial vision , 1988 .

[12]  D Marr,et al.  A computational theory of human stereo vision. , 1979, Proceedings of the Royal Society of London. Series B, Biological sciences.

[13]  Giulio Sandini,et al.  Active vision: integration of fixed and mobile cameras , 1988 .

[14]  W Richards,et al.  Structure from stereo and motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[15]  Noboru Sugie,et al.  Visual Target Tracking with Active Head Rotation , 1970, IEEE Trans. Syst. Sci. Cybern..

[16]  T. Poggio,et al.  A computational theory of human stereo vision , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[17]  Berthold K. P. Horn Robot vision , 1986, MIT electrical engineering and computer science series.

[18]  Hans-Hellmut Nagel,et al.  An Investigation of Smoothness Constraints for the Estimation of Displacement Vector Fields from Image Sequences , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  S Ullman,et al.  Maximizing Rigidity: The Incremental Recovery of 3-D Structure from Rigid and Nonrigid Motion , 1984, Perception.

[20]  Ellen C. Hildreth,et al.  Measurement of Visual Motion , 1984 .

[21]  Daryl T. Lawton,et al.  Processing translational motion sequences , 1983, Comput. Vis. Graph. Image Process..

[22]  Gérard G. Medioni,et al.  Detection of Intensity Changes with Subpixel Accuracy Using Laplacian-Gaussian Masks , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.