A 3D Feature-Based Tracker for Multiple Object Tracking

This paper presents a 3D feature-based tracker for tracking multiple moving objects using a computer- controlled binocular head. Our tracker operates in two phases: an initialization phase and a tracking phase. In the initialization phase, correspondence between 2D features in the first stereo image pair is determined reliably using the epipolar line constraint and mutually-supported consistency. In the tracking phase, the feedback loop is established by first predicting new 3D feature locations with Kalman filters (KF) and then projecting them onto the 2D images to guide the extraction of 2D features in the new image pair. Here, we propose a RANSAC (RAN-dom SAmple Consensus)-based clustering method for motion segmentation and estimation using the principle of rigid body consensus, which states that all the extracted 3D features on a rigid body should have the same 3D motion. This new method leads to a feature-clustering algorithm which provides a systematic method for managing splitting, merging, appearance and disappearance of multiple moving rigid objects-including articulated objects, such as robot manipulators. Using the motion estimates obtained with the RANSAC-based method as the measurements for the KFs, we are able to use linear KFs for predictive visual tracking instead of the commonly-used extended Kalman filters (EKF). Experiments have shown that our tracking system does give good results and can serve as a robust 3D feature tracker for the active binocular vision system we are developing.

[1]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[2]  Narendra Ahuja,et al.  3-D Motion Estimation, Understanding, and Prediction from Noisy Image Sequences , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Howard Wainer,et al.  Robust Regression & Outlier Detection , 1988 .

[4]  Shimon Ullman,et al.  Aligning A Model To An Image Using Minimal Information , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[5]  Y. Bar-Shalom Tracking and data association , 1988 .

[6]  R. Bajcsy Active perception , 1988, Proc. IEEE.

[7]  Ronen Basri,et al.  The Alignment Of Objects With Smooth Surfaces , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[8]  R. Chellappa,et al.  Recursive 3-D motion estimation from a monocular image sequence , 1990 .

[9]  K. Kanatani Group-Theoretical Methods in Image Understanding , 1990 .

[10]  Rama Chellappa,et al.  3-D Motion Estimation Using a Sequence of Noisy Stereo Images: Models, Estimation, and Uniqueness Results , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Ronen Basri,et al.  Recognition by Linear Combinations of Models , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Nikolaos Papanikolopoulos Controlled active vision , 1992 .

[13]  James L. Crowley,et al.  Gaze Control for a Binocular Camera Head , 1992, ECCV.

[14]  Jan-Olof Eklundh,et al.  A head-eye system - Analysis and design , 1992, CVGIP Image Underst..

[15]  Olivier Faugeras,et al.  3D Dynamic Scene Analysis: A Stereo Based Approach , 1992 .

[16]  David W. Murray,et al.  A modular head/eye platform for real-time reactive vision Mechatronics , 1993 .

[17]  James W. Cooper,et al.  Early Jump-Out Corner Detectors , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Peter K. Allen,et al.  Automated tracking and grasping of a moving object with a robotic hand-eye system , 1993, IEEE Trans. Robotics Autom..

[19]  A. Lynn Abbott,et al.  Active Stereo: Integrating Disparity, Vergence, Focus, Aperture and Calibration for Surface Estimation , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  James J. Clark,et al.  The Harvard Binocular Head , 1993, Int. J. Pattern Recognit. Artif. Intell..

[21]  Henrik I. Christensen,et al.  A Low-Cost Robot Camera Head , 1993, Int. J. Pattern Recognit. Artif. Intell..

[22]  O. Faugeras Three-dimensional computer vision: a geometric viewpoint , 1993 .

[23]  M. Levine,et al.  Extracting geometric primitives , 1993 .

[24]  Ian D. Reid,et al.  Saccade and pursuit on an active head/eye platform , 1994, Image Vis. Comput..

[25]  Thomas S. Huang,et al.  Learning and Feature Selection in Stereo Matching , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Henrik I. Christensen,et al.  A Control Theoretical Approach to Active Vision , 1995, ACCV.

[27]  Rama Chellappa,et al.  Tracking a dynamic set of feature points , 1994, IEEE Trans. Image Process..

[28]  Rachid Deriche,et al.  A Robust Technique for Matching two Uncalibrated Images Through the Recovery of the Unknown Epipolar Geometry , 1995, Artif. Intell..

[29]  Alex Pentland,et al.  Recursive Estimation of Motion, Structure, and Focal Length , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Andrew Zisserman,et al.  Detection and tracking of independent motion , 1996, Image Vis. Comput..

[31]  Roberto Cipolla,et al.  Fast visual tracking by temporal consensus , 1996, Image Vis. Comput..

[32]  Wei-Song Lin,et al.  New closed-form solution for kinematic parameter identification of a binocular head using point measurements , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[33]  Wei-Song Lin,et al.  Calibration of an active binocular head , 1998, IEEE Trans. Syst. Man Cybern. Part A.