X Vision: Combining Image Warping and Geometric Constraints for Fast Visual Tracking

In this article, we describe X Vision, a modular, portable framework for visual tracking. X Vision is designed as a programming environment for real-time vision that provides high performance on standard workstations outfitted with a simple digitizer. It consists of a small set of image-level tracking primitives and a framework for combining those primitives to form complex tracking systems. Efficiency and robustness are achieved by propagating geometric and temporal constraints to the feature detection level, where image warping and specialized image processing are combined to perform feature detection quickly and robustly. We illustrate how useful, robust tracking systems can be constructed by simple combinations of a few basic primitives with the appropriate task-specific constraints.
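To make the idea of combining image warping with specialized image processing concrete, here is a minimal sketch in Python. It is not X Vision's code or API: the function names (`warp_window`, `edge_offset`, `make_edge_image`), the nearest-neighbour sampling, and the column-sum edge detector are all assumptions chosen for illustration. The sketch assumes that a higher-level constraint has already predicted an edge's position and orientation. A small window is warped so that the edge appears vertical, which reduces detection to a one-dimensional search.

```python
import math

def warp_window(image, cx, cy, angle, width, height):
    """Sample a width x height window centred at (cx, cy) and rotated by
    `angle` radians. Nearest-neighbour sampling; coordinates are clamped
    to the image bounds. (Hypothetical helper, not X Vision's API.)"""
    rows, cols = len(image), len(image[0])
    c, s = math.cos(angle), math.sin(angle)
    window = []
    for j in range(height):
        v = j - height / 2              # coordinate along the edge
        row = []
        for i in range(width):
            u = i - width / 2           # coordinate across the edge
            x = int(math.floor(cx + u * c - v * s + 0.5))
            y = int(math.floor(cy + u * s + v * c + 0.5))
            x = min(max(x, 0), cols - 1)
            y = min(max(y, 0), rows - 1)
            row.append(image[y][x])
        window.append(row)
    return window

def edge_offset(window):
    """Sum each column of the warped window, then return the offset
    (in pixels, relative to the window centre) of the largest jump
    between adjacent column sums."""
    width = len(window[0])
    col_sums = [sum(row[i] for row in window) for i in range(width)]
    diffs = [abs(col_sums[i + 1] - col_sums[i]) for i in range(width - 1)]
    best = max(range(len(diffs)), key=diffs.__getitem__)
    return best + 0.5 - width / 2

def make_edge_image(size, angle):
    """Synthetic test image: a step edge through the image centre whose
    normal points in direction `angle`."""
    c, s = math.cos(angle), math.sin(angle)
    mid = size / 2
    return [[1.0 if (x - mid) * c + (y - mid) * s > 0 else 0.0
             for x in range(size)] for y in range(size)]

theta = math.radians(30)
image = make_edge_image(64, theta)

# Window centred on the predicted edge location: offset should be near 0.
offset_centred = edge_offset(warp_window(image, 32, 32, theta, 20, 20))

# Window centred 3 pixels past the edge along its normal: offset near -3.
cx = 32 + 3 * math.cos(theta)
cy = 32 + 3 * math.sin(theta)
offset_shifted = edge_offset(warp_window(image, cx, cy, theta, 20, 20))

print(offset_centred, offset_shifted)
```

In a tracking loop, the measured offset would be fed back to update the predicted window position for the next frame. Because only a small window is warped and processed, the per-frame cost depends on the window size rather than on the size of the full image.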
