A Multi-Camera Pose Tracker for Assisting the Visually Impaired

6DOF Pose tracking is useful in many contexts, e.g., in augmented reality (AR) applications. In particular, we seek to assist visually impaired persons by providing them with an auditory interface to their environment through soni.cation. For this purpose, accurate head tracking in mixed indoor/outdoor settings is the key enabling technology. Most of the work to date has concentrated on single-camera systems with a relatively small .eld of view, but this presents a fundamental limit on the accuracy of such systems. We present a multi-camera pose tracker that handles an arbitrary con.guration of cameras rigidly .xed to the object of interest. By using multiple cameras, we increase both the robustness and the accuracy by which a 6-DOF pose is tracked. However, in a multi-camera rig setting, earlier methods for determining the unknown pose from three world-tocamera correspondences are no longer applicable, as they all assume a common center of projection. In this paper, we present a RANSAC-based method that copes with this limitation and handles multi-camera rigs. In addition, we present quantitative results to serve as a design guide for full system deployments based on multicamera rigs. Our formulation is completely general, in that it handles an arbitrary, heterogeneous collection of cameras in any arbitrary con.guration.

[1]  Frank Biocca,et al.  A Survey of Position Trackers , 1992, Presence: Teleoperators & Virtual Environments.

[2]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[3]  Robert M. Haralick,et al.  Review and analysis of solutions of the three point perspective pose estimation problem , 1994, International Journal of Computer Vision.

[4]  Joachim Hertzberg,et al.  Robust localization using context in omnidirectional imaging , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[5]  Tom Drummond,et al.  Robust visual tracking for non-instrumental augmented reality , 2003, The Second IEEE and ACM International Symposium on Mixed and Augmented Reality, 2003. Proceedings..

[6]  Yakup Genc,et al.  Marker-less tracking for AR: a learning-based approach , 2002, Proceedings. International Symposium on Mixed and Augmented Reality.

[7]  X. Yi On Automatic Differentiation , 2005 .

[8]  David Nistér,et al.  A Minimal Solution to the Generalised 3-Point Pose Problem , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[9]  Ales Leonardis,et al.  Robust localization using panoramic view-based recognition , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[10]  Griewank,et al.  On automatic differentiation , 1988 .

[11]  Greg Welch,et al.  SCAAT: incremental tracking with incomplete information , 1997, SIGGRAPH.

[12]  Liqiang Feng,et al.  Navigating Mobile Robots: Systems and Techniques , 1996 .

[13]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[14]  Long Quan,et al.  Linear N-Point Camera Pose Determination , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  W. James MacLean,et al.  Video-rate stereo depth measurement on programmable hardware , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[16]  Adam Baumberg,et al.  Reliable feature matching across widely separated views , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[17]  C. Schmid,et al.  Indexing based on scale invariant interest points , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[18]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  Cordelia Schmid,et al.  An Affine Invariant Interest Point Detector , 2002, ECCV.

[20]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[21]  Pietro Perona,et al.  Object class recognition by unsupervised scale-invariant learning , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[22]  Vincent Lepetit,et al.  Fully automated and stable registration for augmented reality applications , 2003, The Second IEEE and ACM International Symposium on Mixed and Augmented Reality, 2003. Proceedings..

[23]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[24]  Luc Van Gool,et al.  Markerless augmented reality with a real-time affine region tracker , 2001, Proceedings IEEE and ACM International Symposium on Augmented Reality.

[25]  Frank Dellaert,et al.  Linear 2D localization and mapping for single and multiple robot scenarios , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[26]  R. Chellappa,et al.  Recursive 3-D motion estimation from a monocular image sequence , 1990 .