Global Hand Pose Estimation by Multiple Camera Ellipse Tracking

Immersive virtual environments with life-like interaction capabilities have very demanding requirements including high precision and processing speed. These issues raise many challenges for computer vision-based motion estimation algorithms. In this study, we consider the problem of hand tracking using multiple cameras and estimating its 3D global pose (i.e., position and orientation of the palm). Our interest is in developing an accurate and robust algorithm to be employed in an immersive virtual training environment, called ”Virtual GloveboX” (VGX) [1], which is currently under development at NASA Ames. In this context, we present a marker-based, hand tracking and 3D global pose estimation algorithm that operates in a controlled, multi-camera, environment built to track the user’s hand inside VGX. The key idea of the proposed algorithm is tracking the 3D position and orientation of an elliptical marker placed on the dorsal part of the hand using model-based tracking approaches and active camera selection. It should be noted that, the use of markers is well justified in the context of our application since VGX naturally allows for the use of gloves without disrupting the fidelity of the interaction. Our experimental results and comparisons illustrate that the proposed approach is more accurate and robust than related approaches. A byproduct of our multi-camera ellipse tracking algorithm is that, with only minor modifications, the same algorithm can be used to automatically re-calibrate (i.e., fine-tune) the extrinsic parameters of a multi-camera system leading to more accurate pose estimates.

[1]  Eun-Jung Holden,et al.  Visual recognition of hand motion , 1997 .

[2]  C. Maggioni,et al.  Computer Vision for Human–Machine Interaction: GestureComputer – History, Design and Applications , 1998 .

[3]  Haiying Guan,et al.  Model-based 3D hand posture estimation from a single 2D image , 2002, Image Vis. Comput..

[4]  Jeffrey D. Smith,et al.  The virtual glovebox (vgx): a semi-immersive virtual environment for training astronauts in life sci , 2004 .

[5]  Mircea Nicolescu,et al.  A Review on Vision-Based Full DOF Hand Motion Estimation , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[6]  Tomás Svoboda,et al.  A Convenient Multicamera Self-Calibration for Virtual Environments , 2005, Presence: Teleoperators & Virtual Environments.

[7]  Songde Ma,et al.  Conics-based stereo, motion estimation, and pose determination , 1993, International Journal of Computer Vision.

[8]  Chung-Lin Huang,et al.  Model-based articulated hand motion tracking for gesture recognition , 1998, Image Vis. Comput..

[9]  Dieter W. Fellner,et al.  Interaction with hand gesture for a back-projection wall , 2004 .

[10]  David G. Lowe,et al.  Fitting Parameterized Three-Dimensional Models to Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Long Quan,et al.  Conic Reconstruction and Correspondence From Two Views , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Vladimir Pavlovic,et al.  Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Tosiyasu L. Kunii,et al.  Constraint-Based Hand Animation , 1993 .

[14]  Radu Horaud,et al.  Multiple-Camera Tracking of Rigid Objects , 2002, Int. J. Robotics Res..