Global hand pose estimation by multiple camera ellipse tracking

Immersive virtual environments with life-like interaction capabilities have very demanding requirements including high-precision motion capture and high-processing speed. These issues raise many challenges for computer vision-based motion estimation algorithms. In this study, we consider the problem of hand tracking using multiple cameras and estimating its 3D global pose (i.e., position and orientation of the palm). Our interest is in developing an accurate and robust algorithm to be employed in an immersive virtual training environment, called “Virtual GloveboX” (VGX) (Twombly et al. in J Syst Cybern Inf 2:30–34, 2005), which is currently under development at NASA Ames. In this context, we present a marker-based, hand tracking and 3D global pose estimation algorithm that operates in a controlled, multi-camera, environment built to track the user’s hand inside VGX. The key idea of the proposed algorithm is tracking the 3D position and orientation of an elliptical marker placed on the dorsal part of the hand using model-based tracking approaches and active camera selection. It should be noted that, the use of markers is well justified in the context of our application since VGX naturally allows for the use of gloves without disrupting the fidelity of the interaction. Our experimental results and comparisons illustrate that the proposed approach is more accurate and robust than related approaches. A byproduct of our multi-camera ellipse tracking algorithm is that, with only minor modifications, the same algorithm can be used to automatically re-calibrate (i.e., fine-tune) the extrinsic parameters of a multi-camera system leading to more accurate pose estimates.

[1]  C. Maggioni,et al.  Computer Vision for Human–Machine Interaction: GestureComputer – History, Design and Applications , 1998 .

[2]  Cheng-Chang Lien A scalable model-based hand posture analysis system , 2004, Machine Vision and Applications.

[3]  Mircea Nicolescu,et al.  A Review on Vision-Based Full DOF Hand Motion Estimation , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[4]  Chung-Lin Huang,et al.  Model-based articulated hand motion tracking for gesture recognition , 1998, Image Vis. Comput..

[5]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Jeffrey D. Smith,et al.  The virtual glovebox (vgx): a semi-immersive virtual environment for training astronauts in life sci , 2004 .

[7]  Andrew W. Fitzgibbon,et al.  Direct Least Square Fitting of Ellipses , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Haiying Guan,et al.  Model-based 3D hand posture estimation from a single 2D image , 2002, Image Vis. Comput..

[9]  Tomás Svoboda,et al.  A Convenient Multicamera Self-Calibration for Virtual Environments , 2005, Presence: Teleoperators & Virtual Environments.

[10]  Songde Ma,et al.  Conics-based stereo, motion estimation, and pose determination , 1993, International Journal of Computer Vision.

[11]  Mircea Nicolescu,et al.  Vision-based hand pose estimation: A review , 2007, Comput. Vis. Image Underst..

[12]  Vladimir Pavlovic,et al.  Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Tosiyasu L. Kunii,et al.  Constraint-Based Hand Animation , 1993 .

[14]  Dieter W. Fellner,et al.  Interaction with hand gesture for a back-projection wall , 2004 .

[15]  Radu Horaud,et al.  Multiple-Camera Tracking of Rigid Objects , 2002, Int. J. Robotics Res..

[16]  Larry S. Davis,et al.  3-D model-based tracking of humans in action: a multi-view approach , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[17]  J. O'Rourke,et al.  Model-based image analysis of human motion using constraint propagation , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Jean-Yves Bouguet,et al.  Camera calibration toolbox for matlab , 2001 .

[19]  David G. Lowe,et al.  Fitting Parameterized Three-Dimensional Models to Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Dieter W. Fellner,et al.  Interaction with hand gesture for a back-projection wall , 2004, Proceedings Computer Graphics International, 2004..

[21]  Daniel Thalmann,et al.  Models and Techniques in Computer Animation , 2014, Computer Animation Series.

[22]  Long Quan,et al.  Conic Reconstruction and Correspondence From Two Views , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Eun-Jung Holden,et al.  Visual recognition of hand motion , 1997 .