Hand pose estimation using voxel-based individualized hand model

A robust and adaptive hand pose estimation technique is necessary to create a natural and non-contact manmachine interface system. In this regard, individualizing the system's hand model is key to improving hand pose estimation. This paper addresses the issue of calibrating the hand model of the system for every user in order to get a more robust pose estimation results. Our system is a vision-based model-based approach that uses a skeletal hand model composed of a finger link structure and a surface structure. The surface structure is composed of voxel which is derived from silhouette images obtained by multi-viewpoint cameras. The finger link structure is estimated by searching for the optimum lengths from a set of values generated from the calibration motion of the fingers. We compared the output of using a standard (non-calibrated) hand model with our proposed individualized hand model in the pose estimation system. Results show that using the calibrated hand model has a more robust result than the one using the standard model.

[1]  Tsukasa Ogasawara,et al.  Hand pose estimation for vision-based human interface , 2001, Proceedings 10th IEEE International Workshop on Robot and Human Interactive Communication. ROMAN 2001 (Cat. No.01TH8591).

[2]  Tsuneya Kurihara,et al.  Modeling deformable human hands from medical images , 2004, SCA '04.

[3]  Takeo Kanade,et al.  DigitEyes: vision-based hand tracking for human-computer interaction , 1994, Proceedings of 1994 IEEE Workshop on Motion of Non-rigid and Articulated Objects.

[4]  Sebastian Thrun,et al.  A Gesture Based Interface for Human-Robot Interaction , 2000, Auton. Robots.

[5]  Luc Van Gool,et al.  Smart particle filtering for 3D hand tracking , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[6]  Tsukasa Ogasawara,et al.  A hand-pose estimation for vision-based human interfaces , 2003, IEEE Trans. Ind. Electron..

[7]  Modesto Castrillón Santana,et al.  Fast and Accurate Hand Pose Detection for Human-Robot Interaction , 2005, IbPRIA.

[8]  Ying Wu,et al.  Capturing natural hand articulation , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[9]  Jonathan Kofman,et al.  Teleoperation of a robot manipulator using a vision-based human-robot interface , 2005, IEEE Transactions on Industrial Electronics.

[10]  Paulo R. S. Mendonça,et al.  Model-based 3D tracking of an articulated hand , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[11]  Mohan M. Trivedi,et al.  Human Body Model Acquisition and Tracking Using Voxel Data , 2003, International Journal of Computer Vision.

[12]  John P. Lewis,et al.  Soft-Tissue Deformation for In Vivo Volume Animation , 2007, 15th Pacific Conference on Computer Graphics and Applications (PG'07).

[13]  François Bérard,et al.  Bare-hand human-computer interaction , 2001, PUI '01.

[14]  Takeo Kanade,et al.  A real time system for robust 3D voxel reconstruction of human motions , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[15]  Gershon Elber,et al.  Papercraft Models using Generalized Cylinders , 2007 .

[16]  Alexander H. Waibel,et al.  Natural human-robot interaction using speech, head pose and gestures , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[17]  Richard Szeliski,et al.  Rapid octree construction from image sequences , 1993 .