A natural and synthetic corpus for benchmarking of hand gesture recognition systems

The use of hand gestures offers an alternative to the commonly used human–computer interfaces (i.e. keyboard, mouse, gamepad, voice, etc.), providing a more intuitive way of navigating among menus and in multimedia applications. This paper presents a dataset for the evaluation of hand gesture recognition approaches in human–computer interaction scenarios. It includes natural data and synthetic data from several State of the Art dictionaries. The dataset considers single-pose and multiple-pose gestures, as well as gestures defined by pose and motion or just by motion. Data types include static pose videos and gesture execution videos—performed by a set of eleven users and recorded with a time-of-flight camera—and synthetically generated gesture images. A novel collection of critical factors involved in the creation of a hand gestures dataset is proposed: capture technology, temporal coherence, nature of gestures, representativeness, pose issues and scalability. Special attention is given to the scalability factor, proposing a simple method for the synthetic generation of depth images of gestures, making possible the extension of a dataset with new dictionaries and gestures without the need of recruiting new users, as well as providing more flexibility in the point-of-view selection. The method is validated for the presented dataset. Finally, a separability study of the pose-based gestures of a dictionary is performed. The resulting corpus, which exceeds in terms of representativity and scalability the datasets existing in the State Of Art, provides a significant evaluation scenario for different kinds of hand gesture recognition solutions.

[1]  Ming-Kuei Hu,et al.  Visual pattern recognition by moment invariants , 1962, IRE Trans. Inf. Theory.

[2]  Abdolhossein Sarrafzadeh,et al.  A Color Hand Gesture Database for Evaluating and Improving Algorithms on Hand Gesture and Posture Recognition , 2005 .

[3]  Xia Liu,et al.  Hand gesture recognition using depth data , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[4]  Shigeo Morishima,et al.  Example based skinning with progressively optimized support joints , 2009, SIGGRAPH ASIA '09.

[5]  Shuzhi Sam Ge,et al.  Hand Gesture Recognition and Tracking based on Distributed Locally Linear Embedding , 2006, 2006 IEEE Conference on Robotics, Automation and Mechatronics.

[6]  Jochen Triesch,et al.  Robust classification of hand postures against complex backgrounds , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[7]  Jesús Bescós,et al.  Real-time user independent hand gesture recognition from time-of-flight camera video using static and dynamic models , 2011, Machine Vision and Applications.

[8]  John P. Lewis,et al.  Pose Space Deformation: A Unified Approach to Shape Interpolation and Skeleton-Driven Deformation , 2000, SIGGRAPH.

[9]  Tae-Kyun Kim,et al.  Tensor Canonical Correlation Analysis for Action Classification , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Chung-Lin Huang,et al.  A multi-view vision-based hand motion capturing system , 2011, Pattern Recognit..

[11]  Sébastien Marcel,et al.  Hand gesture recognition using input-output hidden Markov models , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[12]  Joseph J. LaViola Bringing VR and Spatial 3D Interaction to the Masses through Video Games , 2008, IEEE Computer Graphics and Applications.

[13]  Tsukasa Ogasawara,et al.  Model-based hand pose estimation using multiple viewpoint silhouette images and Unscented Kalman Filter , 2008, RO-MAN 2008 - The 17th IEEE International Symposium on Robot and Human Interactive Communication.

[14]  Nobutoshi Yamazaki,et al.  Pose Space Surface Manipulation , 2012, Int. J. Comput. Games Technol..

[15]  Jochen Triesch,et al.  A System for Person-Independent Hand Posture Recognition against Complex Backgrounds , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Tsukasa Ogasawara,et al.  Hand pose estimation using voxel-based individualized hand model , 2009, 2009 IEEE/ASME International Conference on Advanced Intelligent Mechatronics.

[17]  Wei Liang,et al.  Continuous Hand Gesture Recognition in the Learned Hierarchical Latent Variable Space , 2008, AMDO.

[18]  Junsong Yuan,et al.  Robust hand gesture recognition with kinect sensor , 2011, ACM Multimedia.

[19]  Joachim Hornegger,et al.  3-D gesture-based scene navigation in medical imaging applications using Time-of-Flight cameras , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[20]  Joachim Hornegger,et al.  Gesture recognition with a Time-Of-Flight camera , 2008, Int. J. Intell. Syst. Technol. Appl..

[21]  Sébastien Marcel,et al.  Hand posture recognition in a body-face centered space , 1999, CHI Extended Abstracts.

[22]  Mircea Nicolescu,et al.  Vision-based hand pose estimation: A review , 2007, Comput. Vis. Image Underst..