Depth matrix and adaptive Bayes classifier based dynamic hand gesture recognition

Abstract A sequence of apparently ad-hoc hand postures can generate meaningful dynamic gestures which can be utilized in interface controls for computer, television, or games. In order to develop deployable systems with these gestures, selected descriptors should be fast enough to meet the live recognition requirements. This paper proposes framework for a practical system capable of recognizing continuous dynamic gestures characterized by short-duration posture sequences. A depth-based modification to the shape matrix is devised to describe hand silhouettes, which gives a faster alternative to region-based descriptors. Postures are recognized using depth matrix and 1-nearest neighbor strategy. Posture sequence labels are predicted by a dynamic naive Bayes classifier which works in association with an adaptive windowing mechanism. The conducted experiments report up to 96.2% accurate results with mean accuracy of 95.2% on dynamic gesture dataset. Depth matrix computation takes a maximum of 2ms time.

[1]  Tae-Kyun Kim,et al.  Real-Time Articulated Hand Pose Estimation Using Semi-supervised Transductive Regression Forests , 2013, 2013 IEEE International Conference on Computer Vision.

[2]  ZAFAR AHMED ANSARI,et al.  Nearest neighbour classification of Indian sign language gestures using kinect camera , 2016 .

[3]  Yingli Tian,et al.  Histogram of 3D Facets: A depth descriptor for human action and hand gesture recognition , 2015, Comput. Vis. Image Underst..

[4]  A. Ardeshir Goshtasby,et al.  Description and Discrimination of Planar Shapes Using Shape Matrices , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Cordelia Schmid,et al.  A Spatio-Temporal Descriptor Based on 3D-Gradients , 2008, BMVC.

[6]  Li-Chen Fu,et al.  Learning a deep network with spherical part model for 3D hand pose estimation , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Ying Wu,et al.  Robust 3D Action Recognition with Random Occupancy Patterns , 2012, ECCV.

[8]  Zicheng Liu,et al.  HON4D: Histogram of Oriented 4D Normals for Activity Recognition from Depth Sequences , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Riccardo Leonardi,et al.  XKin: an open source framework for hand pose and gesture recognition using kinect , 2014, The Visual Computer.

[10]  Arif Mahmood,et al.  Real time action recognition using histograms of depth gradients and random decision forests , 2014, IEEE Winter Conference on Applications of Computer Vision.

[11]  Nicolas Pugeault,et al.  Spelling it out: Real-time ASL fingerspelling recognition , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[12]  Anbumani Subramanian,et al.  Dynamic Hand Pose Recognition Using Depth Data , 2010, 2010 20th International Conference on Pattern Recognition.

[13]  Stephan Liwicki,et al.  Automatic recognition of fingerspelled words in British Sign Language , 2009, 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[14]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15]  Jesús Bescós,et al.  Real-time user independent hand gesture recognition from time-of-flight camera video using static and dynamic models , 2011, Machine Vision and Applications.

[16]  Joachim Hornegger,et al.  3-D gesture-based scene navigation in medical imaging applications using Time-of-Flight cameras , 2008, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[17]  Mauro Donadeo,et al.  Combining multiple depth-based descriptors for hand gesture recognition , 2014, Pattern Recognit. Lett..

[18]  Lale Akarun,et al.  Hierarchically constrained 3D hand pose estimation using regression forests from single frame depth data , 2014, Pattern Recognit. Lett..

[19]  Junsong Yuan,et al.  Robust Part-Based Hand Gesture Recognition Using Kinect Sensor , 2013, IEEE Transactions on Multimedia.

[20]  Guillermo Cámara Chávez,et al.  Finger Spelling Recognition from RGB-D Information Using Kernel Descriptor , 2013, SIBGRAPI.