A user independent hand gesture recognition system using deep CNN feature fusion and machine learning technique

Abstract Hand gesture recognition is one of the active research areas in the field of human-computer interface due to its flexibility and user friendliness. The gesture recognition technique is used to develop a system that can be used to convey information among disabled people or for controlling a device. Major challenges for the development of an efficient hand gesture recognition technique are illumination variation, nonuniform backgrounds, diversities in the size and shape of a user's hand, and high interclass similarities between hand gesture poses. In this chapter, a user independent static hand gesture recognition technique is analyzed using handcrafted features such as a histogram of oriented gradients (HOG) and a deep convolutional neural network (CNN). The deep features are extracted from fully connected layers of two different well-known pretrained CNNs such as AlexNet and VGG-16. The fusion of feature vectors extracted from different fully connected layers of both the CNNs is proposed for the enhancement of gesture recognition accuracy. The proposed CNN-based feature does not require any hand segmentation or background subtraction technique to segment the hand region from the input image. A support vector machine as a machine learning algorithm is used to classify the gesture poses. A comparison of the HOG feature and the deep CNN feature is presented for the recognition of static hand gesture poses. The proposed deep CNN feature fusion-based hand gesture recognition is robust to illumination variation, nonuniform backgrounds, and interclass similarities. The performance of the proposed technique is evaluated using leave-one-subject-out cross-validation (LOO CV) and holdout CV tests. The extensive analysis is performed on three benchmark static hand gesture datasets with uniform and nonuniform backgrounds on both the CV tests. A significant improvement of user independent static hand gesture recognition performance using the LOO CV test is found using the proposed technique. The experimental results show that the proposed technique is superior as compared to individual deep CNN features and state-of-the-art techniques. A real-time application of the gesture recognition system is developed and tested using the proposed technique.

[1]  Ai Poh Loh,et al.  Attention Based Detection and Recognition of Hand Postures Against Complex Backgrounds , 2012, International Journal of Computer Vision.

[2]  Jochen Triesch,et al.  Classification of hand postures against complex backgrounds using elastic graph matching , 2002, Image Vis. Comput..

[3]  Donq-Liang Lee,et al.  Recognition of complex static hand gestures by using the wristband-based contour features , 2018, IET Image Process..

[4]  Sérgio F. Chevtchenko,et al.  Multi-objective optimization for hand posture recognition , 2018, Expert Syst. Appl..

[5]  Wei Hu,et al.  Automatic user state recognition for hand gesture based low-cost television control system , 2014, IEEE Transactions on Consumer Electronics.

[6]  Filipe R. Cordeiro,et al.  A convolutional neural network with feature fusion for real-time hand posture recognition , 2018, Appl. Soft Comput..

[7]  Patrick Haffner,et al.  Support vector machines for histogram-based image classification , 1999, IEEE Trans. Neural Networks.

[8]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[9]  Haitham Badi Recent methods in vision-based hand gesture recognition , 2016, International Journal of Data Science and Analytics.

[10]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[11]  Wen-Yuan Chen,et al.  Hand gesture recognition using valley circle feature and Hu’s moments technique for robot movement control , 2016 .

[12]  Dipak Kumar Ghosh,et al.  Hand gesture recognition using DWT and F-ratio based feature descriptor , 2018, IET Image Process..

[13]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[14]  Ankit Chaudhary,et al.  Light invariant real-time robust hand gesture recognition , 2017 .

[15]  Napoleon H. Reyes,et al.  A New 2D Static Hand Gesture Colour Image Dataset for ASL Gestures , 2011 .

[16]  Mohan M. Trivedi,et al.  Hand Gesture Recognition in Real Time for Automotive Interfaces: A Multimodal Vision-Based Approach and Evaluations , 2014, IEEE Transactions on Intelligent Transportation Systems.

[17]  Dipak Kumar Ghosh,et al.  On an algorithm for Vision-based hand gesture recognition , 2016, Signal Image Video Process..

[18]  Martin Saerbeck,et al.  Recent methods and databases in vision-based hand gesture recognition: A review , 2015, Comput. Vis. Image Underst..

[19]  Narciso García,et al.  Human-computer interaction based on visual hand-gesture recognition using volumetric spatiograms of local binary patterns , 2015, Comput. Vis. Image Underst..

[20]  Gourav Modanwal,et al.  Towards hand gesture based writing support system for blinds , 2016, Pattern Recognit..