A Unified Learning Approach for Hand Gesture Recognition and Fingertip Detection

In human-computer interaction or sign language interpretation, recognizing hand gestures and detecting fingertips become ubiquitous in computer vision research. In this paper, a unified approach of convolutional neural network for both hand gesture recognition and fingertip detection is introduced. The proposed algorithm uses a single network to predict the probabilities of finger class and positions of fingertips in one forward propagation of the network. Instead of directly regressing the positions of fingertips from the fully connected layer, the ensemble of the position of fingertips is regressed from the fully convolutional network. Subsequently, the ensemble average is taken to regress the final position of fingertips. Since the whole pipeline uses a single network, it is significantly fast in computation. The proposed method results in remarkably less pixel error as compared to that in the direct regression approach and it outperforms the existing fingertip detection approaches including the Heatmap-based framework.

[1]  Derry Alamsyah,et al.  Faster R-CNN with Inception V2 for Fingertip Detection in Homogenous Background Image , 2019 .

[2]  Yichao Huang,et al.  DeepFinger: A Cascade Convolutional Neuron Network Approach to Finger Key Point Detection in Egocentric Vision with Mobile Camera , 2015, 2015 IEEE International Conference on Systems, Man, and Cybernetics.

[3]  Debanga Raj Neog,et al.  Fingertip Detection for Hand Pose Recognition , 2012 .

[4]  Ke Liu,et al.  A multi-objective optimization model and its evolution-based solutions for the fingertip localization problem , 2018, Pattern Recognit..

[5]  Dimitrios Hatzinakos,et al.  Static hand gesture recognition using discriminative 2D Zernike moments , 2014, TENCON 2014 - 2014 IEEE Region 10 Conference.

[6]  S. M. Mahbubur Rahman,et al.  Affine transformation of virtual 3D object using 2D localization of fingertips , 2020, Virtual Real. Intell. Hardw..

[7]  Sergio Escalera,et al.  Sign Language Recognition: A Deep Survey , 2021, Expert Syst. Appl..

[8]  Yimin Zhou,et al.  A novel finger and hand pose estimation technique for real-time hand gesture recognition , 2016, Pattern Recognit..

[9]  SeungGwan Lee,et al.  Vision‐Based Finger Action Recognition by Angle Detection and Contour Analysis , 2011 .

[10]  Mei Wang,et al.  Hand Gesture Recognition Based on Fingertip Detection , 2013, 2013 Fourth Global Congress on Intelligent Systems.

[11]  Zhongyuan Lai,et al.  Fingertips detection and hand gesture recognition based on discrete curve evolution with a kinect sensor , 2016, 2016 Visual Communications and Image Processing (VCIP).

[12]  Nitin V. Pujari,et al.  Finger Detection for Sign Language Recognition , 2009 .

[13]  S. Mohamed Mansoor Roomi,et al.  Hand Gesture Recognition for Human-Computer Interaction , 2010 .

[14]  Analysis of the hands in egocentric vision: A survey , 2019, IEEE transactions on pattern analysis and machine intelligence.

[15]  Pai-Wen Ting,et al.  Hand pose estimation in object-interaction based on deep learning for virtual reality applications , 2020, J. Vis. Commun. Image Represent..

[16]  N. Kasthuri,et al.  Gesture recognition and finger tip detection for human computer interaction , 2017, 2017 International Conference on Innovations in Information, Embedded and Communication Systems (ICIIECS).

[17]  Yichao Huang,et al.  A Pointing Gesture Based Egocentric Interaction System: Dataset, Approach and Application , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[18]  Hermann Ney,et al.  Deep Hand: How to Train a CNN on 1 Million Hand Images When Your Data is Continuous and Weakly Labelled , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  David Kim,et al.  MixFab: a mixed-reality environment for personal fabrication , 2014, CHI.

[20]  Purnendu Mishra,et al.  Fingertips Detection in Egocentric Video Frames using Deep Neural Networks , 2019, 2019 International Conference on Image and Vision Computing New Zealand (IVCNZ).

[21]  Wenxiong Kang,et al.  Robust Fingertip Detection in a Complex Environment , 2016, IEEE Transactions on Multimedia.

[22]  Hai Duong Nguyen,et al.  Hand Segmentation and Fingertip Tracking from Depth Camera Images Using Deep Convolutional Neural Network and Multi-task SegNet , 2019, ArXiv.

[23]  Gourav Modanwal,et al.  Towards hand gesture based writing support system for blinds , 2016, Pattern Recognit..

[24]  Soo-Hyung Kim,et al.  A method for fingertips detection using RGB-D image and convolution neural network , 2017, 2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD).

[25]  Mariusz Flasinski,et al.  On the use of graph parsing for recognition of isolated hand postures of Polish Sign Language , 2010, Pattern Recognit..

[26]  Rajiv Ranjan Sahay,et al.  Deep Gesture: Static Hand Gesture Recognition Using CNN , 2016, CVIP.

[27]  Hsien-I Lin,et al.  Human hand gesture recognition using a convolution neural network , 2014, 2014 IEEE International Conference on Automation Science and Engineering (CASE).

[28]  Soumi Paul,et al.  Hand sign recognition from depth images with multi-scale density features for deaf mute persons , 2020 .

[29]  François Bérard,et al.  Bare-hand human-computer interaction , 2001, PUI '01.

[30]  Brandon Garcia,et al.  Real-time American Sign Language Recognition with Convolutional Neural Networks , 2022 .

[31]  Juan José Pantrigo,et al.  Convolutional Neural Networks and Long Short-Term Memory for skeleton-based human activity and hand gesture recognition , 2018, Pattern Recognit..

[32]  Xin Zhang,et al.  YOLSE: Egocentric Fingertip Detection from Single RGB Images , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[33]  Stan Sclaroff,et al.  A Unified Framework for Gesture Recognition and Spatiotemporal Gesture Segmentation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Stefan Winkler,et al.  Vision-based 3D finger interactions for mixed reality games with physics simulation , 2008, VRCAI.

[35]  Ali Farhadi,et al.  YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Lianwen Jin,et al.  Egocentric Fingertip Detection from Single RGB Images , 2022 .

[37]  Mark Sandler,et al.  MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[38]  Junsong Yuan,et al.  Robust Part-Based Hand Gesture Recognition Using Kinect Sensor , 2013, IEEE Transactions on Multimedia.

[39]  Dipak Kumar Ghosh,et al.  Hand gesture recognition using DWT and F-ratio based feature descriptor , 2018, IET Image Process..

[40]  Mohammad Mahmudul Alam,et al.  Detection and Tracking of Fingertips for Geometric Transformation of Objects in Virtual Environment , 2019, 2019 IEEE/ACS 16th International Conference on Computer Systems and Applications (AICCSA).