A new framework for sign language alphabet hand posture recognition using geometrical features through artificial neural network (part 1)

Hand pose tracking is essential in sign languages. An automatic recognition of performed hand signs facilitates a number of applications, especially for people with speech impairment to communication with normal people. This framework which is called ASLNN proposes a new hand posture recognition technique for the American sign language alphabet based on the neural network which works on the geometrical feature extraction of hands. A user’s hand is captured by a three-dimensional depth-based sensor camera; consequently, the hand is segmented according to the depth analysis features. The proposed system is called depth-based geometrical sign language recognition as named DGSLR. The DGSLR adopted in easier hand segmentation approach, which is further used in segmentation applications. The proposed geometrical feature extraction framework improves the accuracy of recognition due to unchangeable features against hand orientation compared to discrete cosine transform and moment invariant. The findings of the iterations demonstrate the combination of the extracted features resulted to improved accuracy rates. Then, an artificial neural network is used to drive desired outcomes. ASLNN is proficient to hand posture recognition and provides accuracy up to 96.78% which will be discussed on the additional paper of this authors in this journal.

[1]  Tomaso Poggio,et al.  Everything old is new again: a fresh look at historical approaches in machine learning , 2002 .

[2]  David Ricardo Cruz,et al.  Novel Nonlinear Hypothesis for the Delta Parallel Robot Modeling , 2020, IEEE Access.

[3]  Nooritawati Md Tahir,et al.  Review in Sign Language Recognition Systems , 2012, 2012 IEEE Symposium on Computers & Informatics (ISCI).

[4]  Joshua Ryan New,et al.  A Method for Hand Gesture Recognition , 2002 .

[5]  Thorsten Joachims,et al.  Contextually guided semantic labeling and search for three-dimensional point clouds , 2013, Int. J. Robotics Res..

[6]  Thomas G. Dietterich,et al.  Solving Multiclass Learning Problems via Error-Correcting Output Codes , 1994, J. Artif. Intell. Res..

[7]  David Zhang,et al.  Fusion of phase and orientation information for palmprint authentication , 2005, IEEE International Conference on Image Processing 2005.

[8]  Antonis A. Argyros,et al.  Efficient model-based 3D tracking of hand articulations using Kinect , 2011, BMVC.

[9]  Lei Wang,et al.  Video Object Segmentation by Fusion of Spatio-Temporal Information Based on Gaussian Mixture Model , 2011 .

[10]  Chris Gordon,et al.  Analysis of XBOX Kinect sensor data for use on construction sites: Depth accuracy and sensor interference assessment , 2012 .

[11]  Martin Mozina,et al.  Orange: data mining toolbox in python , 2013, J. Mach. Learn. Res..

[12]  Miguel Figueroa,et al.  Competitive learning with floating-gate circuits , 2002, IEEE Trans. Neural Networks.

[13]  S. Sathiya Keerthi,et al.  Which Is the Best Multiclass SVM Method? An Empirical Study , 2005, Multiple Classifier Systems.

[14]  Luca Zanni,et al.  Parallel Software for Training Large Scale Support Vector Machines on Multiprocessor Systems , 2006, J. Mach. Learn. Res..

[15]  Tan Tian Swee,et al.  Malay Sign Language Gesture Recognition system , 2007, 2007 International Conference on Intelligent and Advanced Systems.

[16]  Hui Lin,et al.  Depth image enhancement for Kinect using region growing and bilateral filter , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[17]  Kwok-Wing Chau,et al.  A Survey of Deep Learning Techniques: Application in Wind and Solar Energy Resources , 2019, IEEE Access.

[18]  David M. W. Powers,et al.  Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation , 2011, ArXiv.

[19]  Nicolas Pugeault,et al.  Spelling it out: Real-time ASL fingerspelling recognition , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[20]  P. V. V. Kishore,et al.  4-Camera model for sign language recognition using elliptical fourier descriptors and ANN , 2015, 2015 International Conference on Signal Processing and Communication Engineering Systems.

[21]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[22]  Wei Xiong,et al.  Model-guided deformable hand shape recognition without positioning aids , 2005, Pattern Recognit..

[23]  Aly A. Farag,et al.  SHREC'13 Track: Retrieval of Objects Captured with Low-Cost Depth-Sensing Cameras , 2013, 3DOR@Eurographics.

[24]  P. V. V. Kishore,et al.  Segment, Track, Extract, Recognize and Convert Sign Language Videos to Voice/Text , 2012 .

[25]  Enrique Garcia,et al.  Hessian with Mini-Batches for Electrical Demand Prediction , 2020, Applied Sciences.

[26]  Chih-Jen Lin,et al.  A formal analysis of stopping criteria of decomposition methods for support vector machines , 2002, IEEE Trans. Neural Networks.

[27]  Bülent Sankur,et al.  Shape-based hand recognition , 2006, IEEE Transactions on Image Processing.

[28]  Ramesh Raskar,et al.  3D Depth Cameras in Vision: Benefits and Limitations of the Hardware , 2014 .

[29]  Michael I. Jordan,et al.  Advances in Neural Information Processing Systems 30 , 1995 .

[30]  Elena Mugellini,et al.  Context-Aware 3D Gesture Interaction Based on Multiple Kinects , 2011 .

[31]  Sanjeev Sofat,et al.  Vision Based Hand Gesture Recognition , 2009 .

[32]  Peter James Vial,et al.  Australian Sign Language Recognition Using Moment Invariants , 2013, ICIC.

[33]  Miguel A. Ferrer,et al.  Automatic biometric identification system by hand geometry , 2003, IEEE 37th Annual 2003 International Carnahan Conference onSecurity Technology, 2003. Proceedings..

[34]  Shahaboddin Shamshirband,et al.  Application of ANNs, ANFIS and RSM to estimating and optimizing the parameters that affect the yield and cost of biodiesel production , 2018 .

[35]  Gavin C. Cawley,et al.  Preventing Over-Fitting during Model Selection via Bayesian Regularisation of the Hyper-Parameters , 2007, J. Mach. Learn. Res..

[36]  Kai Oliver Arras,et al.  People tracking in RGB-D data with on-line boosted target models , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[37]  Gregg D. Wilensky,et al.  Neural Network Studies , 1993 .

[38]  Chih-Jen Lin,et al.  A comparison of methods for multiclass support vector machines , 2002, IEEE Trans. Neural Networks.

[39]  Loris Nanni,et al.  Ensemble to improve gesture recognition , 2014 .

[40]  Khaled Assaleh,et al.  Vision-based system for continuous Arabic Sign Language recognition in user dependent mode , 2008, 2008 5th International Symposium on Mechatronics and Its Applications.

[41]  N. Atzpadin,et al.  Depth map creation and image-based rendering for advanced 3DTV services providing interoperability and scalability , 2007, Signal Process. Image Commun..

[42]  Junfei Qiao,et al.  Nonlinear Systems Modeling Based on Self-Organizing Fuzzy-Neural-Network With Adaptive Computation Algorithm , 2014, IEEE Transactions on Cybernetics.

[43]  José de Jesús Rubio,et al.  SOFMLS: Online Self-Organizing Fuzzy Modified Least-Squares Network , 2009, IEEE Transactions on Fuzzy Systems.

[44]  Anupam Agrawal,et al.  Vision based hand gesture recognition for human computer interaction: a survey , 2012, Artificial Intelligence Review.

[45]  LimHyotaek,et al.  Hand tracking and gesture recognition system for human-computer interaction using low-cost hardware , 2015 .

[46]  Anil K. Jain,et al.  Matching of palmprints , 2002, Pattern Recognit. Lett..

[47]  Sazali Yaacob,et al.  Gesture recognition system for Kod Tangan Bahasa Melayu (KTBM) using neural network , 2009, 2009 5th International Colloquium on Signal Processing & Its Applications.

[48]  Nello Cristianini,et al.  Large Margin DAGs for Multiclass Classification , 1999, NIPS.

[49]  In-So Kweon,et al.  Adaptive Support-Weight Approach for Correspondence Search , 2006, IEEE Trans. Pattern Anal. Mach. Intell..

[50]  Igor V. Tetko,et al.  Neural network studies, 1. Comparison of overfitting and overtraining , 1995, J. Chem. Inf. Comput. Sci..

[51]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[52]  Lei Wei,et al.  Low cost multimodal facial recognition via kinect sensors , 2012 .

[53]  Gérard Dreyfus,et al.  Single-layer learning revisited: a stepwise procedure for building and training a neural network , 1989, NATO Neurocomputing.

[54]  P. V. V. Kishore,et al.  A Video Based Indian Sign Language Recognition System (INSLR) Using Wavelet Transform and Fuzzy Logic , 2012 .

[55]  Alex Pentland,et al.  Real-time American Sign Language recognition from video using hidden Markov models , 1995 .

[56]  Henry Fuchs,et al.  Reducing interference between multiple structured light depth sensors using motion , 2012, 2012 IEEE Virtual Reality Workshops (VRW).

[57]  Sergiu Nedevschi,et al.  Real-time dynamic environment perception in driving scenarios using difference fronts , 2012, 2012 IEEE Intelligent Vehicles Symposium.

[58]  Geoffrey J. McLachlan,et al.  Analyzing Microarray Gene Expression Data , 2004 .

[59]  Wen Gao,et al.  Large-Vocabulary Continuous Sign Language Recognition Based on Transition-Movement Models , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[60]  Daniel Thalmann,et al.  Parsing the Hand in Depth Images , 2014, IEEE Transactions on Multimedia.

[61]  Bodo Rosenhahn,et al.  Real-Time Sign Language Recognition Using a Consumer Depth Camera , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[62]  Wen Gao,et al.  A Real-Time Large Vocabulary Recognition System for Chinese Sign Language , 2001, Gesture Workshop.

[63]  Guang Li,et al.  Sign Language Recognition and Translation with Kinect , 2013 .

[64]  Sumit Kumar,et al.  Recognition of Single Handed Sign Language Gestures using Contour Tracing Descriptor , 2022 .

[65]  Ana González-Marcos,et al.  Biometric Identification through Hand Geometry Measurements , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[66]  Koby Crammer,et al.  On the Algorithmic Implementation of Multiclass Kernel-based Vector Machines , 2002, J. Mach. Learn. Res..