Arabic Sign Language Recognition System Using 2D Hands and Body Skeleton Data

This paper presents a novel Arabic Sign Language (ArSL) recognition system, using selected 2D hands and body key points from successive video frames. The system recognizes the recorded video signs, for both signer dependent and signer independent modes, using the concatenation of a 3D CNN skeleton network and a 2D point convolution network. To accomplish this, we built a new ArSL video-based sign database. We will present the detailed methodology of recording the new dataset, which comprises 80 static and dynamic signs that were repeated five times by 40 signers. The signs include Arabic alphabet, numbers, and some daily use signs. To facilitate building an online sign recognition system, we introduce the inverse efficiency score to find a sufficient optimal number of successive frames for the recognition decision, in order to cope with a near real-time automatic ArSL system, where tradeoff between accuracy and speed is crucial to avoid delayed sign classification. For the dependent mode, best results were obtained for dynamic signs with an accuracy of 98.39%, and 88.89% for the static signs, and for the independent mode, we obtained for the dynamic signs an accuracy of 96.69%, and 86.34% for the static signs. When both the static and dynamic signs were mixed and the system trained with all the signs, accuracies of 89.62% and 88.09% were obtained in the signer dependent and signer independent modes respectively.

[1]  Khaled Assaleh,et al.  Telescopic Vector Composition and Polar Accumulated Motion Residuals for Feature Extraction in Arabic Sign Language Recognition , 2007, EURASIP J. Image Video Process..

[2]  Majid Ali Khan,et al.  An Automatic Arabic Sign Language Recognition System based on Deep CNN: An Assistive System for the Deaf and Hard of Hearing , 2020, International Journal of Computing and Digital Systems.

[3]  Ghulam Muhammad,et al.  Development of the Arabic Voice Pathology Database and Its Evaluation by Using Speech Features and Machine Learning Algorithms , 2017, Journal of healthcare engineering.

[4]  Debi Prosad Dogra,et al.  A multimodal framework for sensor based sign language recognition , 2017, Neurocomputing.

[5]  Ghulam Muhammad,et al.  Deep Learning-Based Approach for Sign Language Gesture Recognition With Efficient Hand Gesture Representation , 2020, IEEE Access.

[6]  Yaser Sheikh,et al.  OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Walaa Aly,et al.  Arabic sign language fingerspelling recognition from depth and intensity images , 2016, 2016 12th International Computer Engineering Conference (ICENCO).

[8]  Hesham N. Elmahdy,et al.  Supporting Arabic Sign Language Recognition with Facial Expressions SASLRWFE , 2015, ICIT 2015.

[9]  A. S. Elons GPU implementation for Arabic Sign Language real time recognition using Multi-level Multiplicative Neural Networks , 2014, 2014 9th International Conference on Computer Engineering & Systems (ICCES).

[10]  Vítor H. Carvalho,et al.  Sign language learning using the hangman videogame , 2015, 2015 7th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT).

[11]  Cecilia Sik-Lányi,et al.  Suitability of the Kinect Sensor and Leap Motion Controller—A Literature Review , 2019, Sensors.

[12]  M. Deriche,et al.  Arabie sign language recognition using the Microsoft Kinect , 2016, 2016 13th International Multi-Conference on Systems, Signals & Devices (SSD).

[13]  M. Brysbaert,et al.  Combining speed and accuracy in cognitive psychology: Is the inverse efficiency score (IES) a better dependent variable than the mean reaction time (RT) and the percentage of errors (PE)? , 2011 .

[14]  Melanie Metzger,et al.  Gesture in sign language discourse , 1998 .

[15]  Khaled Assaleh,et al.  Glove-Based Continuous Arabic Sign Language Recognition in User-Dependent Mode , 2015, IEEE Transactions on Human-Machine Systems.

[16]  Bidyut B. Chaudhuri,et al.  A Modified LSTM Model for Continuous Sign Language Recognition Using Leap Motion , 2019, IEEE Sensors Journal.

[17]  Sudeep Sarkar,et al.  Progress in Automated Computer Recognition of Sign Language , 2004, ICCHP.

[18]  Wang Xi,et al.  Deep Learning for Hand Gesture Recognition on Skeletal Data , 2018, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[19]  Zaid Omar,et al.  A review of hand gesture and sign language recognition techniques , 2017, International Journal of Machine Learning and Cybernetics.

[20]  Thad Starner,et al.  American sign language recognition with the kinect , 2011, ICMI '11.

[21]  John H. L. Hansen,et al.  UTD-CRSS Systems for 2018 NIST Speaker Recognition Evaluation , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22]  Mohamed F. Tolba,et al.  A Proposed Hybrid Sensor Architecture for Arabic Sign Language Recognition , 2014, IEEE Conf. on Intelligent Systems.

[23]  Oscar Koller,et al.  Sign Language Transformers: Joint End-to-End Sign Language Recognition and Translation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Mazen M. Selim,et al.  An Automatic Arabic Sign Language Recognition System (ArSLRS) , 2017, J. King Saud Univ. Comput. Inf. Sci..

[25]  M. M. Kamruzzaman,et al.  Arabic Sign Language Recognition and Generating Arabic Speech Using Convolutional Neural Network , 2020, Wirel. Commun. Mob. Comput..

[26]  Hazem Wannous,et al.  Skeleton-Based Dynamic Hand Gesture Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[27]  Saleh Aly,et al.  DeepArSLR: A Novel Signer-Independent Deep Learning Framework for Isolated Arabic Sign Language Gestures Recognition , 2020, IEEE Access.

[28]  A. Hamouda,et al.  Arabic Dynamic Gestures Recognition Using Microsoft Kinect , 2018 .

[29]  Khaled Assaleh,et al.  Multiple Proposals for Continuous Arabic Sign Language Recognition , 2019 .

[30]  Haiqiang Liu,et al.  Hard Sample Mining and Learning for Skeleton-Based Human Action Recognition and Identification , 2019, IEEE Access.

[31]  M ShohiebSamaa,et al.  SignsWorld Atlas; a benchmark Arabic Sign Language database , 2015 .

[32]  M. F. Tolba,et al.  Arabic sign language recognition with 3D convolutional neural networks , 2017, 2017 Eighth International Conference on Intelligent Computing and Information Systems (ICICIS).

[33]  Yichen Wei,et al.  Deep Feature Flow for Video Recognition , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Yaser Saleh,et al.  Arabic Sign Language Recognition through Deep Neural Networks Fine-Tuning , 2020, Int. J. Online Biomed. Eng..

[35]  Bharti Bansal,et al.  Gesture Recognition: A Survey , 2016 .

[36]  Pavlo Molchanov,et al.  Hand gesture recognition with 3D convolutional neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[37]  Hamzah Luqman,et al.  Transform-based Arabic sign language recognition , 2017, ACLING.

[38]  Ghulam Muhammad,et al.  Hand Gesture Recognition for Sign Language Using 3DCNN , 2020, IEEE Access.

[39]  Hana Al-Nuaim,et al.  Recognizing Arabic Sign Language gestures using depth sensors and a KSVM classifier , 2016, 2016 8th Computer Science and Electronic Engineering (CEEC).

[40]  Petros Daras,et al.  A Deep Learning Approach for Analyzing Video and Skeletal Features in Sign Language Recognition , 2018, 2018 IEEE International Conference on Imaging Systems and Techniques (IST).

[41]  Mohamed A. Deriche,et al.  Arabic Sign Language Recognition an Image-Based Approach , 2007, 21st International Conference on Advanced Information Networking and Applications Workshops (AINAW'07).

[42]  Nagwa Lotfy Badr,et al.  Arabic sign language benchmark database for different heterogeneous sensors , 2015, 2015 5th International Conference on Information & Communication Technology and Accessibility (ICTA).

[43]  Hermann Ney,et al.  Benchmark Databases for Video-Based Automatic Sign Language Recognition , 2008, LREC.

[44]  Rafia Mumtaz,et al.  Deaf talk using 3D animated sign language: A sign language interpreter using Microsoft's kinect v2 , 2016, 2016 SAI Computing Conference (SAI).

[45]  Nahla A. Belal,et al.  Arabic Sign Language Alphabet Recognition Based on HOG-PCA Using Microsoft Kinect in Complex Backgrounds , 2016, 2016 IEEE 6th International Conference on Advanced Computing (IACC).

[46]  Luigi Cinque,et al.  Exploiting Recurrent Neural Networks and Leap Motion Controller for the Recognition of Sign Language and Semaphoric Hand Gestures , 2018, IEEE Transactions on Multimedia.

[47]  Ashok Kumar Sahoo,et al.  SIGN LANGUAGE RECOGNITION: STATE OF THE ART , 2014 .

[48]  Yasser El-Sonbaty,et al.  HMM-based Arabic Sign Language Recognition using Kinect , 2015, 2015 Tenth International Conference on Digital Information Management (ICDIM).

[49]  Mansour Alsulaiman,et al.  KSU rich Arabic speech database , 2013 .

[50]  Jaafar Alghazo,et al.  ArASL: Arabic Alphabets Sign Language Dataset , 2019, Data in brief.

[51]  Cheng Zhang,et al.  FingerTrak , 2020, Proc. ACM Interact. Mob. Wearable Ubiquitous Technol..

[52]  Ghulam Muhammad,et al.  Investigation of Voice Pathology Detection and Classification on Different Frequency Regions Using Correlation Functions. , 2017, Journal of voice : official journal of the Voice Foundation.

[53]  S. N. Karishma,et al.  Novel contour based detection and GrabCut segmentation for sign language recognition , 2017, 2017 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET).

[54]  Éric Gaussier,et al.  A Probabilistic Interpretation of Precision, Recall and F-Score, with Implication for Evaluation , 2005, ECIR.

[55]  Basma Hisham,et al.  Arabic Static and Dynamic Gestures Recognition Using Leap Motion , 2017, J. Comput. Sci..