Enhanced dynamic programming approach for subunit modelling to handle segmentation and recognition ambiguities in sign language

Abstract Sign language serves as a primary means of communication among the deaf impaired community. The major challenges faced by the Sign Language Recognition (SLR) system are recognizing signs from large vocabularies in continuous video sequences. In this research paper, a novel subunit sign modelling framework is proposed for vision-based SLR which aims in solving the major issues in SLR systems. The problem of hand segmentation ambiguities and segregating epentheses movements between two adjacent signs in continuous video sequences are addressed. A novel subunit sign modelling framework is presented and illustrated to embark upon these problems while considering large-vocabularies. This framework is developed using a novel methodology of Enhanced Dynamic Programming (EDP) approach in subunit sign modelling. This EDP framework works with a combination of dynamic time warping and spatiotemporal clustering techniques. Since, sign language consists of both spatial and temporal feature vectors, dynamic time warping is used as a distance measure to compute the distance between two adjacent signs in sign trajectories. This distance is used as a temporal feature vector during the clustering of spatial feature vectors using Minimum Entropy Clustering (MEC). This process is done recursively to cluster all the epentheses movements dynamically without using any explicit or implicit modelling. Experimental results have confirmed that the computation cost of the proposed system is less because the epenthesis movements are eliminated before classification and the gesture base space utilized by the sign gestures is very low because the proposed system does not require any modelling to handle epenthesis movements. The results obtained from the proposed subunit sign modelling framework is compared with other existing models in order to prove that the proposed system is best among the existing systems.

[1]  Sergio Escalera,et al.  Challenges in multimodal gesture recognition , 2016, J. Mach. Learn. Res..

[2]  KwangYun Wohn,et al.  Recognition of space-time hand-gestures using hidden Markov model , 1996, VRST.

[3]  Wen Gao,et al.  A Chinese sign language recognition system based on SOFM/SRN/HMM , 2004, Pattern Recognit..

[4]  Wen Gao,et al.  Transition movement models for large vocabulary continuous sign language recognition , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[5]  Olatz Arbelaitz,et al.  An extensive comparative study of cluster validity indices , 2013, Pattern Recognit..

[6]  J. Weisberg,et al.  Neural correlates of fingerspelling, text, and sign processing in deaf American Sign Language–English bilinguals , 2015 .

[7]  Petros Maragos,et al.  Multimodal gesture recognition via multiple hypotheses rescoring , 2015, J. Mach. Learn. Res..

[8]  Alex Pentland,et al.  Real-Time American Sign Language Recognition Using Desk and Wearable Computer Based Video , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  C. Myers,et al.  A level building dynamic time warping algorithm for connected word recognition , 1981 .

[10]  Houqiang Li,et al.  Sign Language Recognition using 3D convolutional neural networks , 2015, 2015 IEEE International Conference on Multimedia and Expo (ICME).

[11]  Anand Singh Jalal,et al.  A survey on manual and non-manual sign language recognition for isolated and continuous sign , 2016, Int. J. Appl. Pattern Recognit..

[12]  Wen Gao,et al.  CSLDS: Chinese sign language dialog system , 2003, 2003 IEEE International SOI Conference. Proceedings (Cat. No.03CH37443).

[13]  R. Elakkiya,et al.  An intelligent framework for recognizing sign language from continuous video sequence using boosted subunits , 2013 .

[14]  Ruiduo Yang,et al.  Handling Movement Epenthesis and Hand Segmentation Ambiguities in Continuous Sign Language Recognition Using Nested Dynamic Programming , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[16]  Xuelong Li,et al.  A-Optimal Projection for Image Representation , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Zhongyuan Lai,et al.  Fingertips detection and hand gesture recognition based on discrete curve evolution with a kinect sensor , 2016, 2016 Visual Communications and Image Processing (VCIP).

[18]  Yang Gao,et al.  Multi-layered gesture recognition with Kinect , 2015, J. Mach. Learn. Res..

[19]  Chin-Hui Lee,et al.  Sign Transition Modeling and a Scalable Solution to Continuous Sign Language Recognition for Real-World Applications , 2016, ACM Trans. Access. Comput..

[20]  Surendra Ranganath,et al.  Towards subject independent continuous sign language recognition: A segment and merge approach , 2014, Pattern Recognit..

[21]  Surendra Ranganath,et al.  Automatic Sign Language Analysis: A Survey and the Future beyond Lexical Meaning , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Vassilis Athitsos,et al.  Sign language recognition using dynamic time warping and hand shape distance based on histogram of oriented gradient features , 2014, PETRA.

[23]  Daniel Kelly,et al.  Recognizing Spatiotemporal Gestures and Movement Epenthesis in Sign Language , 2009, 2009 13th International Machine Vision and Image Processing Conference.

[24]  Haoxiang Wang,et al.  An Effective Image Representation Method Using Kernel Classification , 2014, 2014 IEEE 26th International Conference on Tools with Artificial Intelligence.

[25]  Timothy H. Lucas,et al.  The Virtual Trackpad: An Electromyography-Based, Wireless, Real-Time, Low-Power, Embedded Hand-Gesture-Recognition System Using an Event-Driven Artificial Neural Network , 2017, IEEE Transactions on Circuits and Systems II: Express Briefs.

[26]  Kostas Karpouzis,et al.  Non-manual cues in automatic sign language recognition , 2011, PETRA '11.

[27]  Masaru Takeuchi,et al.  A method for recognizing a sequence of sign language words represented in a Japanese sign language sentence , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[28]  Shih-Chia Huang,et al.  Error-Optimized Sparse Representation for Single Image Rain Removal , 2017, IEEE Transactions on Industrial Electronics.

[29]  Sudeep Sarkar,et al.  Progress in Automated Computer Recognition of Sign Language , 2004, ICCHP.

[30]  Wen Gao,et al.  Recognition of strong and weak connection models in continuous sign language , 2002, Object recognition supported by user interaction for service robots.

[31]  Kaishun Wu,et al.  GRfid: A Device-Free RFID-Based Gesture Recognition System , 2017, IEEE Transactions on Mobile Computing.

[32]  Anupam Agrawal,et al.  Vision based hand gesture recognition for human computer interaction: a survey , 2012, Artificial Intelligence Review.

[33]  Wenzhun Huang,et al.  Two-stage plant species recognition by local mean clustering and Weighted sparse representation classification , 2017, Cluster Computing.

[34]  Aurélien Garivier,et al.  On the Complexity of Best-Arm Identification in Multi-Armed Bandit Models , 2014, J. Mach. Learn. Res..

[35]  Karl-Friedrich Kraiss,et al.  Towards an Automatic Sign Language Recognition System Using Subunits , 2001, Gesture Workshop.

[36]  Dimitris N. Metaxas,et al.  Handshapes and movements: Multiple-channel ASL recognition , 2004 .

[37]  Ming Ouhyoung,et al.  A real-time continuous gesture recognition system for sign language , 1998, Proceedings Third IEEE International Conference on Automatic Face and Gesture Recognition.

[38]  George Awad,et al.  Boosted subunits: a framework for recognising sign language from videos , 2013, IET Image Process..

[39]  R. Elakkiya,et al.  A framework for recognizing and segmenting sign language gestures from continuous video sequence using boosted learning algorithm , 2014, 2014 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT).