Chinese sign language recognition based on trajectory and hand shape features

Sign language recognition(SLR) is a challenging task due to the diversity of the signs. To tackle the problem, this paper utilize both trajectory features and hand shape features. Since the trajectory features and hand shape features are not in the same domain, it is unreasonable to concatenate them naively or model them with a unified model. To deal with the issue, we adopt Support Vector Machine(SVM) and validation Hidden Markov Models(VHMM), respectively. To depict the direction of the trajectory, we first employ histogram of oriented displacement(HOD) with SVM to SLR. We propose the relative distance features(RDF) by using VHMM to consider the relationship between hands and the other body parts. As for hand shape feature, we explore histogram of oriented gradient(HOG) in local hand regions with VHMM, too. To facilitate late fusion, we normalize the probabilities of different features to the same range and fuse them for the final classification. To demonstrate the effectiveness of our proposed method, we conduct the experiments both in ChaLearn dataset and our self-build Kinect-based Chinese sign language dataset. The results show that our method outperforms the classical methods and some state-of-the-art methods.

[1]  Chao Xie,et al.  Chinese sign language recognition with adaptive HMM , 2016, 2016 IEEE International Conference on Multimedia and Expo (ICME).

[2]  Jovan Popovic,et al.  Real-time hand-tracking with a color glove , 2009, SIGGRAPH '09.

[3]  Luc Van Gool,et al.  Gesture Recognition Portfolios for Personalization , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Changsheng Xu,et al.  Latent support vector machine for sign language recognition with Kinect , 2013, 2013 IEEE International Conference on Image Processing.

[5]  Wen Gao,et al.  Large-Vocabulary Continuous Sign Language Recognition Based on Transition-Movement Models , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[6]  Marwan Torki,et al.  Histogram of Oriented Displacements (HOD): Describing Trajectories of Human Joints for Action Recognition , 2013, IJCAI.

[7]  Ying Wu,et al.  Mining actionlet ensemble for action recognition with depth cameras , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Hanqing Lu,et al.  Fusing multi-modal features for gesture recognition , 2013, ICMI '13.

[9]  Andrew Zisserman,et al.  Domain-Adaptive Discriminative One-Shot Learning of Gestures , 2014, ECCV.

[10]  Tinne Tuytelaars,et al.  Towards sign language recognition based on body parts relations , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[11]  Sergio Escalera,et al.  Multi-modal gesture recognition challenge 2013: dataset and results , 2013, ICMI '13.

[12]  Xilin Chen,et al.  Curve Matching from the View of Manifold for Sign Language Recognition , 2014, ACCV Workshops.

[13]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[14]  Houqiang Li,et al.  Sign Language Recognition Based on Trajectory Modeling with HMMs , 2016, MMM.