Thai Sign Language Translation Using Fuzzy C-Means and Scale Invariant Feature Transform

Visual communication is important for a deft and/or mute person. It is also one of the tools for the communication between human and machines. In this paper, we develop an automatic Thai finger-spelling sign language translation system using Fuzzy C-Means (FCM) and Scale Invariant Feature Transform (SIFT) algorithms. We collect key frames from several subjects at different times of day and for several days. We also collect testing Thai finger-spelling words video from 4 subjects. The system achieves 79.90% and 51.17% correct alphabet translation and the correct word translation, respectively, with the SIFT threshold of 0.7 and 1 nearest neighbor prototype. However, when we change the number of nearest neighbor prototypes to 3, the system yields 82.19% and 55.08% correct alphabet and correct word translation, respectively, at the same SIFT threshold. These results are comparable with the manually-picked Rframe translation system.

[1]  Chung-Lin Huang,et al.  Hand gesture recognition using a real-time tracking method and hidden Markov models , 2003, Image Vis. Comput..

[2]  T. Kobayashi,et al.  Partly-hidden Markov model and its application to gesture recognition , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Mu-Chun Su,et al.  A static hand gesture recognition system using a composite neural network , 1996, Proceedings of IEEE 5th International Fuzzy Systems.

[4]  Chung-Lin Huang,et al.  Sign language recognition using model-based tracking and a 3D Hopfield neural network , 1998, Machine Vision and Applications.

[5]  Wu Jiang A Hierarchical DGMM Recognizer for Chinese Sign Language Recognition , 2000 .

[6]  Ahmed K. Elmagarmid,et al.  Scene change detection techniques for video database systems , 1998, Multimedia Systems.

[7]  直樹 水野,et al.  各指の曲げ状態に着目した手話の手形状認識(機械力学,計測,自動制御) , 2002 .

[8]  Geoffrey E. Hinton,et al.  Glove-Talk: a neural network interface between a data-glove and a speech synthesizer , 1993, IEEE Trans. Neural Networks.

[9]  Sansanee Auephanwiriyakul,et al.  On fuzzy clustering and content based access to networked video databases , 1998, Proceedings Eighth International Workshop on Research Issues in Data Engineering. Continuous-Media Databases and Applications.

[10]  Stan Sclaroff,et al.  Simultaneous Localization and Recognition of Dynamic Hand Gestures , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[11]  Wu jiangqin,et al.  A simple sign language recognition system based on data glove , 1998, ICSP '98. 1998 Fourth International Conference on Signal Processing (Cat. No.98TH8344).

[12]  Ho-Sub Yoon,et al.  Hand gesture recognition using hidden Markov models , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[13]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[14]  Sansanee Auephanwiriyakul,et al.  Fuzzy shot clustering to support networked video databases , 1998, 1998 IEEE International Conference on Fuzzy Systems Proceedings. IEEE World Congress on Computational Intelligence (Cat. No.98CH36228).

[15]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.