Joint Optimization of Word Alignment and Epenthesis Generation for Chinese to Taiwanese Sign Synthesis

This work proposes a novel approach to translate Chinese to Taiwanese sign language and to synthesize sign videos. An aligned bilingual corpus of Chinese and Taiwanese sign language (TSL) with linguistic and signing information is also presented for sign language translation. A two-pass alignment in syntax level and phrase level is developed to obtain the optimal alignment between Chinese sentences and Taiwanese sign sequences. For sign video synthesis, a scoring function is presented to develop motion transition-balanced sign videos with rich combinations of intersign transitions. Finally, the maximum a posteriori (MAP) algorithm is employed for sign video synthesis based on joint optimization of two-pass word alignment and intersign epenthesis generation. Several experiments are conducted in an educational environment to evaluate the performance on the comprehension of sign expression. The proposed approach outperforms the IBM Model2 in sign language translation. Moreover, deaf students perceived sign videos generated by the proposed method to be satisfactory

[1]  Richard A. Foulds,et al.  A parametric approach to sign language synthesis , 2005, Assets '05.

[2]  Carl Brown,et al.  Assistive technology computers and persons with disabilities , 1992, CACM.

[3]  Okan Arikan,et al.  Interactive motion generation from examples , 2002, ACM Trans. Graph..

[4]  Hermann Ney,et al.  Algorithms for statistical translation of spoken language , 2000, IEEE Trans. Speech Audio Process..

[5]  Wen Gao,et al.  CSLDS: Chinese sign language dialog system , 2003, 2003 IEEE International SOI Conference. Proceedings (Cat. No.03CH37443).

[6]  Christoph Bregler,et al.  Video Rewrite: Driving Visual Speech with Audio , 1997, SIGGRAPH.

[7]  Richard Kennaway,et al.  Synthetic Animation of Deaf Signing Gestures , 2001, Gesture Workshop.

[8]  Chung-Hsien Wu,et al.  Error-Tolerant Sign Retrieval Using Visual Features and Maximum A Posteriori Estimation , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Chung-Hsien Wu,et al.  Text generation from Taiwanese Sign Language using a PST-based language model for augmentative communication. , 2004, IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society.

[10]  Surendra Ranganath,et al.  Automatic Sign Language Analysis: A Survey and the Future beyond Lexical Meaning , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Dimitris N. Metaxas,et al.  A Framework for Recognizing the Simultaneous Aspects of American Sign Language , 2001, Comput. Vis. Image Underst..

[12]  Jessica K. Hodgins,et al.  Interactive control of avatars animated with human motion data , 2002, SIGGRAPH.

[13]  Lucas Kovar,et al.  Motion graphs , 2002, SIGGRAPH Classes.

[14]  H F Chen,et al.  A fuzzy rule-based approach to recognizing 3-D arm movements. , 2001, IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society.

[15]  Harry Shum,et al.  Motion texture: a two-level statistical model for character motion synthesis , 2002, ACM Trans. Graph..

[16]  Angus B. Grieve-Smith,et al.  SignSynth: A Sign Language Synthesis Application Using Web3D and Perl , 2001, Gesture Workshop.

[17]  Tomaso Poggio,et al.  Learning to see , 1996 .

[18]  Tomaso Poggio,et al.  Trainable Videorealistic Speech Animation , 2004, FGR.

[19]  Eddie Kohler,et al.  Real-time speech motion synthesis from recorded motions , 2004, SCA '04.

[20]  Sang-Woon Kim,et al.  On intelligent avatar communication using Korean, Chinese and Japanese sign-languages: an overview , 2004, ICARCV 2004 8th Control, Automation, Robotics and Vision Conference, 2004..

[21]  Helen Arvidson,et al.  Augmentative and Alternative Communication: A Handbook of Principles and Practices , 1997 .

[22]  Catherine N. Ball,et al.  Representation of american sign language for machine translation , 2002 .

[23]  Robyn A. Owens,et al.  An effective sign language display system , 2005, Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005..

[24]  Dimitris N. Metaxas,et al.  Toward Scalability in ASL Recognition: Breaking Down Signs into Phonemes , 1999, Gesture Workshop.

[25]  Wu Chou,et al.  Pattern Recognition in Speech and Language Processing , 2002 .

[26]  M. R. Mickey,et al.  Estimation of Error Rates in Discriminant Analysis , 1968 .

[27]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[28]  Angélica de Antonio Jiménez,et al.  Teaching Communication Skills to Hearing-Impaired Children With An Intelligent Multimedia System , 1995, IEEE Multim..

[29]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[30]  S. Shott,et al.  Statistics for Health Professionals , 1990 .

[31]  Franc Solina,et al.  Synthesis of the sign language of the deaf from the sign video clips , 1999 .

[32]  Alex Pentland,et al.  Real-Time American Sign Language Recognition Using Desk and Wearable Computer Based Video , 1998, IEEE Trans. Pattern Anal. Mach. Intell..