Reordering: a stepping-stone to perfect Thai Sign generation

We proposed the Sign Code Reordering approach (SCR) for reordering the intermediate sign codes (ISC) to Sign code script (SCS) generation. SCR uses language structure matching techniques to reduce complicated grammar rules, provide efficient results. SCR comprises three steps: extraction, reordering and integration. The distinction between source and target language in both grammar and vocabulary is concerned in each processing step to ensure the accuracy of reordering. SCR focuses on accurate and acceptable reordering that are not conforming to the original structure. SCR was designed to capture linguistic differences such as phrase, sentence and multi-sentence structures, no matter how long the input sentence is. The SCR prototype system was implemented and tested to reorder a number ISCs. The test results have been proved that SCR arranges ISCs successfully. SCR can be augmented into any NLP application which requires ISC arrangement e.g., T3STS. T3STS translates Thai text into Thai Sign language. Thai Sign language is the language of the Deaf in Thailand.

[1]  W. Stokoe Sign Language Structure , 1980 .

[2]  Daniel Marcu,et al.  A Phrase-Based,Joint Probability Model for Statistical Machine Translation , 2002, EMNLP.

[3]  Marine Carpuat,et al.  Word Sense Disambiguation vs. Statistical Machine Translation , 2005, ACL.

[4]  Jason Eisner,et al.  Learning Non-Isomorphic Tree Mappings for Machine Translation , 2003, ACL.

[5]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[6]  Andy Way,et al.  Seeing the wood for the trees: data-oriented translation , 2003, MTSUMMIT.

[7]  Daniel Gildea,et al.  Loosely Tree-Based Alignment for Machine Translation , 2003, ACL.

[8]  Kevin Knight,et al.  A Syntax-based Statistical Translation Model , 2001, ACL.

[9]  David Chiang,et al.  A Hierarchical Phrase-Based Model for Statistical Machine Translation , 2005, ACL.

[10]  Dekai Wu,et al.  Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.

[11]  Daniel Marcu,et al.  Statistical Phrase-Based Translation , 2003, NAACL.

[12]  Michael Carl Inducing Translation Templates for Example-Based Machine Translation , 1999 .

[13]  Srinivas Bangalore,et al.  Learning Dependency Translation Models as Collections of Finite-State Head Transducers , 2000, Computational Linguistics.

[14]  Hermann Ney,et al.  The Alignment Template Approach to Statistical Machine Translation , 2004, CL.

[15]  Fei Xia,et al.  Improving a Statistical MT System with Automatically Learned Rewrite Patterns , 2004, COLING.

[16]  Ralf D. Brown Automated Dictionary Extraction for “Knowledge-Free” Example-Based Translation , 2006 .

[17]  Chris Quirk,et al.  Dependency Treelet Translation: Syntactically Informed Phrasal SMT , 2005, ACL.

[18]  Taro Watanabe,et al.  Reordering Constraints for Phrase-Based Statistical Machine Translation , 2004, COLING.

[19]  Chunyu Kit,et al.  Learning Case-based Knowledge for Disambiguating Chinese Word Segmentation: A Preliminary Study , 2002, SIGHAN@COLING.

[20]  Heidi Fox,et al.  Phrasal Cohesion and Statistical Machine Translation , 2002, EMNLP.