Statistical Machine Translation for Greek to Greek Sign Language Using Parallel Corpora Produced via Rule-Based Machine Translation

One of the objectives of Assistive Technologies is to help people with disabilities communicate with others and provide means of access to information. As an aid to Deaf people, we present in this work a novel prototype Rule-Based Machine Translation (RBMT) system for the creation of large quality written Greek text to Greek Sign Language (GSL) glossed corpora. In particular, the proposed RBMT system supports the professional translator of GSL to produce high quality parallel Greek text GSL glossed corpus, which is then used as training data by the Statistical Machine Translation (SMT) MOSES [1] application system. It should be noted that the whole process is robust and flexible, since it does not demand deep grammar knowledge of GSL. With this work we manage to overcome the two biggest obstacles in Natural Processing Language (NLP) of GSL. Firstly, the lack of written system and secondly the lack of grammar and finally we have been able to lay the foundations for an autonomous translation system of Greek text to GSL. Evaluation of the proposed scheme is carried out in the weather reports domain, where 20,284 tokens and 1,000 sentences have been produced. By using the BiLingual Evaluation Understudy (BLEU) metric score, our prototyped MT system achieves a relative average score of 60.53% and 85.1%/65.5%/53.8%/44.8% for for 1-gram/2gram/3-gram/4-gram evaluation.

[1]  C Neidle,et al.  SignStream: A tool for linguistic and computer vision research on visual-gestural language data , 2001, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[2]  Harold L. Somers,et al.  An introduction to machine translation , 1992 .

[3]  Mitchell P. Marcus,et al.  Generating american sign language classifier predicates for english-to-asl machine translation , 2006 .

[4]  W. Stokoe,et al.  Sign language structure: an outline of the visual communication systems of the American deaf. 1960. , 1961, Journal of deaf studies and deaf education.

[5]  Andy Way,et al.  Manual labour: tackling machine translation for sign languages , 2013, Machine Translation.

[6]  Sandra Baldassarri,et al.  An Automatic Rule-Based Translation System to Spanish Sign Language (LSE) , 2007, Interacción.

[7]  Sara Morrissey Assessing Three Representation Methods for Sign Language Machine Translation and Evaluation , 2011 .

[8]  Chris Callison-Burch,et al.  Open Source Toolkit for Statistical Machine Translation: Factored Translation Models and Lattice Decoding , 2006 .

[9]  Sara Morrissey,et al.  Data-Driven Machine Translation for Sign Languages , 2008 .

[10]  Salim Roukos,et al.  Bleu: a Method for Automatic Evaluation of Machine Translation , 2002, ACL.

[11]  Ευίτα-Σταυρούλα Φωτεινέα,et al.  Generating linguistic content for Greek to GSL conversion , 2005 .

[12]  José Colás Pasamontes,et al.  A rule-based translation from written Spanish to Spanish Sign Language glosses , 2014, Comput. Speech Lang..

[13]  Bob Carpenter,et al.  The logic of typed feature structures , 1992 .

[14]  Philipp Koehn,et al.  Pharaoh: A Beam Search Decoder for Phrase-Based Statistical Machine Translation Models , 2004, AMTA.

[15]  R. Elliott,et al.  An Overview of the SiGML Notation and SiGMLSigning Software System , 2004 .

[16]  Ludek Müller,et al.  Automatic Czech - Sign Speech Translation , 2007, TSD.

[17]  Stavroula-Evita Fotinea,et al.  From grammar-based MT to post-processed SL representations , 2016, Universal Access in the Information Society.

[18]  Mark Wells,et al.  Virtual signing: capture, animation, storage and transmission-an overview of the ViSiCAST project , 2000 .

[19]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[20]  Hermann Ney,et al.  Phrase-Based Statistical Machine Translation , 2002, KI.

[21]  Ian Marshall,et al.  Sign Language Translation via DRT and HPSG , 2002, CICLing.

[22]  Lluís Padró,et al.  FreeLing 1.3: Syntactic and semantic services in an open-source NLP library , 2006, LREC.

[23]  Andy Way,et al.  MATREX: DCU machine translation system for IWSLT 2006. , 2006, IWSLT.

[24]  Bob Carpenter Logic of Typed Feature Structures, The (Cambridge Tracts in Theoretical Computer Science) , 2005 .

[25]  Amy Weinberg,et al.  Sign language transcription at the level of meaning components: The Berkeley Transcription System (BTS) , 2002 .

[26]  SystemBritta,et al.  Towards an Automatic Sign Language Translation , 1999 .

[27]  Catherine N. Ball,et al.  Representation of american sign language for machine translation , 2002 .

[28]  Francis M. Tyers,et al.  Apertium: a free/open-source platform for rule-based machine translation , 2011, Machine Translation.

[29]  Pierre Isabelle,et al.  Machine Translation: Overview , 2006 .

[30]  Dan I. Slobin,et al.  Transcription as a tool for understanding: The Berkeley Transcription System for sign language research (BTS) , 2002 .

[31]  Tony Veale,et al.  Cross modal comprehension in ZARDOZ an English to sign-language translation system , 1994 .

[32]  Hermann Ney,et al.  Sign language machine translation overkill , 2010, IWSLT.

[33]  Stavroula-Evita Fotinea,et al.  A prototype Greek text to Greek Sign Language conversion system , 2010, Behav. Inf. Technol..

[34]  Stavroula-Evita Fotinea,et al.  GSLC: Creation and Annotation of a Greek Sign Language Corpus for HCI , 2007, HCI.

[35]  Norman I. Badler,et al.  A machine translation system from English to American Sign Language , 2000, AMTA.

[36]  E. Klima The signs of language , 1979 .

[37]  Petros Maragos,et al.  Sign Language technologies and resources of the Dicta-Sign project , 2012 .

[38]  Booncharoen Sirinaovakul,et al.  Intelligent Thai text - Thai sign translation for language learning , 2008, Comput. Educ..

[39]  Arturo Trujillo Transfer Machine Translation , 1999 .