A Computer-Assisted Translation and Writing System

We introduce a method for learning to predict text and grammatical constructions in a computer-assisted translation and writing framework. In our approach, predictions are offered on the fly to help the user make appropriate lexical and grammatical choices while translating a source text, thus improving translation quality and productivity. The method involves automatically generating general-to-specific word usage summaries (the writing suggestion module) and automatically learning high-confidence word- or phrase-level translation equivalents (the translation suggestion module). At runtime, the source text and the translation prefix entered by the user are broken down into n-grams to generate grammar and translation predictions, which are then combined and ranked using translation and language models. The ranked prediction candidates are displayed to the user iteratively and interactively in a pop-up menu as translation or writing hints. We present a prototype writing assistant, TransAhead, that applies the method in a human-computer collaborative environment. Automatic and human evaluations show that novice translators and language learners benefit substantially from our system in terms of translation performance (i.e., translation accuracy and productivity) and language learning (i.e., collocation usage and grammar). In general, our methodology of inline grammar and text predictions has great potential in computer-assisted translation, writing, and even language learning.
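The runtime step described above (n-gram breakdown, candidate generation, then combined ranking) can be illustrated with a minimal sketch. It is not the TransAhead implementation: the function names, the toy tables, the product-of-scores combination, and the `floor` smoothing value are all assumptions made for illustration, with plain dictionaries standing in for the learned translation equivalents and word usage summaries.

```python
from typing import Dict, List, Tuple

NGram = Tuple[str, ...]
Table = Dict[NGram, Dict[str, float]]


def ngrams(tokens: List[str], max_n: int = 3) -> List[NGram]:
    """All contiguous n-grams of length 1..max_n."""
    return [
        tuple(tokens[i:i + n])
        for n in range(1, max_n + 1)
        for i in range(len(tokens) - n + 1)
    ]


def suffix_ngrams(tokens: List[str], max_n: int = 3) -> List[NGram]:
    """N-grams ending at the last token, i.e. at the user's cursor (assumption)."""
    return [tuple(tokens[-n:]) for n in range(1, min(max_n, len(tokens)) + 1)]


def rank_candidates(
    source: List[str],
    prefix: List[str],
    translation_table: Table,   # hypothetical: source n-gram -> {target phrase: score}
    continuations: Table,       # hypothetical: prefix n-gram -> {next phrase: score}
    floor: float = 0.01,        # assumed score for a candidate missing from one model
    max_n: int = 3,
) -> List[Tuple[str, float]]:
    # Translation predictions: look up every source n-gram.
    tm: Dict[str, float] = {}
    for gram in ngrams(source, max_n):
        for cand, score in translation_table.get(gram, {}).items():
            tm[cand] = max(tm.get(cand, 0.0), score)

    # Writing predictions: look up the n-grams that end the typed prefix.
    lm: Dict[str, float] = {}
    for gram in suffix_ngrams(prefix, max_n):
        for cand, score in continuations.get(gram, {}).items():
            lm[cand] = max(lm.get(cand, 0.0), score)

    # Combine both sources with a simple product and rank, best first.
    scored = [(c, tm.get(c, floor) * lm.get(c, floor)) for c in set(tm) | set(lm)]
    return sorted(scored, key=lambda item: (-item[1], item[0]))


# Toy data, invented for this example only.
SOURCE = ["扮演", "重要", "角色"]
PREFIX = ["we", "play"]
TRANSLATION_TABLE: Table = {
    ("重要",): {"important": 0.6, "significant": 0.3},
    ("角色",): {"role": 0.8},
    ("重要", "角色"): {"an important role": 0.7},
}
CONTINUATIONS: Table = {
    ("play",): {"an important role": 0.5, "a role": 0.3, "important": 0.05},
}

if __name__ == "__main__":
    for candidate, score in rank_candidates(SOURCE, PREFIX, TRANSLATION_TABLE, CONTINUATIONS):
        print(f"{score:.3f}  {candidate}")
```

In this toy setup the phrase supported by both the translation lookup and the prefix continuation ranks first, which is the behaviour the pop-up menu relies on. A real system would replace the product with properly estimated translation and language model scores.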
