Finite-State Models for Computer Assisted Translation

Current methodologies for automatic translation cannot be expected to produce high quality translations. However, some techniques based on these methodologies can increase the productivity of human translators. The basis of one of these methodologies are finite-state transducers, which are adequate models for computer assisted translation. These models have proved its efficiency in many pattern recognition and artificial intelligence tasks such as speech recognition, handwriting recognition and machine translation for specific domains. These finite-state models present some advantages. On the one hand, finite-state models can be learnt from bilingual corpus to infer transducers. On the other hand, there are well-known and efficient algorithms to perform the parse of the best translation according to these models (e.g. Viterbi search). In this paper, the concept of interactive search will be introduced along with some efficient techniques that solve the problem of producing a translation given a sentence in the source language and a prefix (from the output sentence) typed by the user. Needless to say that this system must run under real-time constraints to be useful for human translators. This approach has been tested on a corpus of printer manuals and the first results reflect that human translators would only need to type the 25% of the characters of the whole translated text, increasing in this way their throughput and reducing their effort.

[1]  Srinivas Bangalore,et al.  A Finite-State Approach to Machine Translation , 2001, NAACL.

[2]  Hermann Ney,et al.  Speech-to-speech translation based on finite-state transducers , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[3]  Yaser Al-Onaizan,et al.  Translation with Finite-State Devices , 1998, AMTA.

[4]  Christoph Tillmann,et al.  Word re-ordering and dynamic programming based search algorithm for statistical machine translation , 2002 .

[5]  George F. Foster,et al.  Unit Completion for a Computer-aided Translation Typing System , 2004, Machine Translation.

[6]  Francisco Casacuberta,et al.  TransType2. Un sistema de ayuda a la traducción , 2003, Proces. del Leng. Natural.

[7]  Hermann Ney,et al.  Some approaches to statistical and finite-state speech-to-speech translation , 2004, Comput. Speech Lang..

[8]  John Cocke,et al.  A Statistical Approach to Machine Translation , 1990, CL.

[9]  Francisco Casacuberta Inference of Finite-State Transducers by Using Regular Grammars and Morphisms , 2000, ICGI.

[10]  Van Nostrand,et al.  Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm , 1967 .

[11]  Francisco Casacuberta,et al.  Some Statistical-Estimation Methods for Stochastic Finite-State Transducers , 2004, Machine Learning.

[12]  Hermann Ney,et al.  Word Reordering and a Dynamic Programming Beam Search Algorithm for Statistical Machine Translation , 2003, CL.

[13]  Enrique Vidal,et al.  Finite-state speech-to-speech translation , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.