论文信息 - A Finite-State Approach to Machine Translation

A Finite-State Approach to Machine Translation

The problem of machine translation can be viewed as consisting of two subproblems (a) Lexical Selection and (b) Lexical Reordering. We propose stochastic finite-state models for these two subproblems in this paper. Stochastic finite-state models are efficiently learnable from data, effective for decoding and are associated with a calculus for composing models which allows for tight integration of constraints from various levels of language processing. We present a method for learning stochastic finite-state models for lexical choice and lexical reordering that are trained automatically from pairs of source and target utterances. We use this method to develop models for English-Japanese translation and present the performance of these models for translation on speech and text. We also evaluate the efficacy of such a translation model in the context of a call routing task of unconstrained speech utterances.

Srinivas Bangalore | Giuseppe Riccardi

[1] Enrique Vidal,et al. Text and speech translation by means of subsequential transducers , 1996, Nat. Lang. Eng..

[2] Srinivas Bangalore,et al. Stochastic finite-state models for spoken language machine translation , 2000 .

[3] Giuseppe Riccardi,et al. How may I help you? , 1997, Speech Commun..

[4] Yaser Al-Onaizan,et al. Translation with Finite-State Devices , 1998, AMTA.

[5] Roberto Pieraccini,et al. Non-deterministic stochastic language models for speech recognition , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[6] Mark-Jan Nederhof,et al. Practical Experiments with Regular Approximation of Context-Free Languages , 1999, CL.

[7] Kenneth Ward Church,et al. Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus , 2001, Computational Linguistics.

[8] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[9] Dekai Wu,et al. Stochastic Inversion Transduction Grammars and Bilingual Parsing of Parallel Corpora , 1997, CL.

[10] Michael Riley,et al. Speech Recognition by Composition of Weighted Finite Automata , 1996, ArXiv.

[11] Martin Kay,et al. Regular Models of Phonological Rule Systems , 1994, CL.