Paper Information - Discriminative Training and Maximum Entropy Models for Statistical Machine Translation

We present a framework for statistical machine translation of natural languages based on direct maximum entropy models, which contains the widely used source-channel approach as a special case. All knowledge sources are treated as feature functions, which depend on the source language sentence, the target language sentence and possible hidden variables. This approach allows a baseline machine translation system to be extended easily by adding new feature functions. We show that a baseline statistical machine translation system is significantly improved using this approach.
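The relationship between the direct maximum entropy model and the source-channel approach can be illustrated with a minimal sketch. It assumes the usual log-linear form, in which each candidate target sentence is scored by a weighted sum of feature functions. The helper names (`posterior`, `best`), the candidate sentences, and all probability values are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import math

def score(feats, weights):
    """Weighted sum of feature values: sum_m lambda_m * h_m(e, f)."""
    return sum(w * h for w, h in zip(weights, feats))

def posterior(candidates, weights):
    """Log-linear posterior over a fixed set of candidate translations."""
    scores = {e: score(feats, weights) for e, feats in candidates.items()}
    # Subtract the maximum score before exponentiating for numerical stability.
    top = max(scores.values())
    z = sum(math.exp(s - top) for s in scores.values())
    return {e: math.exp(s - top) / z for e, s in scores.items()}

def best(candidates, weights):
    """Decision rule: pick the candidate with the highest weighted score."""
    return max(candidates, key=lambda e: score(candidates[e], weights))

# Hypothetical toy values: [log p(e), log p(f | e)] for each candidate e.
candidates = {
    "the house is small": [math.log(0.4), math.log(0.3)],
    "the home is little": [math.log(0.1), math.log(0.5)],
    "small is the house": [math.log(0.05), math.log(0.6)],
}

# With both weights set to 1, the rule is argmax p(e) * p(f | e),
# i.e. the source-channel decision rule.
source_channel = best(candidates, [1.0, 1.0])

# Other weights give a different model over the same features; here the
# language model feature is down-weighted (an arbitrary illustrative choice).
reweighted = best(candidates, [0.2, 1.0])

probs = posterior(candidates, [1.0, 1.0])
```

In this sketch, adding a new knowledge source amounts to appending one more value to each feature list and one more weight; in practice the weights would be trained rather than set by hand.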

Hermann Ney | Franz Josef Och

[1] J. Darroch, et al. Generalized Iterative Scaling for Log-Linear Models, 1972.

[2] Lalit R. Bahl, et al. Maximum mutual information estimation of hidden Markov model parameters for speech recognition, 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3] Wolfgang Wahlster, et al. Verbmobil: Translation of Face-To-Face Dialogs, 1993, MTSUMMIT.

[4] Robert L. Mercer, et al. The Mathematics of Statistical Machine Translation: Parameter Estimation, 1993, CL.

[5] Hermann Ney, et al. On the Probabilistic Interpretation of Neural Network Classifiers and Discriminative Training Criteria, 1995, IEEE Trans. Pattern Anal. Mach. Intell.

[6] Biing-Hwang Juang, et al. Statistical and Discriminative Methods for Speech Recognition, 1996.

[7] Adam L. Berger, et al. A Maximum Entropy Approach to Natural Language Processing, 1996, CL.

[8] Salim Roukos, et al. Feature-based language understanding, 1997, EUROSPEECH.

[9] Peter Beyerlein, et al. Discriminative model combination, 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98.

[10] Salim Roukos, et al. Maximum likelihood and discriminative training of direct translation models, 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98.

[11] Dietrich Klakow, et al. Compact Maximum Entropy Language Models, 1999.

[12] Hermann Ney, et al. Improved Alignment Models for Statistical Machine Translation, 1999, EMNLP.

[13] Hermann Ney, et al. An Evaluation Tool for Machine Translation: Fast Evaluation for MT Research, 2000, LREC.

[14] Hermann Ney, et al. A Comparison of Alignment Models for Statistical Machine Translation, 2000, COLING.

[15] H. Ney, et al. Model-based MCE bound to the true Bayes' error, 2001, IEEE Signal Processing Letters.

[16] Salim Roukos, et al. Bleu: a Method for Automatic Evaluation of Machine Translation, 2002, ACL.