论文信息 - Efficient Embedded Decoding of Neural Network Language Models in a Machine Translation System

Efficient Embedded Decoding of Neural Network Language Models in a Machine Translation System

Neural Network Language Models (NNLMs) are a successful approach to Natural Language Processing tasks, such as Machine Translation. We introduce in this work a Statistical Machine Translation (SMT) system which fully integrates NNLMs in the decoding stage, breaking the traditional approach based on [Formula: see text]-best list rescoring. The neural net models (both language models (LMs) and translation models) are fully coupled in the decoding stage, allowing to more strongly influence the translation quality. Computational issues were solved by using a novel idea based on memorization and smoothing of the softmax constants to avoid their computation, which introduces a trade-off between LM quality and computational cost. These ideas were studied in a machine translation task with different combinations of neural networks used both as translation models and as target LMs, comparing phrase-based and [Formula: see text]-gram-based systems, showing that the integrated approach seems more promising for [Formula: see text]-gram-based systems, even with nonfull-quality NNLMs.

María José Castro Bleda | Francisco Zamora-Martínez | Francisco Zamora-Martínez

[1] Volkmar Frinken,et al. Neural network language models for off-line handwriting recognition , 2014, Pattern Recognition.

[2] María José Castro Bleda,et al. New Directions in Connectionist Language Modeling , 2003, IWANN.

[3] Yoshua Bengio,et al. A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[4] Robert L. Mercer,et al. The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[5] Francisco Casacuberta,et al. Machine Translation with Inferred Stochastic Finite-State Transducers , 2004, Computational Linguistics.

[6] Yoshua Bengio,et al. Neural net language models , 2008, Scholarpedia.

[7] Salvador España Boquera,et al. Efficient BP Algorithms for General Feedforward Neural Networks , 2007, IWINAC.

[8] Hermann Ney,et al. A Systematic Comparison of Various Statistical Alignment Models , 2003, CL.

[9] Meng Cai,et al. Efficient One-Pass Decoding with NNLM for Speech Recognition , 2014, IEEE Signal Processing Letters.

[10] José B. Mariño,et al. N-gram-based Machine Translation , 2006, CL.

[11] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.

[12] Holger Schwenk,et al. Continuous space language models , 2007, Comput. Speech Lang..

[13] Andrés Ortiz,et al. Ensembles of Deep Learning Architectures for the Early Diagnosis of the Alzheimer's Disease , 2016, Int. J. Neural Syst..