Paper information - A SomAgent statistical machine translation

A SomAgent statistical machine translation

Abstract: This paper describes how the word alignment task performed within SOMAgent works together with a statistical machine translation system to learn a phrase translation table. We studied improvements in translation quality using syntax-augmented machine translation. We also experimented with different degrees of linguistic analysis, from the lexical level to the syntactic or semantic level, in order to generate more precise alignments. We developed a contextual environment based on the Self-Organizing Map that models a semantic agent (SOMAgent). The agent learns the correct meaning of a word used in context in order to deal with specific phenomena such as ambiguity, and it generates more precise alignments that can improve the first choice of the statistical machine translation system by supplying it with linguistic knowledge.
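
To make the idea of a Self-Organizing Map learning word meaning from context more concrete, the following is a minimal sketch in plain Python. It is not the paper's implementation: the two-unit one-dimensional map, the context features ("money" and "river" co-occurrence scores for an ambiguous word such as "bank"), the initial weights, and the function names `best_matching_unit` and `train_som` are all assumptions made for illustration.

```python
import math


def best_matching_unit(weights, vector):
    """Return the index of the map unit whose weights are closest to `vector`."""
    def sq_dist(w):
        return sum((wi - vi) ** 2 for wi, vi in zip(w, vector))
    return min(range(len(weights)), key=lambda i: sq_dist(weights[i]))


def train_som(samples, init_weights, epochs=20, lr0=0.5, radius0=1.0):
    """Train a 1-D SOM; learning rate and neighbourhood radius decay linearly."""
    weights = [list(w) for w in init_weights]  # copy so the caller's list is untouched
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)
        radius = max(radius0 * (1.0 - frac), 1e-3)
        for vector in samples:
            bmu = best_matching_unit(weights, vector)
            for i, w in enumerate(weights):
                # Gaussian neighbourhood: units near the winner move more.
                influence = math.exp(-((i - bmu) ** 2) / (2 * radius ** 2))
                for d in range(len(w)):
                    w[d] += lr * influence * (vector[d] - w[d])
    return weights


# Hypothetical context vectors for occurrences of "bank":
# [co-occurrence with money-related words, co-occurrence with river-related words]
samples = [
    [1.0, 0.0], [0.9, 0.1],   # financial contexts
    [0.0, 1.0], [0.1, 0.9],   # river contexts
]

# Illustrative starting weights (an assumption; real SOMs often start randomly).
weights = train_som(samples, [[0.6, 0.4], [0.4, 0.6]])

financial_unit = best_matching_unit(weights, [1.0, 0.0])
river_unit = best_matching_unit(weights, [0.0, 1.0])
new_context_unit = best_matching_unit(weights, [0.8, 0.2])
print(financial_unit, river_unit, new_context_unit)
```

In this toy setting, each sense of the ambiguous word ends up on its own map unit, so a new occurrence can be assigned a sense by looking up its best-matching unit. How SOMAgent passes such a decision on to the alignment step is not specified in the abstract and is not modelled here.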

[26] Javier Bajo, et al. A SomAgent statistical machine translation, 2011, Appl. Soft Comput.
