Syntactic mismatches in machine translation

This paper addresses one of the central problems arising at the transfer stage in machine translation: syntactic mismatches, that is, mismatches between the structure of a source-language sentence and the structure of its target-language equivalent. We assume that transfer is carried out at the level of the Deep-Syntactic Structure (DSyntS) as proposed in the Meaning-Text Theory (MTT). DSyntS is abstract enough to avoid all types of divergences that result either from restricted lexical co-occurrence or from surface-syntactic discrepancies between languages. The remaining types of syntactic divergences all occur not only interlinguistically but also intralinguistically. Consequently, establishing correspondences between semantically equivalent source- and target-language expressions that diverge in their syntactic structure is nothing other than paraphrasing. This allows us to adapt the powerful intralinguistic paraphrasing mechanism developed in MTT for the purposes of interlinguistic transfer.
