Classifying Divergences in Cross-lingual AMR Pairs

Translation divergences are varied and widespread, challenging approaches that rely on parallel text. To annotate translation divergences, we propose a schema grounded in the Abstract Meaning Representation (AMR), a sentence-level semantic framework instantiated for a number of languages. By comparing parallel AMR graphs, we can identify specific points of divergence. Each divergence is labeled with both a type and a cause. We release a small corpus of annotated English-Spanish data, and analyze the annotations in our corpus.

[1]  Dien Dinh,et al.  Improving Neural Machine Translation with AMR Semantic Graphs , 2021, Mathematical Problems in Engineering.

[2]  Marine Carpuat,et al.  Beyond Noise: Mitigating the Impact of Fine-grained Semantic Divergences on Neural Machine Translation , 2021, ACL.

[3]  James Pustejovsky,et al.  Designing a Uniform Meaning Representation for Natural Language Processing , 2021, KI - Künstliche Intelligenz.

[4]  Alexandre Allauzen,et al.  Measure and Evaluation of Semantic Divergence across Two Languages , 2021, ACL.

[5]  Gabriel Illouz,et al.  Detecting Non-literal Translations by Fine-tuning Cross-lingual Pre-trained Language Models , 2020, COLING.

[6]  Roberto Navigli,et al.  Enabling Cross-Lingual AMR Parsing with Transfer Learning Techniques , 2020, EMNLP.

[7]  Thiago Alexandre Salgueiro Pardo,et al.  Semantically Inspired AMR Alignment for the Portuguese Language , 2020, EMNLP.

[8]  Marine Carpuat,et al.  Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank , 2020, EMNLP.

[9]  Omri Abend,et al.  Fine-Grained Analysis of Cross-Linguistic Syntactic Divergences , 2020, ACL.

[10]  Collin Baker,et al.  Exploring Crosslinguistic Frame Alignment , 2020, FRAMENET.

[11]  Marco Damonte,et al.  Understanding and generating language with Abstract Meaning Representation , 2020 .

[12]  Hyejin Park,et al.  Copula and Case-Stacking Annotations for Korean AMR , 2019, Proceedings of the First International Workshop on Designing Meaning Representations.

[13]  Marco Antonio Sobrevilla Cabezudo,et al.  Towards a General Abstract Meaning Representation Corpus for Brazilian Portuguese , 2019, LAW@ACL.

[14]  Yue Zhang,et al.  Semantic Neural Machine Translation Using AMR , 2019, TACL.

[15]  Ha My Linh,et al.  A Case Study on Meaning Representation for Vietnamese , 2019, Proceedings of the First International Workshop on Designing Meaning Representations.

[16]  W. Bruce Croft,et al.  Cross-Linguistic Semantic Annotation: Reconciling the Language-Specific and the Universal , 2019, Proceedings of the First International Workshop on Designing Meaning Representations.

[17]  Yunyao Li,et al.  Towards Universal Semantic Representation , 2019, Proceedings of the First International Workshop on Designing Meaning Representations.

[18]  Arantza Díaz de Ilarraza,et al.  Annotating Abstract Meaning Representations for Spanish , 2018, LREC.

[19]  Marine Carpuat,et al.  Identifying Semantic Divergences in Parallel Text without Annotations , 2018, NAACL.

[20]  Johan Bos,et al.  Evaluating Scoped Meaning Representations , 2018, LREC.

[21]  Shay B. Cohen,et al.  Cross-Lingual Abstract Meaning Representation Parsing , 2017, NAACL.

[22]  Nianwen Xue,et al.  Translation Divergences in Chinese–English Machine Translation: An Empirical Investigation , 2017, CL.

[23]  Marine Carpuat,et al.  Detecting Cross-Lingual Semantic Divergence for Neural Machine Translation , 2017, NMT@ACL.

[24]  Noelia Migueles Abraira A Study Towards Spanish Abstract Meaning Representation , 2017 .

[25]  Johan Bos,et al.  The Parallel Meaning Bank: Towards a Multilingual Corpus of Translations Annotated with Compositional Meaning Representations , 2017, EACL.

[26]  Bin Li,et al.  Annotating the Little Prince with Chinese AMRs , 2016, LAW@ACL.

[27]  Adam Lopez,et al.  AMRICA: an AMR Inspector for Cross-language Alignments , 2015, NAACL.

[28]  Nianwen Xue,et al.  Not an Interlingua, But Close: Comparison of English AMRs to Chinese and Czech , 2014, LREC.

[29]  Johan Bos,et al.  Semantic Annotation Issues in Parallel Meaning Banking , 2014, ACL 2014.

[30]  Philipp Koehn,et al.  Abstract Meaning Representation for Sembanking , 2013, LAW@ACL.

[31]  Kevin Knight,et al.  Smatch: an Evaluation Metric for Semantic Feature Structures , 2013, ACL.

[32]  Ari Rappoport,et al.  UCCA: A Semantics-based Grammatical Annotation Scheme , 2013, IWCS.

[33]  Emmanuel Navarro,et al.  Graph Representation of Synonymy and Translation Resources for Crosslinguistic Modelisation of Meaning , 2010, PACLIC.

[34]  Martin Cmejrek,et al.  Prague Czech-English Dependency Treebank: Any Hopes for a Common Annotation Scheme? , 2004, FCP@NAACL-HLT.

[35]  Bonnie J. Dorr,et al.  Machine Translation Divergences: A Formal Description and Proposed Solution , 1994, CL.

[36]  Clare R. Voss,et al.  Constraints on the Space of MT Divergences , 1993 .

[37]  Bonnie J. Dorr,et al.  Solving Thematic Divergences in Machine Translation , 1990, ACL.