Cross-Lingual Abstract Meaning Representation Parsing

Abstract Meaning Representation (AMR) research has mostly focused on English. We show that it is possible to use AMR annotations for English as a semantic representation for sentences written in other languages. We exploit an AMR parser for English and parallel corpora to learn AMR parsers for Italian, Spanish, German, and Chinese. Qualitative analysis shows that the new parsers overcome structural differences between the languages. We further propose a method to evaluate the parsers that does not require gold-standard data in the target languages. This method correlates highly with the gold-standard evaluation, obtaining a Pearson correlation coefficient of 0.95.
