Paper information - Translate & Fill: Improving Zero-Shot Multilingual Semantic Parsing with Synthetic Data


While multilingual pretrained language models (LMs) fine-tuned on a single language have shown substantial cross-lingual task transfer capabilities, there is still a wide performance gap in semantic parsing tasks relative to settings where target-language supervision is available. In this paper, we propose a novel Translate-and-Fill (TaF) method to produce silver training data for a multilingual semantic parser. This method simplifies the popular Translate-Align-Project (TAP) pipeline and consists of a sequence-to-sequence filler model that constructs a full parse conditioned on an utterance and a view of the same parse. Our filler is trained on English data only but can accurately complete instances in other languages (i.e., translations of the English training utterances) in a zero-shot fashion. Experimental results on three multilingual semantic parsing datasets show that data augmentation with TaF reaches accuracies competitive with similar systems that rely on traditional alignment techniques.
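To make the filler's input and output concrete, here is a minimal sketch of how such examples might be assembled. The abstract does not define what a "view" of the parse is, so the sketch assumes it is the parse with the utterance tokens removed (only brackets and labels kept). The bracketed parse notation, the `<sep>` separator, the example utterances, and the helper names `parse_view` and `filler_input` are all hypothetical illustrations, not the paper's actual format or code.

```python
def parse_view(parse: str) -> str:
    """Return an assumed 'view' of a bracketed parse: keep opening labels
    and closing brackets, drop the utterance tokens."""
    kept = [tok for tok in parse.split() if tok.startswith("[") or tok == "]"]
    return " ".join(kept)


def filler_input(utterance: str, view: str, sep: str = "<sep>") -> str:
    """Concatenate an utterance and a parse view into one input string
    for a sequence-to-sequence filler (the separator is an assumption)."""
    return f"{utterance} {sep} {view}"


# English training pair: the filler learns to reconstruct the full parse
# from the utterance plus the view.
english_utterance = "what is the weather in paris"
english_parse = "[IN:GET_WEATHER what is the weather in [SL:LOCATION paris ] ]"
view = parse_view(english_parse)
train_example = (filler_input(english_utterance, view), english_parse)

# Zero-shot use: pair a translation of the utterance with the same view.
# A trained filler (not implemented here) would generate the target-language
# parse, which then serves as silver training data.
translated_utterance = "quel temps fait-il à paris"
inference_input = filler_input(translated_utterance, view)

print(train_example[0])
print(inference_input)
```

Because the view is taken from the English parse and reused unchanged, this setup needs no explicit word alignment step; filling in the target-language tokens is left entirely to the model.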
