Overcoming Catastrophic Forgetting in Zero-Shot Cross-Lingual Generation

In this paper, we explore the challenging problem of performing a generative task (i.e., summarization) in a target language when labeled data is only available in English. We assume a strict setting with no access to parallel data or machine translation. Prior work has shown, and we confirm, that standard transfer learning techniques struggle in this setting, as a generative multilingual model fine-tuned purely on English catastrophically forgets how to generate non-English. Given the recent rise of parameter-efficient adaptation techniques (e.g., prompt tuning), we conduct the first investigation into how well these methods can overcome catastrophic forgetting to enable zero-shot cross-lingual generation. We find that parameter-efficient adaptation provides gains over standard fine-tuning when transferring between less-related languages, e.g., from English to Thai. However, a significant gap still remains between these methods and fully-supervised baselines. To improve cross-lingual transfer further, we explore three approaches: (1) mixing in unlabeled multilingual data, (2) pre-training prompts on target language data, and (3) explicitly factoring prompts into recombinable language and task components. Our methods can provide further quality gains, suggesting that robust zero-shot cross-lingual generation is within reach.
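To make approach (3) concrete, below is a minimal, illustrative sketch of factored soft prompts. It is not the authors' code: the dimensions, prompt lengths, and helper names are assumptions, and the frozen model is stood in for by random embeddings. The key idea it demonstrates is that a task prompt trained only with the English language prompt can be recombined with a different language prompt (e.g., Thai) at inference time.

```python
import numpy as np

# Hypothetical shapes; a real setup would match the frozen model's
# embedding width and tune prompt lengths as hyperparameters.
D_MODEL = 512          # embedding width of the (assumed) frozen model
LANG_PROMPT_LEN = 50   # tokens reserved for the language component
TASK_PROMPT_LEN = 50   # tokens reserved for the task component

rng = np.random.default_rng(0)

# One trainable soft prompt per language and one per task. In prompt
# tuning, only these matrices receive gradients; the model stays frozen.
lang_prompts = {
    "en": rng.normal(size=(LANG_PROMPT_LEN, D_MODEL)),
    "th": rng.normal(size=(LANG_PROMPT_LEN, D_MODEL)),
}
task_prompts = {
    "summarization": rng.normal(size=(TASK_PROMPT_LEN, D_MODEL)),
}

def build_input(lang: str, task: str, token_embeddings: np.ndarray) -> np.ndarray:
    """Prepend the [language prompt; task prompt] prefix to embedded input tokens."""
    prefix = np.concatenate([lang_prompts[lang], task_prompts[task]], axis=0)
    return np.concatenate([prefix, token_embeddings], axis=0)

# Training pairs the English language prompt with the summarization task
# prompt on English labeled data. For zero-shot cross-lingual inference,
# the Thai language prompt is swapped in while the task prompt is reused.
doc = rng.normal(size=(128, D_MODEL))  # stand-in for an embedded input document
zero_shot_input = build_input("th", "summarization", doc)
print(zero_shot_input.shape)           # (228, 512)
```

The design intuition, as described in the abstract, is that factoring isolates "which language to generate in" from "what task to perform", so the two components can be learned separately and recombined without labeled target-language data.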
