Efficient Transfer Learning for Quality Estimation with Bottleneck Adapter Layer

The Predictor-Estimator framework is widely used for quality estimation (QE) owing to its strong performance: the predictor extracts features and the estimator evaluates translation quality. However, training the predictor from scratch is computationally expensive. In this paper, we propose an efficient transfer learning framework that transfers knowledge from NMT data into QE models. We also propose BAL-QE, a Predictor-Estimator-style model that extracts high-quality features with a pre-trained NMT model and performs quality classification with a fine-tuned Bottleneck Adapter Layer (BAL). Experiments show that BAL-QE reaches 97% of SOTA performance on the WMT19 En-De and En-Ru QE tasks while training only 3% of the parameters, within 4 hours on 4 Titan XP GPUs. Compared with the widely used NuQE baseline, BAL-QE improves performance by 47% (En-Ru) and 75% (En-De).
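The core of the approach is the bottleneck adapter: a small trainable module inserted into a frozen pre-trained backbone, so only the adapter's few parameters are updated. Below is a minimal pure-Python sketch of one adapter block in the usual form (down-projection, nonlinearity, up-projection, residual connection); the dimensions, weight values, and the `adapter_forward` name are illustrative assumptions, not the paper's actual implementation.

```python
# Sketch of a bottleneck adapter layer (BAL): project the backbone's
# hidden vector down to a small bottleneck, apply a nonlinearity,
# project back up, and add a residual connection. Only the two small
# projections are trained; the frozen NMT backbone supplies x.

def matvec(w, x):
    """Multiply matrix w (list of rows) by vector x."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]

def relu(v):
    return [max(0.0, a) for a in v]

def adapter_forward(x, w_down, w_up):
    """Adapter output: x + W_up(relu(W_down(x))), with b << d."""
    bottleneck = relu(matvec(w_down, x))             # d -> b
    expanded = matvec(w_up, bottleneck)              # b -> d
    return [xi + ei for xi, ei in zip(x, expanded)]  # residual add

# Adapters are typically initialised near-identity: with a
# zero up-projection the block passes its input through unchanged.
d, b = 4, 2                                  # hidden dim, bottleneck dim
x = [0.5, -1.0, 2.0, 0.25]                   # hidden state from frozen backbone
w_down = [[0.1] * d for _ in range(b)]
w_up = [[0.0] * b for _ in range(d)]
print(adapter_forward(x, w_down, w_up))      # identical to x
```

Because only `w_down` and `w_up` (roughly 2*d*b parameters per layer) are updated, fine-tuning touches a small fraction of the full model, which is what allows training only about 3% of the parameters.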
