Exploring Cross-Lingual Transfer of Morphological Knowledge In Sequence-to-Sequence Models

Multi-task training is an effective method for mitigating data sparsity. It has recently been applied to cross-lingual transfer learning for paradigm completion, the task of producing the inflected forms of lemmata, with sequence-to-sequence networks. However, it remains unclear how the model transfers knowledge across languages and which information, if any, is shared. To investigate this, we propose a set of data-dependent experiments using an existing encoder-decoder recurrent neural network for the task. Our results show that the performance gains indeed surpass a pure regularization effect and that knowledge about language and morphology can be transferred.
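
To make the multi-task setup concrete, the sketch below shows how joint training data for cross-lingual paradigm completion is typically assembled: a single sequence-to-sequence model is trained on the union of a high-resource and a low-resource language, with a language tag and the morphological feature bundle prepended to each source sequence. The language pair, tag inventory, and helper names here are illustrative assumptions, not details taken from the paper.

# Minimal sketch (assumed setup): building joint training data for
# cross-lingual paradigm completion. One seq2seq model sees both languages;
# encoder and decoder parameters are shared, and only the language tag tells
# the examples apart. All example data below is hypothetical.

def make_example(lang, lemma, features, inflected):
    # Source: language tag, morphological feature tags, then the lemma's characters.
    source = [f"<{lang}>"] + [f"<{f}>" for f in features] + list(lemma)
    # Target: characters of the inflected form.
    target = list(inflected)
    return source, target

# High-resource language (e.g. Spanish) provides most of the training signal.
high_resource = [
    make_example("es", "hablar", ["V", "IND", "PRS", "3", "SG"], "habla"),
    make_example("es", "comer",  ["V", "IND", "PRS", "3", "SG"], "come"),
]

# Low-resource target language (e.g. Portuguese) contributes only a handful
# of examples; the hope is that related morphology transfers across languages.
low_resource = [
    make_example("pt", "falar", ["V", "IND", "PRS", "3", "SG"], "fala"),
]

# Joint multi-task training set fed to a single encoder-decoder model.
train_data = high_resource + low_resource

for src, tgt in train_data:
    print(" ".join(src), "->", "".join(tgt))

In this kind of setup, whether the gains on the low-resource language reflect genuine transfer of morphological knowledge or merely regularization from the extra data is exactly the question the data-dependent experiments are designed to probe.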
