论文信息 - Low Resource Sequence Tagging using Sentence Reconstruction

Low Resource Sequence Tagging using Sentence Reconstruction

This work revisits the task of training sequence tagging models with limited resources using transfer learning. We investigate several proposed approaches introduced in recent works and suggest a new loss that relies on sentence reconstruction from normalized embeddings. Specifically, our method demonstrates how by adding a decoding layer for sentence reconstruction, we can improve the performance of various baselines. We show improved results on the CoNLL02 NER and UD 1.2 POS datasets and demonstrate the power of the method for transfer learning with low-resources achieving 0.6 F1 score in Dutch using only one sample from it.

[1] Heike Adel,et al. Adversarial Neural Networks for Cross-lingual Sequence Tagging , 2018, ArXiv.

[2] Zaiqing Nie,et al. Joint Entity Recognition and Disambiguation , 2015, EMNLP.

[3] Guillaume Lample,et al. Neural Architectures for Named Entity Recognition , 2016, NAACL.

[4] Trevor Darrell,et al. Adversarial Discriminative Domain Adaptation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5] Guillaume Lample,et al. Word Translation Without Parallel Data , 2017, ICLR.

[6] Jan Hajic,et al. Neural Architectures for Nested NER through Linearization , 2019, ACL.

[7] Jason Weston,et al. Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[8] Yiming Yang,et al. DARTS: Differentiable Architecture Search , 2018, ICLR.

[9] Barbara Plank,et al. Multilingual Part-of-Speech Tagging with Bidirectional Long Short-Term Memory Models and Auxiliary Loss , 2016, ACL.

[10] Wei Xu,et al. Bidirectional LSTM-CRF Models for Sequence Tagging , 2015, ArXiv.

[11] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[12] Yiming Yang,et al. XLNet: Generalized Autoregressive Pretraining for Language Understanding , 2019, NeurIPS.

[13] Xuanjing Huang,et al. How to Fine-Tune BERT for Text Classification? , 2019, CCL.

[14] Quoc V. Le,et al. Semi-Supervised Sequence Modeling with Cross-View Training , 2018, EMNLP.

[15] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[16] Heng Ji,et al. A Multi-lingual Multi-task Architecture for Low-resource Sequence Labeling , 2018, ACL.