论文信息 - Korean Morphological Analysis with Tied Sequence-to-Sequence Multi-Task Model

Korean Morphological Analysis with Tied Sequence-to-Sequence Multi-Task Model

Korean morphological analysis has been considered as a sequence of morpheme processing and POS tagging. Thus, a pipeline model of the tasks has been adopted widely by previous studies. However, the model has a problem that it cannot utilize interactions among the tasks. This paper formulates Korean morphological analysis as a combination of the tasks and presents a tied sequence-to-sequence multi-task model for training the two tasks simultaneously without any explicit regularization. The experiments prove the proposed model achieves the state-of-the-art performance.

Seong-Bae Park | Hyun-Je Song | Seong-Bae Park | Hyun-Je Song

[1] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Jihun Choi,et al. A grapheme-level approach for constructing a Korean morphological analyzer without linguistic knowledge , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[3] Dianhai Yu,et al. Multi-Task Learning for Multiple Language Translation , 2015, ACL.

[4] Hae-Chang Rim,et al. Probabilistic Modeling of Korean Morphology , 2009, IEEE Transactions on Audio, Speech, and Language Processing.

[5] Christopher D. Manning. Part-of-Speech Tagging from 97% to 100%: Is It Time for Some Linguistics? , 2011, CICLing.

[6] Andrew McCallum,et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[7] Josef van Genabith,et al. Neural Morphological Tagging from Characters for Morphologically Rich Languages , 2016, ArXiv.

[8] Young-Bum Kim,et al. Rich Character-Level Information for Korean Morphological Analysis and Part-of-Speech Tagging , 2018, COLING.

[9] Quoc V. Le,et al. Multi-task Sequence to Sequence Learning , 2015, ICLR.

[10] Seong-Bae Park,et al. Korean Part-of-speech Tagging Based on Morpheme Generation , 2020, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[11] David Chiang,et al. Tied Multitask Learning for Neural Speech Translation , 2018, NAACL.

[12] Christopher D. Manning,et al. Get To The Point: Summarization with Pointer-Generator Networks , 2017, ACL.

[13] Xuanjing Huang,et al. A Feature-Enriched Neural Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging , 2016, IJCAI.

[14] Jörg Tiedemann,et al. Character-based Joint Segmentation and POS Tagging for Chinese using Bidirectional RNN-CRF , 2017, IJCNLP.

[15] Stephen Clark,et al. Joint Word Segmentation and POS Tagging Using a Single Perceptron , 2008, ACL.

[16] Wei Xu,et al. Bidirectional LSTM-CRF Models for Sequence Tagging , 2015, ArXiv.

[17] Seung-Hoon Na,et al. Conditional Random Fields for Korean Morpheme Segmentation and POS Tagging , 2015, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[18] Dan Klein,et al. Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[19] Jae Jung Song,et al. THE KOREAN LANGUAGE: Structure, use and context , 2005 .

[20] Yoshua Bengio,et al. Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[21] Guillaume Lample,et al. Neural Architectures for Named Entity Recognition , 2016, NAACL.