On Multilingual Training of Neural Dependency Parsers

We show that a recently proposed neural dependency parser can be improved by joint training on multiple languages from the same family. The parser is implemented as a deep neural network whose only input is the orthographic representation of each word. To parse successfully, the network must discover how linguistically relevant concepts can be inferred from word spellings. We analyze the character and word representations learned by the network to establish which properties of the languages it accounts for. In particular, we show that the parser learns an approximate correspondence between Latin characters and their Cyrillic counterparts, and that it groups Polish and Russian words with similar grammatical functions. Finally, we evaluate the parser on selected languages from the Universal Dependencies dataset and show that it is competitive with other recently proposed state-of-the-art methods, while retaining a simple structure.
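To make the "only input is word spellings" setup concrete, below is a minimal sketch of a character-level word encoder of the kind the abstract describes: word vectors are composed from a character vocabulary shared across languages, which is what allows joint multilingual training to align related scripts (e.g. Latin and Cyrillic) in one embedding space. This is an illustrative assumption, not the paper's exact architecture; the class name CharWordEncoder and all hyperparameters are hypothetical, and PyTorch stands in for the original Theano/Blocks implementation.

```python
# Hypothetical sketch: embed a word from its characters with a BiLSTM.
# Shared character embeddings across languages are the mechanism by which
# multilingual training can relate Latin and Cyrillic characters.
import torch
import torch.nn as nn

class CharWordEncoder(nn.Module):
    def __init__(self, n_chars: int, char_dim: int = 32, word_dim: int = 128):
        super().__init__()
        # One embedding table for the union of all languages' characters.
        self.char_emb = nn.Embedding(n_chars, char_dim, padding_idx=0)
        # Bidirectional LSTM over the character sequence (word_dim must be even).
        self.rnn = nn.LSTM(char_dim, word_dim // 2,
                           batch_first=True, bidirectional=True)

    def forward(self, char_ids: torch.Tensor) -> torch.Tensor:
        # char_ids: (batch, max_word_len) integer character indices, 0 = padding
        emb = self.char_emb(char_ids)           # (batch, len, char_dim)
        _, (h, _) = self.rnn(emb)               # h: (2, batch, word_dim // 2)
        # Concatenate the final forward and backward states into one word vector.
        return torch.cat([h[0], h[1]], dim=-1)  # (batch, word_dim)

# Usage: encode two toy "words" given as rows of character ids.
enc = CharWordEncoder(n_chars=100)
words = torch.tensor([[5, 12, 7, 0], [9, 3, 14, 21]])
print(enc(words).shape)  # torch.Size([2, 128])
```

In such a setup, the analysis the abstract mentions (associating Latin characters with Cyrillic counterparts) amounts to inspecting nearest neighbors in the learned rows of the character embedding table.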
