Deep Learning for Efficient Discriminative Parsing

We propose a new, fast, purely discriminative algorithm for natural language parsing, based on a “deep” recurrent convolutional graph transformer network (GTN). Assuming a decomposition of a parse tree into a stack of “levels”, the network predicts each level of the tree taking into account the predictions for previous levels. Using only a few basic text features that leverage the word representations of Collobert and Weston (2008), we show performance similar (in F1 score) to existing purely discriminative parsers and to standard “benchmark” parsers (such as the Collins parser, which is based on probabilistic context-free grammars), with a substantial speed advantage.
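To make the “levels” decomposition concrete, here is a minimal Python sketch (not the authors' implementation): the lowest constituents, those spanning only words, form level 1; collapsing them and reading off the next layer of lowest constituents yields level 2, and so on up to the root. The tree encoding, the `tree_levels` helper, and the half-open word spans are illustrative choices of ours, not part of the paper.

```python
from typing import List, Tuple

# A node is (label, children); a child is either a word (str) or another node.
Tree = Tuple[str, list]

def tree_levels(tree: Tree) -> List[List[Tuple[str, Tuple[int, int]]]]:
    """Split a parse tree into a stack of levels: level 1 holds the lowest
    constituents, and each higher level is what becomes lowest once the
    levels below it are collapsed. Each level is a list of
    (label, (start, end)) chunks with half-open word spans."""
    levels: List[List[Tuple[str, Tuple[int, int]]]] = []

    def walk(node: Tree, start: int) -> Tuple[int, int]:
        # Returns (end position, height); height 1 = lowest constituents.
        label, children = node
        pos, height = start, 0
        for child in children:
            if isinstance(child, str):   # a word occupies one position
                pos += 1
            else:                        # recurse into a sub-constituent
                pos, child_height = walk(child, pos)
                height = max(height, child_height)
        height += 1                      # this node sits one level higher
        while len(levels) < height:
            levels.append([])
        levels[height - 1].append((label, (start, pos)))
        return pos, height

    walk(tree, 0)
    return levels

# Example: "the cat sat on the mat"
tree = ("S", [("NP", ["the", "cat"]),
              ("VP", ["sat", ("PP", ["on", ("NP", ["the", "mat"])])])])
for i, level in enumerate(tree_levels(tree), start=1):
    print(f"level {i}: {level}")
# level 1: [('NP', (0, 2)), ('NP', (4, 6))]
# level 2: [('PP', (3, 6))]
# level 3: [('VP', (2, 6))]
# level 4: [('S', (0, 6))]
```

Under this view, parsing reduces to a sequence of tagging problems, one per level, which is what lets the network predict each level while conditioning on its predictions for the levels below.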

[1] Andrew J. Viterbi. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Trans. Inf. Theory, 1967.

[2] Yann LeCun. Une procédure d'apprentissage pour réseau à seuil asymétrique (A learning scheme for asymmetric threshold networks), 1985.

[3] D. Rumelhart. Learning internal representations by back-propagating errors, 1986.

[4] Geoffrey E. Hinton et al. Learning sets of filters using back-propagation, 1987.

[5] Lawrence D. Jackel et al. Backpropagation Applied to Handwritten Zip Code Recognition. Neural Computation, 1989.

[6] L. Bottou. Stochastic Gradient Learning in Neural Networks, 1991.

[7] Beatrice Santorini et al. Building a Large Annotated Corpus of English: The Penn Treebank. CL, 1993.

[8] David M. Magerman. Statistical Decision-Tree Models for Parsing. ACL, 1995.

[9] Michael Collins. A New Statistical Parser Based on Bigram Lexical Dependencies. ACL, 1996.

[10] Steven P. Abney. Partial parsing via finite-state cascades. Natural Language Engineering, 1996.

[11] Yoshua Bengio et al. Global training of document processing systems using graph transformer networks. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1997.

[12] Yoshua Bengio et al. Gradient-based learning applied to document recognition. Proc. IEEE, 1998.

[13] Yoshua Bengio et al. A Neural Probabilistic Language Model. J. Mach. Learn. Res., 2003.

[14] Eugene Charniak. A Maximum-Entropy-Inspired Parser. ANLP, 2000.

[15] Andrew McCallum et al. Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. ICML, 2001.

[16] Michael Collins. Head-Driven Statistical Models for Natural Language Parsing. CL, 2003.

[17] Fernando Pereira et al. Shallow Parsing with Conditional Random Fields. NAACL, 2003.

[18] Wei Li et al. Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons. CoNLL, 2003.

[19] James Henderson. Discriminative Training of a Neural Network Statistical Parser. ACL, 2004.

[20] Ben Taskar et al. Max-Margin Parsing. EMNLP, 2004.

[21] Adwait Ratnaparkhi. Learning to Parse Natural Language with Maximum Entropy Models. Machine Learning, 1999.

[22] Eugene Charniak et al. Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking. ACL, 2005.

[23] Phil Blunsom et al. Semantic Role Labelling with Tree Conditional Random Fields. CoNLL, 2005.

[24] I. Dan Melamed et al. Advances in Discriminative Parsing. ACL, 2006.

[25] Xavier Carreras et al. TAG, Dynamic Programming, and the Perceptron for Efficient, Feature-Rich Parsing. CoNLL, 2008.

[26] Ronan Collobert and Jason Weston. A unified architecture for natural language processing: deep neural networks with multitask learning. ICML, 2008.

[27] Dan Klein et al. Sparse Multi-Scale Grammars for Discriminative Latent Variable Parsing. EMNLP, 2008.

[28] Christopher D. Manning et al. Efficient, Feature-based, Conditional Random Field Parsing. ACL, 2008.

[29] Ronan Collobert et al. Natural Language Processing (Almost) from Scratch. J. Mach. Learn. Res., 2011.