Deep Learning for Natural Language Parsing

Natural language processing problems (such as speech recognition, text-based data mining, and text or speech generation) are becoming increasingly important. Before effectively approaching many of these problems, it is necessary to process the syntactic structures of the sentences. Syntactic parsing is the task of constructing a syntactic parse tree over a sentence which describes the structure of the sentence. Parse trees are used as part of many language processing applications. In this paper, we present a multi-lingual dependency parser. Using advanced deep learning techniques, our parser architecture tackles common issues with parsing such as long-distance head attachment, while using ‘architecture engineering’ to adapt to each target language in order to reduce the feature engineering often required for parsing tasks. We implement a parser based on this architecture to utilize transfer learning techniques to address important issues related with limited-resourced language. We exceed the accuracy of state-of-the-art parsers on languages with limited training resources by a considerable margin. We present promising results for solving core problems in natural language parsing, while also performing at state-of-the-art accuracy on general parsing tasks.

[1]  Khaled Shaalan,et al.  Arabic Natural Language Processing: Challenges and Solutions , 2009, TALIP.

[2]  Walter Daelemans,et al.  TiMBL: Tilburg Memory-Based Learner, version 2.0, Reference guide , 1998 .

[3]  Noah A. Smith,et al.  Transition-Based Dependency Parsing with Stack Long Short-Term Memory , 2015, ACL.

[4]  Yuji Matsumoto,et al.  Universal Dependencies 2.1 , 2017 .

[5]  Carlos Gómez-Rodríguez,et al.  An Efficient Dynamic Oracle for Unrestricted Non-Projective Parsing , 2015, ACL.

[6]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[7]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[8]  Noah A. Smith,et al.  Improved Transition-based Parsing by Modeling Characters instead of Words with LSTMs , 2015, EMNLP.

[9]  Qiang Zhou,et al.  A SVM-Based Model for Chinese Functional Chunk Parsing , 2006, SIGHAN@COLING/ACL.

[10]  Karin M. Verspoor,et al.  An Improved Neural Network Model for Joint POS Tagging and Dependency Parsing , 2018, CoNLL.

[11]  Joakim Nivre,et al.  An Efficient Algorithm for Projective Dependency Parsing , 2003, IWPT.

[12]  Joakim Nivre,et al.  A Dynamic Oracle for Arc-Eager Dependency Parsing , 2012, COLING.

[13]  Baobao Chang,et al.  An Effective Neural Network Model for Graph-based Dependency Parsing , 2015, ACL.

[14]  Joakim Nivre,et al.  Algorithms for Deterministic Incremental Dependency Parsing , 2008, CL.

[15]  Hai Zhao,et al.  Joint Learning of POS and Dependencies for Multilingual Universal Dependency Parsing , 2018, CoNLL.

[16]  Stefan Daniel Dumitrescu,et al.  NLP-Cube: End-to-End Raw Text Processing With Neural Networks , 2018, CoNLL.

[17]  Baobao Chang,et al.  Graph-based Dependency Parsing with Bidirectional LSTM , 2016, ACL.

[18]  Giuseppe Attardi,et al.  Dependency Parsing Domain Adaptation using Transductive SVM , 2012, EACL 2012.

[19]  Yuji Matsumoto MaltParser: A language-independent system for data-driven dependency parsing , 2005 .

[20]  Prakhar Gupta,et al.  Learning Word Vectors for 157 Languages , 2018, LREC.

[21]  Steven Abney,et al.  Parsing By Chunks , 1991 .

[22]  Yuji Matsumoto,et al.  Statistical Dependency Analysis with Support Vector Machines , 2003, IWPT.

[23]  Dat Quoc Nguyen,et al.  A Novel Neural Network Model for Joint POS Tagging and Graph-based Dependency Parsing , 2017, CoNLL.

[24]  Allan Ramsay,et al.  The Application of Constraint Rules to Data-driven Parsing , 2015, RANLP.

[25]  Nizar Habash,et al.  CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies , 2017, CoNLL.

[26]  Eliyahu Kiperwasser,et al.  Simple and Accurate Dependency Parsing Using Bidirectional LSTM Feature Representations , 2016, TACL.

[27]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[28]  Allan Ramsay,et al.  The Selection a Classifier for Data-driven Parsing , 2015 .

[29]  Danqi Chen,et al.  A Fast and Accurate Dependency Parser using Neural Networks , 2014, EMNLP.

[30]  Joakim Nivre,et al.  Deterministic Dependency Parsing of English Text , 2004, COLING.

[31]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[32]  Michael A. Covington,et al.  A Fundamental Algorithm for Dependency Parsing , 2004 .

[33]  James H. Martin,et al.  Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition , 2000 .

[34]  Hrant Khachatrian,et al.  Towards JointUD: Part-of-speech Tagging and Lemmatization using Recurrent Neural Networks , 2018, CoNLL Shared Task.