A* CCG Parsing with a Supertag-factored Model

We introduce a new CCG parsing model which is factored on lexical category assignments. Parsing is then simply a deterministic search for the most probable category sequence that supports a CCG derivation. The parser is extremely simple, with a tiny feature set, no POS tagger, and no statistical model of the derivation or dependencies. Formulating the model in this way allows a highly effective heuristic for A parsing, which makes parsing extremely fast. Compared to the standard C&C CCG parser, our model is more accurate out-of-domain, is four times faster, has higher coverage, and is greatly simplified. We also show that using our parser improves the performance of a state-ofthe-art question answering system.

[1]  Mark Steedman,et al.  Unsupervised Induction of Cross-Lingual Semantic Relations , 2013, EMNLP.

[2]  James R. Curran,et al.  Wide-Coverage Efficient Statistical Parsing with CCG and Log-Linear Models , 2007, Computational Linguistics.

[3]  Adam Lopez,et al.  Efficient CCG Parsing: A* versus Adaptive Supertagging , 2011, ACL.

[4]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[5]  Stephen Clark,et al.  Shift-Reduce CCG Parsing with a Dependency Model , 2014, ACL.

[6]  Mark Steedman,et al.  Semi-supervised CCG Lexicon Extension , 2011, EMNLP.

[7]  Jun'ichi Tsujii,et al.  Extremely Lexicalized Models for Accurate and Fast HPSG Parsing , 2006, EMNLP.

[8]  Yonatan Bisk,et al.  Normal-form parsing for Combinatory Categorial Grammars with generalized composition and type-raising , 2010, COLING.

[9]  James R. Curran,et al.  Faster Parsing by Supertagger Adaptation , 2010, ACL.

[10]  Mark Steedman,et al.  Simple Semi-Supervised Learning for Prepositional Phrase Attachment , 2011, IWPT.

[11]  Mark Steedman,et al.  Improved CCG Parsing with Semi-supervised Supertagging , 2014, TACL.

[12]  Stephen Clark,et al.  Shift-Reduce CCG Parsing , 2011, ACL.

[13]  Mark Steedman,et al.  Large-scale Semantic Parsing without Question-Answer Pairs , 2014, TACL.

[14]  Gerald Penn,et al.  Accurate Context-Free Parsing with Combinatory Categorial Grammar , 2010, ACL.

[15]  Yoshua Bengio,et al.  Word Representations: A Simple and General Method for Semi-Supervised Learning , 2010, ACL.

[16]  Martin Kay,et al.  Syntactic Process , 1979, ACL.

[17]  Stephen Clark,et al.  Adapting a Lexicalized-Grammar Parser to Contrasting Domains , 2008, EMNLP.

[18]  Dan Klein,et al.  A* Parsing: Fast Exact Viterbi Parse Selection , 2003, NAACL.

[19]  James R. Curran,et al.  Fully Lexicalising CCGbank with Hat Categories , 2009, EMNLP.

[20]  Jason Baldridge,et al.  Multi-Modal Combinatory Categorial Grammar , 2003, EACL.

[21]  Ronan Collobert,et al.  Deep Learning for Efficient Discriminative Parsing , 2011, AISTATS.

[22]  Tom M. Mitchell,et al.  Joint Syntactic and Semantic Parsing with Combinatory Categorial Grammar , 2014, ACL.

[23]  Mark Steedman,et al.  CCGbank: A Corpus of CCG Derivations and Dependency Structures Extracted from the Penn Treebank , 2007, CL.

[24]  Joakim Nivre,et al.  An Efficient Algorithm for Projective Dependency Parsing , 2003, IWPT.

[25]  Srinivas Bangalore,et al.  Supertagging: An Approach to Almost Parsing , 1999, CL.

[26]  Mark Steedman,et al.  Generalizing a Strongly Lexicalized Parser using Unlabeled Data , 2014, EACL.

[27]  Jason Eisner Efficient Normal-Form Parsing for Combinatory Categorial Grammar , 1996, ACL.

[28]  Jari Björne,et al.  BioInfer: a corpus for information extraction in the biomedical domain , 2007, BMC Bioinformatics.

[29]  Dan Klein,et al.  Improved Inference for Unlexicalized Parsing , 2007, NAACL.

[30]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[31]  Mark Steedman,et al.  Combined Distributional and Logical Semantics , 2013, TACL.

[32]  Yoav Goldberg,et al.  An Efficient Algorithm for Easy-First Non-Directional Dependency Parsing , 2010, NAACL.

[33]  Adam Lopez,et al.  A Comparison of Loopy Belief Propagation and Dual Decomposition for Integrated CCG Supertagging and Parsing , 2011, ACL.

[34]  Johan Bos,et al.  Wide-Coverage Semantic Analysis with Boxer , 2008, STEP.

[35]  Julia Hockenmaier,et al.  Data and models for statistical parsing with combinatory categorial grammar , 2003 .

[36]  Philipp Koehn,et al.  Abstract Meaning Representation for Sembanking , 2013, LAW@ACL.

[37]  Ari Rappoport,et al.  Universal Conceptual Cognitive Annotation (UCCA) , 2013, ACL.

[38]  E. Hovy,et al.  A Fast , Effective , Non-Projective , Semantically-Enriched Parser , 2011 .

[39]  Mark Steedman,et al.  Taking Scope - The Natural Semantics of Quantifiers , 2011 .

[40]  Andrew Chou,et al.  Semantic Parsing on Freebase from Question-Answer Pairs , 2013, EMNLP.

[41]  Brian Harrington A Semantic Network Approach to Measuring Relatedness , 2010, COLING.