The Java Version of the C & C Parser Version 0 . 95

Acknowledgements The first Java version of the C&C parser was created by Stephen Clark, based heavily on the C++ implementation of C&C. This first attempt had a working version of the DP-based chart parser, but a buggy version of the beam-search chart parser. Darren Foong, for his 2014/15 Part II project in the Computer Laboratory [15], did an outstanding job in taking this buggy version and producing a working beam-search parser, as well as considerably cleaning up the Java code. Luana Bulat spent considerable time in producing a new markedup file covering all the lexical categories in CCGbank, one of the features of the new parser. Wenduan Xu provided invaluable assistance along the way. Thanks to Laura Rimell, a long-suffering user of C&C and collaborator of the first author, for acting as a sounding board while the Java version was being built. Thanks to Yue Zhang, whose original shift-reduce beam search parser was the precursor to the chart-based beam search parser presented here; and Ryan McDonald, whose Cambridge NLIP seminar in 2014 provided the inspiration for appying beam search to the chart. Thanks to Julia Hockenmaier for helping Luana create the new markedup file, and of course for creating CCGbank, without which none of this research would exist. Thanks to Mark Steedman, whose shoulders we have been standing on for some time, and finally James Curran, whose original design for C&C is heavily reflected in this new version, and from whom the first author learnt most of what he knows about practical computing.

[1]  Martin Kay,et al.  Syntactic Process , 1979, ACL.

[2]  M. F.,et al.  Bibliography , 1985, Experimental Gerontology.

[3]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[4]  Christopher D. Manning,et al.  Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[5]  Michael Collins,et al.  Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms , 2002, EMNLP.

[6]  Mark Johnson,et al.  Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques , 2002, ACL.

[7]  Mark Steedman,et al.  Generative Models for Statistical Parsing with Combinatory Categorial Grammar , 2002, ACL.

[8]  Brian Roark,et al.  Incremental Parsing with the Perceptron Algorithm , 2004, ACL.

[9]  Eugene Charniak,et al.  Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking , 2005, ACL.

[10]  Ben Taskar,et al.  An End-to-End Discriminative Approach to Machine Translation , 2006, ACL.

[11]  Ted Briscoe,et al.  An introduction to tag sequence grammars and the RASP system parser , 2006 .

[12]  James R. Curran,et al.  Multi-Tagging for Lexicalized-Grammar Parsing , 2006, ACL.

[13]  Ted Briscoe,et al.  Evaluating the Accuracy of an Unlexicalized Statistical Parser on the PARC DepBank , 2006, ACL.

[14]  James R. Curran,et al.  Wide-Coverage Efficient Statistical Parsing with CCG and Log-Linear Models , 2007, Computational Linguistics.

[15]  Mark Steedman,et al.  CCGbank: A Corpus of CCG Derivations and Dependency Structures Extracted from the Penn Treebank , 2007, CL.

[16]  Johan Bos,et al.  Linguistically Motivated Large-Scale NLP with C&C and Boxer , 2007, ACL.

[17]  James R. Curran,et al.  Perceptron Training for a Wide-Coverage Lexicalized-Grammar Parser , 2007, ACL 2007.

[18]  Johan Bos,et al.  Wide-Coverage Semantic Analysis with Boxer , 2008, STEP.

[19]  Stephen Clark,et al.  Adapting a Lexicalized-Grammar Parser to Contrasting Domains , 2008, EMNLP.

[20]  Stephen Clark,et al.  A Tale of Two Parsers: Investigating and Combining Graph-based and Transition-based Dependency Parsing , 2008, EMNLP.

[21]  Stephen Clark,et al.  Shift-Reduce CCG Parsing , 2011, ACL.

[22]  Hao Zhang,et al.  Generalized Higher-Order Dependency Parsing with Cube Pruning , 2012, EMNLP.

[23]  Yang Guo,et al.  Structured Perceptron with Inexact Search , 2012, NAACL.

[24]  Michael Auli,et al.  Integrated Supertagging and Parsing , 2012 .

[25]  Hao Zhang,et al.  Online Learning for Inexact Hypergraph Search , 2013, EMNLP.

[26]  Brendan T. O'Connor,et al.  A Framework for (Under)specifying Dependency Syntax without Overloading Annotators , 2013, LAW@ACL.

[27]  Stephen Clark,et al.  Evaluating a Wide-Coverage CCG Parser , 2013 .

[28]  Stephen Clark,et al.  Shift-Reduce CCG Parsing with a Dependency Model , 2014, ACL.

[29]  Stephen Clark,et al.  CCG Supertagging with a Recurrent Neural Network , 2015, ACL.