Pushing the boundaries of deep parsing

I examine the application of deep parsing techniques to a range of Natural Language Processing tasks as well as methods to improve their performance. Focussing specifically on the English Resource Grammar, a hand-crafted grammar of English based on the Head-Driven Phrase Structure Grammar formalism, I examine some techniques for improving parsing accuracy in diverse domains and methods for evaluating these improvements. I also evaluate the utility of the in-depth linguistic analyses available from this grammar for some specific NLP applications such as biomedical information extraction, as well as investigating other applications of the semantic output available from this grammar.

[1]  Thorsten Brants,et al.  TnT – A Statistical Part-of-Speech Tagger , 2000, ANLP.

[2]  Yvan Saeys,et al.  Analyzing text in search of bio-molecular events: a high-precision machine learning framework , 2009, BioNLP@HLT-NAACL.

[3]  Jun'ichi Tsujii,et al.  A Markov Logic Approach to Bio-Molecular Event Extraction , 2009, BioNLP@HLT-NAACL.

[4]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[5]  J. Bresnan Lexical-Functional Syntax , 2000 .

[6]  Sophia Ananiadou,et al.  Developing a Robust Part-of-Speech Tagger for Biomedical Text , 2005, Panhellenic Conference on Informatics.

[7]  Erik Velldal,et al.  Empirical Realization Ranking , 2009 .

[8]  Peter Sells,et al.  Lectures on contemporary syntactic theories , 1985 .

[9]  Julian Kupiec An Algorithm for Estimating the Parameters of Unrestricted Hidden Stochastic Context-Free Grammars , 1992, COLING.

[10]  Alexander M. Rush,et al.  On Dual Decomposition and Linear Programming Relaxations for Natural Language Processing , 2010, EMNLP.

[11]  Jun'ichi Tsujii,et al.  Corpus annotation for mining biomedical events from literature , 2008, BMC Bioinformatics.

[12]  Jun'ichi Tsujii,et al.  Part-of-Speech Annotation of Biology Research Abstracts , 2004, LREC.

[13]  Timothy Baldwin,et al.  Biomedical Event Annotation with CRFs and Precision Grammars , 2009, BioNLP@HLT-NAACL.

[14]  Akinori Yonezawa,et al.  Overview of Genia Event Task in BioNLP Shared Task 2011 , 2011, BioNLP@ACL.

[15]  Georgiana Dinu,et al.  Inference Rules and their Application to Recognizing Textual Entailment , 2009, EACL.

[16]  Fernando Pereira,et al.  Identifying gene and protein mentions in text using conditional random fields , 2005, BMC Bioinformatics.

[17]  Jun'ichi Tsujii,et al.  Evaluating contributions of natural language parsers to protein–protein interaction extraction , 2008, Bioinform..

[18]  Aravind K. Joshi,et al.  Tree-Adjoining Grammars , 1997, Handbook of Formal Languages.

[19]  Daniel Gildea,et al.  Corpus Variation and Parser Performance , 2001, EMNLP.

[20]  Daniel Gildea,et al.  Automatic Labeling of Semantic Roles , 2000, ACL.

[21]  Jari Björne,et al.  BioInfer: a corpus for information extraction in the biomedical domain , 2007, BMC Bioinformatics.

[22]  Yi Zhang,et al.  Discriminant Ranking for Efficient Treebanking , 2010, COLING.

[23]  Christopher D. Manning,et al.  Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[24]  Eugene Charniak,et al.  Automatic Domain Adaptation for Parsing , 2010, NAACL.

[25]  Ulrich Schäfer,et al.  The ACL Anthology Searchbench , 2011, ACL.

[26]  Eugene Charniak,et al.  Reranking and Self-Training for Parser Adaptation , 2006, ACL.

[27]  Khalil Sima'an,et al.  Accurate Unlexicalized Parsing for Modern Hebrew , 2007, TSD.

[28]  Martial Hebert,et al.  Semi-Supervised Self-Training of Object Detection Models , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[29]  Christopher D. Manning,et al.  LinGO Redwoods A Rich and Dynamic Treebank for HPSG , 2002 .

[30]  Stephan Oepen,et al.  SEM-I rational MT : enriching deep grammars with a semantic interface for scalable machine translation. , 2005 .

[31]  Mark Aronoff,et al.  Contemporary linguistics: An introduction , 1989 .

[32]  Mark Steedman,et al.  Building Deep Dependency Structures using a Wide-Coverage CCG Parser , 2002, ACL.

[33]  Matthew Lease,et al.  Parsing Biomedical Literature , 2005, IJCNLP.

[34]  Stephan Oepen,et al.  Discriminant-Based MRS Banking , 2006, LREC.

[35]  Eugene Charniak,et al.  Tree-Bank Grammars , 1996, AAAI/IAAI, Vol. 2.

[36]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[37]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[38]  Alfonso Valencia,et al.  Overview of BioCreAtIvE: critical assessment of information extraction for biology , 2005, BMC Bioinformatics.

[39]  Diana McCarthy,et al.  Domain-Speci(cid:12)c Sense Distributions and Predominant Sense Acquisition , 2022 .

[40]  Thorsten Joachims,et al.  Cutting-plane training of structural SVMs , 2009, Machine Learning.

[41]  Mark Steedman,et al.  Acquiring Compact Lexicalized Grammars from a Cleaner Treebank , 2002, LREC.

[42]  Mark Steedman,et al.  The syntactic process , 2004, Language, speech, and communication.

[43]  Eugene Charniak,et al.  Effective Self-Training for Parsing , 2006, NAACL.

[44]  Ted Briscoe,et al.  Lexical rules in constraint based grammars , 1999, CL.

[45]  Jari Björne,et al.  A Graph Kernel for Protein-Protein Interaction Extraction , 2008, BioNLP.

[46]  Wendy W. Chapman,et al.  A Simple Algorithm for Identifying Negated Findings and Diseases in Discharge Summaries , 2001, J. Biomed. Informatics.

[47]  Dan Flickinger,et al.  Minimal Recursion Semantics: An Introduction , 2005 .

[48]  Ivan A. Sag,et al.  Syntactic Theory: A Formal Introduction , 1999, Computational Linguistics.

[49]  Joakim Nivre,et al.  MaltParser: A Data-Driven Parser-Generator for Dependency Parsing , 2006, LREC.

[50]  Stuart M. Shieber,et al.  Evidence against the context-freeness of natural language , 1985 .

[51]  Steven Abney,et al.  Statistical Methods and Linguistics , 2002 .

[52]  Yi Zhang,et al.  Disambiguating Compound Nouns for a Dynamic HPSG Treebank of Wall Street Journal Texts , 2010, LREC.

[53]  Andy Way,et al.  Wide-Coverage Deep Statistical Parsing Using Automatic Dependency Structure Annotation , 2008, CL.

[54]  Thorsten Brants,et al.  The LinGO Redwoods Treebank: Motivation and Preliminary Applications , 2002, COLING.

[55]  Mark Johnson,et al.  A Simple Pattern-matching Algorithm for Recovering Empty Nodes and their Antecedents , 2002, ACL.

[56]  Paul Rayson,et al.  Comparing Corpora using Frequency Profiling , 2000, Proceedings of the workshop on Comparing corpora -.

[57]  Stephan Oepen,et al.  Resolving Speculation: MaxEnt Cue Classification and Dependency-Based Scope Rules , 2010, CoNLL Shared Task.

[58]  Gertjan van Noord,et al.  The Alpino Dependency Treebank , 2001, CLIN.

[59]  Nigel Collier,et al.  Introduction to the Bio-entity Recognition Task at JNLPBA , 2004, NLPBA/BioNLP.

[60]  Dan Flickinger,et al.  A New Well-Formedness Criterion for Semantics Debugging , 2005 .

[61]  Ido Dagan,et al.  PROBABILISTIC TEXTUAL ENTAILMENT: GENERIC APPLIED MODELING OF LANGUAGE VARIABILITY , 2004 .

[62]  Jari Björne,et al.  Extracting Complex Biological Events with Rich Graph-Based Feature Sets , 2009, BioNLP@HLT-NAACL.

[63]  Mary Dalrymple,et al.  The PARC 700 Dependency Bank , 2003, LINC@EACL.

[64]  Allen C. Browne,et al.  Lexical methods for managing variation in biomedical terminologies. , 1994, Proceedings. Symposium on Computer Applications in Medical Care.

[65]  Andrew MacKinlay,et al.  The effects of part-of-speech tagsets on tagger performance , 2005 .

[66]  Michael White Glue Rules for Robust Chart Realization , 2011, ENLG.

[67]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[68]  Dan Flickinger,et al.  On building a more effcient grammar by exploiting types , 2000, Natural Language Engineering.

[69]  Preslav Nakov,et al.  Search Engine Statistics Beyond the n-Gram: Application to Noun Compound Bracketing , 2005, CoNLL.

[70]  Berthold Crysmann,et al.  Relative Clause Extraposition in German: An Efficient and Portable Implementation , 2005 .

[71]  David M. Magerman Statistical Decision-Tree Models for Parsing , 1995, ACL.

[72]  H. Somers,et al.  On the validity of the complement-adjunct distinction in valency grammar , 1984 .

[73]  Rayid Ghani,et al.  Analyzing the effectiveness and applicability of co-training , 2000, CIKM '00.

[74]  Zhiyong Lu,et al.  Overview of the BioCreative III Workshop , 2011, BMC Bioinformatics.

[75]  Alexander Clark Unsupervised induction of stochastic context-free grammars using distributional clustering , 2001, CoNLL.

[76]  Anette Frank Constraint-based RMRS Construction from Shallow Grammars , 2004, COLING.

[77]  Stefan Müller,et al.  HPSG Analysis of German , 2000 .

[78]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[79]  Adwait Ratnaparkhi,et al.  A Maximum Entropy Model for Prepositional Phrase Attachment , 1994, HLT.

[80]  Jun'ichi Tsujii,et al.  Probabilistic Disambiguation Models for Wide-Coverage HPSG Parsing , 2005, ACL.

[81]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[82]  János Csirik,et al.  The BioScope corpus: biomedical texts annotated for uncertainty, negation and their scopes , 2008, BMC Bioinformatics.

[83]  Regina Barzilay,et al.  Learning to Paraphrase: An Unsupervised Approach Using Multiple-Sequence Alignment , 2003, NAACL.

[84]  Halil Kilicoglu,et al.  Syntactic Dependency Based Heuristics for Biological Event Extraction , 2009, BioNLP@HLT-NAACL.

[85]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[86]  Ann Bies,et al.  Bracketing Guidelines For Treebank II Style Penn Treebank Project , 1995 .

[87]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[88]  Keh-Yih Su,et al.  An Automatic Treebank Conversion Algorithm for Corpus Sharing , 1994, ACL.

[89]  Patrick Pantel,et al.  DIRT @SBT@discovery of inference rules from text , 2001, KDD '01.

[90]  Hwee Tou Ng,et al.  Improved Statistical Machine Translation for Resource-Poor Languages Using Related Resource-Rich Languages , 2009, EMNLP.

[91]  Fei Xia,et al.  Multilingual Structural Projection across Interlinear Text , 2007, HLT-NAACL.

[92]  Srinivas Bangalore,et al.  Supertagging: An Approach to Almost Parsing , 1999, CL.

[93]  Deyu Zhou,et al.  Methodological Review: Extracting interactions between proteins from the literature , 2008 .

[94]  Eugene Charniak,et al.  Self-Training for Biomedical Parsing , 2008, ACL.

[95]  Jun'ichi Tsujii,et al.  Bidirectional Inference with the Easiest-First Strategy for Tagging Sequence Data , 2005, HLT.

[96]  R. Huddleston English Grammar: An Outline , 1988 .

[97]  Geoffrey Leech,et al.  Corpus Annotation: Linguistic Information from Computer Text Corpora , 1997 .

[98]  Andreas Vlachos,et al.  Two Strong Baselines for the BioNLP 2009 Event Extraction Task , 2010, BioNLP@ACL.

[99]  Jun'ichi Tsujii,et al.  Syntax Annotation for the GENIA Corpus , 2005, IJCNLP.

[100]  Ulrich Callmeier,et al.  PET – a platform for experimentation with efficient HPSG processing techniques , 2000, Natural Language Engineering.

[101]  David Schlangen,et al.  The interpretation of non-sentential utterances in dialogue , 2003, SIGDIAL Workshop.

[102]  Daniel M. Bikel,et al.  Design of a multi-lingual, parallel-processing statistical parsing engine , 2002 .

[103]  Geoffrey K. Pullum,et al.  Generalized Phrase Structure Grammar , 1985 .

[104]  Kenji Sagae Self-Training without Reranking for Parser Domain Adaptation and Its Impact on Semantic Role Labeling , 2010 .

[105]  Mark Lauer,et al.  Designing Statistical Language Learners: Experiments on Noun Compounds , 1996, ArXiv.

[106]  Geoffrey K. Pullum,et al.  Natural languages and context-free languages , 1982 .

[107]  Hans-Ulrich Krieger,et al.  A Bag of Useful Techniques for Efficient and Robust Parsing , 1999, ACL.

[108]  Stephan Oepen,et al.  Efficiency in Unification-Based N-Best Parsing , 2007, Trends in Parsing Technology.

[109]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[110]  Mitchell P. Marcus,et al.  Text Chunking using Transformation-Based Learning , 1995, VLC@ACL.

[111]  Jun'ichi Tsujii,et al.  Adapting a Probabilistic Disambiguation Model of an HPSG Parser to a New Domain , 2005, IJCNLP.

[112]  Udo Hahn,et al.  Event Extraction from Trimmed Dependency Graphs , 2009, BioNLP@HLT-NAACL.

[113]  Dan Klein,et al.  A Generative Constituent-Context Model for Improved Grammar Induction , 2002, ACL.

[114]  Alexander S. Yeh,et al.  More accurate tests for the statistical significance of result differences , 2000, COLING.

[115]  Martha Palmer,et al.  Extracting Tree Adjoining Grammars from Bracketed Corpora , 2009 .

[116]  Timothy Baldwin,et al.  Unsupervised Parse Selection for HPSG , 2010, EMNLP.

[117]  Andrew McCallum,et al.  Robust Biomedical Event Extraction with Dual Decomposition and Minimal Domain Adaptation , 2011, BioNLP@ACL.

[118]  Su Jian,et al.  Exploring Deep Knowledge Resources in Biomedical Name Recognition , 2004, NLPBA/BioNLP.

[119]  Eric Nichols,et al.  The Hinoki Treebank A Treebank for Text Understanding , 2004, IJCNLP.

[120]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[121]  Virginia Teller Review of Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition by Daniel Jurafsky and James H. Martin. Prentice Hall 2000. , 2000 .

[122]  Sophia Ananiadou,et al.  How to make the most of NE dictionaries in statistical NER , 2008, BMC Bioinformatics.

[123]  Wolfgang Lezius,et al.  TIGER: Linguistic Interpretation of a German Corpus , 2004 .

[124]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[125]  Andrew B. Clegg,et al.  Evaluating and Integrating Treebank Parsers on a Biomedical Corpus , 2005, ACL 2005.

[126]  James R. Curran,et al.  Investigating GIS and Smoothing for Maximum Entropy Taggers , 2003, EACL.

[127]  Adam Kilgarriff,et al.  How Dominant Is the Commonest Sense of a Word? , 2004, TSD.

[128]  K. Bretonnel Cohen,et al.  A corpus of full-text journal articles is a robust evaluation tool for revealing differences in performance of biomedical natural language processing tools , 2012, BMC Bioinformatics.

[129]  Yusuke Miyao,et al.  AKANE System : Protein-Protein Interaction 1 AKANE System : Protein-Protein Interaction Pairs in the BioCreAtIvE 2 Challenge , PPI-IPS subtask , 2007 .

[130]  Stephan Oepen,et al.  LinGO Redwoods , 2004 .

[131]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[132]  増市 博,et al.  Japanese Parser on the basis of the Lexical-Functional Grammar Formalism and its Evaluation , 2003 .

[133]  Stephen Clark,et al.  Evaluating a Wide-Coverage CCG Parser , 2013 .

[134]  Miriam Butt,et al.  The Parallel Grammar Project , 2002, COLING 2002.

[135]  Brian Roark,et al.  Supervised and unsupervised PCFG adaptation to novel domains , 2003, NAACL.

[136]  Mark Johnson,et al.  Estimators for Stochastic “Unification-Based” Grammars , 1999, ACL.

[137]  Zheng-Yu Niu,et al.  Exploiting Heterogeneous Treebanks for Parsing , 2009, ACL/IJCNLP.

[138]  Daniel Jurafsky,et al.  Learning Syntactic Patterns for Automatic Hypernym Discovery , 2004, NIPS.

[139]  Hans-Ulrich Krieger,et al.  TDL-A Type Description Language for Constraint-Based Grammars , 1994, COLING.

[140]  Timothy Baldwin,et al.  A parser-based approach to detecting modification of biomedical events , 2011, DTMBIO '11.

[141]  Stephan Oepen,et al.  Parser Evaluation Using Elementary Dependency Matching , 2011, IWPT.

[142]  Rob Malouf,et al.  A Comparison of Algorithms for Maximum Entropy Parameter Estimation , 2002, CoNLL.

[143]  Timothy Baldwin,et al.  An Empirical Model of Multiword Expression Decomposability , 2003, ACL 2003.

[144]  Timothy Baldwin,et al.  Treeblazing: Using External Treebanks to Filter Parse Forests for Parse Selection and Treebanking , 2011, IJCNLP.

[145]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[146]  Sampo Pyysalo,et al.  Overview of BioNLP’09 Shared Task on Event Extraction , 2009, BioNLP@HLT-NAACL.

[147]  Ivan A. Sag,et al.  Book Reviews: Head-driven Phrase Structure Grammar and German in Head-driven Phrase-structure Grammar , 1996, CL.

[148]  Martin Kay,et al.  Algorithm schemata and data structures in syntactic processing , 1986 .

[149]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[150]  Fei Xia,et al.  The Penn Chinese TreeBank: Phrase structure annotation of a large corpus , 2005, Natural Language Engineering.

[151]  James R. Curran,et al.  Parsing the WSJ Using CCG and Log-Linear Models , 2004, ACL.

[152]  Mark Steedman,et al.  Wide-Coverage Semantic Representations from a CCG Parser , 2004, COLING.

[153]  Emily M. Bender,et al.  Efficient Deep Processing of Japanese , 2002, ALR@COLING.

[154]  K. Bretonnel Cohen,et al.  The structural and content aspects of abstracts versus bodies of full text journal articles are different , 2010, BMC Bioinformatics.

[155]  Stephen Clark,et al.  Porting a lexicalized-grammar parser to the biomedical domain , 2009, J. Biomed. Informatics.

[156]  Robert Malouf,et al.  Wide Coverage Parsing with Stochastic Attribute Value Grammars , 2004 .

[157]  Halil Kilicoglu,et al.  Adapting a General Semantic Interpretation Approach to Biological Event Extraction , 2011, BioNLP@ACL.

[158]  Ted Briscoe,et al.  Biomedical Event Extraction without Training Data , 2009, BioNLP@HLT-NAACL.

[159]  Michael I. Jordan,et al.  On Discriminative vs. Generative Classifiers: A comparison of logistic regression and naive Bayes , 2001, NIPS.

[160]  Stephan Oepen,et al.  Parser engineering and performance profiling , 2000, Natural Language Engineering.

[161]  Halil Kilicoglu,et al.  Recognizing speculative language in biomedical research articles: a linguistically motivated perspective , 2008, BMC Bioinformatics.

[162]  Stanley F. Chen,et al.  A Gaussian Prior for Smoothing Maximum Entropy Models , 1999 .

[163]  Ido Dagan,et al.  The Third PASCAL Recognizing Textual Entailment Challenge , 2007, ACL-PASCAL@ACL.

[164]  Jingbo Zhu,et al.  Heterogeneous Parsing via Collaborative Decoding , 2010, COLING.

[165]  Barbara Plank,et al.  Exploring an Auxiliary Distribution Based Approach to Domain Adaptation of a Syntactic Disambiguation Model , 2008, CF+CDPE@COLING.

[166]  Jun'ichi Tsujii,et al.  Corpus-Oriented Grammar Development for Acquiring a Head-Driven Phrase Structure Grammar from the Penn Treebank , 2004, IJCNLP.

[167]  Eugene Charniak,et al.  Coarse-to-Fine n-Best Parsing and MaxEnt Discriminative Reranking , 2005, ACL.

[168]  James R. Curran,et al.  Adding Noun Phrase Structure to the Penn Treebank , 2007, ACL.

[169]  Jun'ichi Tsujii,et al.  Feature Forest Models for Probabilistic HPSG Parsing , 2008, CL.

[170]  Timothy Baldwin,et al.  Prepositions in Applications: A Survey and Introduction to the Special Issue , 2009, CL.

[171]  Jun'ichi Tsujii,et al.  GENIA corpus - a semantically annotated corpus for bio-textmining , 2003, ISMB.

[172]  K. Bretonnel Cohen,et al.  The textual characteristics of traditional and Open Access scientific journals are similar , 2008, BMC Bioinformatics.

[173]  Joel Nothman,et al.  Evaluating a Statistical CCG Parser on Wikipedia , 2009, PWNLP@IJCNLP.

[174]  Eugene Charniak,et al.  A Maximum-Entropy-Inspired Parser , 2000, ANLP.

[175]  Ted Briscoe,et al.  The Second Release of the RASP System , 2006, ACL.

[176]  Peter C. Chapin Formal languages I , 1973, CSC '73.

[177]  Seth Kulick,et al.  Integrated Annotation for Biomedical Information Extraction , 2004, HLT-NAACL 2004.

[178]  Dekang Lin,et al.  DIRT – Discovery of Inference Rules from Text , 2001 .

[179]  Dan Flickinger,et al.  An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG , 2000, LREC.

[180]  János Csirik,et al.  The CoNLL-2010 Shared Task: Learning to Detect Hedges and their Scope in Natural Language Text , 2010, CoNLL Shared Task.

[181]  Alex Lascarides,et al.  An Algebra for Semantic Construction in Constraint-based Grammars , 2001, ACL.

[182]  Daniel Marcu,et al.  Syntax-based Alignment of Multiple Translations: Extracting Paraphrases and Generating New Sentences , 2003, NAACL.

[183]  A. Valencia,et al.  Evaluation of text-mining systems for biology: overview of the Second BioCreative community challenge , 2008, Genome Biology.

[184]  Mihai Surdeanu,et al.  Event Extraction as Dependency Parsing for BioNLP 2011 , 2011, BioNLP@ACL.

[185]  Eric Nichols,et al.  Improving statistical machine translation by paraphrasing the training data. , 2008, IWSLT.

[186]  Jari Björne,et al.  Generalizing Biomedical Event Extraction , 2011, BioNLP@ACL.

[187]  Emiel Krahmer,et al.  Comparing Phrase-based and Syntax-based Paraphrase Generation , 2011, Monolingual@ACL.

[188]  Ralph Grishman,et al.  A Procedure for Quantitatively Comparing the Syntactic Coverage of English Grammars , 1991, HLT.

[189]  Mark Johnson,et al.  Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques , 2002, ACL.

[190]  Stephan Oepen,et al.  High Precision Treebanking—Blazing Useful Trees Using POS Information , 2005, ACL.

[191]  James R. Curran,et al.  Wide-Coverage Efficient Statistical Parsing with CCG and Log-Linear Models , 2007, Computational Linguistics.

[192]  Stephan Oepen,et al.  Stochastic HPSG Parse Disambiguation using the Redwoods Corpus , 2005 .

[193]  Suzanne Stevenson,et al.  Statistical Measures of the Semi-Productivity of Light Verb Constructions , 2004 .

[194]  Stephan Oepen,et al.  Extracting and Annotating Wikipedia Sub-Domains — Towards a New eScience Community Resource , 2008 .

[195]  Wolfgang Wahlster,et al.  Verbmobil: Foundations of Speech-to-Speech Translation , 2000, Artificial Intelligence.