论文信息 - Pushing the boundaries of deep parsing

Pushing the boundaries of deep parsing

I examine the application of deep parsing techniques to a range of Natural Language Processing tasks as well as methods to improve their performance. Focussing specifically on the English Resource Grammar, a hand-crafted grammar of English based on the Head-Driven Phrase Structure Grammar formalism, I examine some techniques for improving parsing accuracy in diverse domains and methods for evaluating these improvements. I also evaluate the utility of the in-depth linguistic analyses available from this grammar for some specific NLP applications such as biomedical information extraction, as well as investigating other applications of the semantic output available from this grammar.

Andrew Mackinlay | Andrew D. MacKinlay

[1] Thorsten Brants,et al. TnT – A Statistical Part-of-Speech Tagger , 2000, ANLP.

[2] Yvan Saeys,et al. Analyzing text in search of bio-molecular events: a high-precision machine learning framework , 2009, BioNLP@HLT-NAACL.

[3] Jun'ichi Tsujii,et al. A Markov Logic Approach to Bio-Molecular Event Extraction , 2009, BioNLP@HLT-NAACL.

[4] Michael Collins,et al. Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[5] J. Bresnan. Lexical-Functional Syntax , 2000 .

[6] Sophia Ananiadou,et al. Developing a Robust Part-of-Speech Tagger for Biomedical Text , 2005, Panhellenic Conference on Informatics.

[7] Erik Velldal,et al. Empirical Realization Ranking , 2009 .

[8] Peter Sells,et al. Lectures on contemporary syntactic theories , 1985 .

[9] Julian Kupiec. An Algorithm for Estimating the Parameters of Unrestricted Hidden Stochastic Context-Free Grammars , 1992, COLING.

[10] Alexander M. Rush,et al. On Dual Decomposition and Linear Programming Relaxations for Natural Language Processing , 2010, EMNLP.

[11] Jun'ichi Tsujii,et al. Corpus annotation for mining biomedical events from literature , 2008, BMC Bioinformatics.

[12] Jun'ichi Tsujii,et al. Part-of-Speech Annotation of Biology Research Abstracts , 2004, LREC.

[13] Timothy Baldwin,et al. Biomedical Event Annotation with CRFs and Precision Grammars , 2009, BioNLP@HLT-NAACL.

[14] Akinori Yonezawa,et al. Overview of Genia Event Task in BioNLP Shared Task 2011 , 2011, BioNLP@ACL.

[15] Georgiana Dinu,et al. Inference Rules and their Application to Recognizing Textual Entailment , 2009, EACL.

[16] Fernando Pereira,et al. Identifying gene and protein mentions in text using conditional random fields , 2005, BMC Bioinformatics.

[17] Jun'ichi Tsujii,et al. Evaluating contributions of natural language parsers to protein–protein interaction extraction , 2008, Bioinform..

[18] Aravind K. Joshi,et al. Tree-Adjoining Grammars , 1997, Handbook of Formal Languages.

[19] Daniel Gildea,et al. Corpus Variation and Parser Performance , 2001, EMNLP.

[20] Daniel Gildea,et al. Automatic Labeling of Semantic Roles , 2000, ACL.

[21] Jari Björne,et al. BioInfer: a corpus for information extraction in the biomedical domain , 2007, BMC Bioinformatics.

[22] Yi Zhang,et al. Discriminant Ranking for Efficient Treebanking , 2010, COLING.

[23] Christopher D. Manning,et al. Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[24] Eugene Charniak,et al. Automatic Domain Adaptation for Parsing , 2010, NAACL.

[25] Ulrich Schäfer,et al. The ACL Anthology Searchbench , 2011, ACL.

[26] Eugene Charniak,et al. Reranking and Self-Training for Parser Adaptation , 2006, ACL.

[27] Khalil Sima'an,et al. Accurate Unlexicalized Parsing for Modern Hebrew , 2007, TSD.

[28] Martial Hebert,et al. Semi-Supervised Self-Training of Object Detection Models , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[29] Christopher D. Manning,et al. LinGO Redwoods A Rich and Dynamic Treebank for HPSG , 2002 .

[30] Stephan Oepen,et al. SEM-I rational MT : enriching deep grammars with a semantic interface for scalable machine translation. , 2005 .

[31] Mark Aronoff,et al. Contemporary linguistics: An introduction , 1989 .

[32] Mark Steedman,et al. Building Deep Dependency Structures using a Wide-Coverage CCG Parser , 2002, ACL.

[33] Matthew Lease,et al. Parsing Biomedical Literature , 2005, IJCNLP.

[34] Stephan Oepen,et al. Discriminant-Based MRS Banking , 2006, LREC.

[35] Eugene Charniak,et al. Tree-Bank Grammars , 1996, AAAI/IAAI, Vol. 2.

[36] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.

[37] David Yarowsky,et al. Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[38] Alfonso Valencia,et al. Overview of BioCreAtIvE: critical assessment of information extraction for biology , 2005, BMC Bioinformatics.

[39] Diana McCarthy,et al. Domain-Speci(cid:12)c Sense Distributions and Predominant Sense Acquisition , 2022 .

[40] Thorsten Joachims,et al. Cutting-plane training of structural SVMs , 2009, Machine Learning.