Pattern Dictionary of English Prepositions

We present a new lexical resource for the study of preposition behavior, the Pattern Dictionary of English Prepositions (PDEP). This dictionary, which follows principles laid out in Hanks’ theory of norms and exploitations, is linked to 81,509 sentences for 304 prepositions, which have been made available under The Preposition Project (TPP). Notably, 47,285 sentences, initially untagged, provide a representative sample of preposition use, unlike the tagged sentences used in previous studies. Each sentence has been parsed with a dependency parser and our system has near-instantaneous access to features developed with this parser to explore and annotate properties of individual senses. The features make extensive use of WordNet. We have extended feature exploration to include lookup of FrameNet lexical units and VerbNet classes for use in characterizing preposition behavior. We have designed our system to allow public access to any of the data available in the system.

[1]  Robert C. Holte,et al.  Very Simple Classification Rules Perform Well on Most Commonly Used Datasets , 1993, Machine Learning.

[2]  Dirk Hovy,et al.  What’s in a Preposition? Dimensions of Sense Disambiguation for an Interesting Word Class , 2010, COLING.

[3]  Constantin Orasan,et al.  Barbecued Opakapaka: Using Semantic Preferences for Ontology Population , 2015, RANLP.

[4]  Deniz Yuret,et al.  KU: Word Sense Disambiguation by Substitution , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[5]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[6]  Patrick Hanks Corpus pattern analysis , 2004 .

[7]  Jason Weston,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2002, Machine Learning.

[8]  Adam Kilgarriff,et al.  The Sketch Engine , 2004 .

[9]  Katrin Erk,et al.  A WordNet Detour to FrameNet , 2005 .

[10]  Jan Svartvik,et al.  A __ comprehensive grammar of the English language , 1988 .

[11]  Judy Pearsall,et al.  Oxford Dictionary of English , 2010 .

[12]  Eneko Agirre,et al.  Selectional Preferences for Semantic Role Classification , 2013, CL.

[13]  Ken Litkowski Preposition Disambiguation : Still a Problem , 2013 .

[14]  Timothy Baldwin,et al.  A Classification Schema for Fast Disambiguation of Spatial Prepositions , 2015, IWGS.

[15]  Patrick Hanks,et al.  Lexical Analysis: Norms and Exploitations , 2013 .

[16]  Timothy Baldwin,et al.  MELB-YB: Preposition Sense Disambiguation Using Rich Semantic Features , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[17]  Adam Kilgarriff,et al.  Word Sketches for Turkish , 2012, LREC.

[18]  Dan Roth,et al.  Modeling Semantic Relations Expressed by Prepositions , 2013, TACL.

[19]  Stephen Tratz Semantically-enriched parsing for natural language understanding , 2011 .

[20]  Steven Bethard,et al.  Finding event, temporal and causal structure in text: a machine learning approach , 2007 .

[21]  Eduard H. Hovy,et al.  A Fast, Accurate, Non-Projective, Semantically-Enriched Parser , 2011, EMNLP.

[22]  Silvie Cinková,et al.  A database of semantic clusters of verb usages , 2012, LREC.

[23]  Kenneth C. Litkowski,et al.  Coverage and Inheritance in The Preposition Project , 2006, ACL 2006.

[24]  Kenneth C. Litkowski,et al.  SemEval-2007 Task 06: Word-Sense Disambiguation of Prepositions , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[25]  Graeme Hirst,et al.  Comparison of different feature sets for identification of variants in progressive aphasia , 2014, CLPsych@ACL.

[26]  James Pustejovsky,et al.  Towards a Generative Lexical Resource: The Brandeis Semantic Ontology , 2006, LREC.

[27]  Dan Roth,et al.  A Joint Model for Extended Semantic Role Labeling , 2011, EMNLP.

[28]  Daniel Jurafsky,et al.  Learning Syntactic Patterns for Automatic Hypernym Discovery , 2004, NIPS.

[29]  Ken Litkowski,et al.  The Preposition Project , 2021, ArXiv.