Inferring Selectional Preferences from Part-Of-Speech N-grams

We present the PONG method to compute selectional preferences using part-of-speech (POS) N-grams. From a corpus labeled with grammatical dependencies, PONG learns the distribution of word relations for each POS N-gram. From the much larger but unlabeled Google N-grams corpus, PONG learns the distribution of POS N-grams for a given pair of words. We derive the probability that one word has a given grammatical relation to the other. PONG estimates this probability by combining both distributions, whether or not either word occurs in the labeled corpus. PONG achieves higher average precision on 16 relations than a state-of-the-art baseline in a pseudo-disambiguation task, but lower coverage and recall.

[1]  Mark Dredze,et al.  Shared Components Topic Models with Application to Selectional Preference , 2011 .

[2]  Daniel Gildea,et al.  Automatic Labeling of Semantic Roles , 2000, ACL.

[3]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[4]  Kenneth Ward Church,et al.  Work on Statistical Methods for Word Sense Disambiguation , 1992 .

[5]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[6]  Hinrich Schfitze Context Space , 2001 .

[7]  Christopher D. Manning,et al.  Stanford typed dependencies manual , 2010 .

[8]  Oren Etzioni,et al.  A Latent Dirichlet Allocation Method for Selectional Preferences , 2010, ACL.

[9]  Mark Dredze,et al.  Shared Components Topic Models , 2012, HLT-NAACL.

[10]  Philip Resnik,et al.  Selectional Preference and Sense Disambiguation , 1997 .

[11]  Sabine Schulte im Walde,et al.  Combining EM Training and the MDL Principle for an Automatic Verb Classification Incorporating Selectional Preferences , 2008, ACL.

[12]  G. A. Barnard,et al.  Transmission of Information: A Statistical Theory of Communications. , 1961 .

[13]  Slav Petrov,et al.  A Universal Part-of-Speech Tagset , 2011, LREC.

[14]  P. Resnik Selectional constraints: an information-theoretic model and its computational realization , 1996, Cognition.

[15]  Katrin Erk,et al.  A Simple, Similarity-based Model for Selectional Preferences , 2007, ACL.

[16]  Katrin Erk,et al.  A Flexible, Corpus-Driven Model of Regular and Inverse Selectional Preferences , 2010, CL.

[17]  Li Cai,et al.  Exploiting Web-Derived Selectional Preference to Improve Statistical Dependency Parsing , 2011, ACL.

[18]  Mats Rooth,et al.  Inducing a Semantically Annotated Lexicon via EM-Based Clustering , 1999, ACL.

[19]  A. Thomasian Review of 'Transmission of Information, A Statistical Theory of Communications' (Fano, R. M.; 1961) , 1962 .