Exploiting Semantic Role Resources for Preposition Disambiguation

This article describes how semantic role resources can be exploited for preposition disambiguation. The main resources include the semantic role annotations provided by the Penn Treebank and FrameNet tagged corpora. The resources also include the assertions contained in the Factotum knowledge base, as well as information from Cyc and Conceptual Graphs. A common inventory is derived from these in support of definition analysis, which is the motivation for this work. The disambiguation concentrates on relations indicated by prepositional phrases, and is framed as word-sense disambiguation for the preposition in question. A new type of feature for word-sense disambiguation is introduced, using WordNet hypernyms as collocations rather than just words. Various experiments over the Penn Treebank and FrameNet data are presented, including prepositions classified separately versus together, and illustrating the effects of filtering. Similar experimentation is done over the Factotum data, including a method for inferring likely preposition usage from corpora, as knowledge bases do not generally indicate how relationships are expressed in English (in contrast to the explicit annotations on this in the Penn Treebank and FrameNet). Other experiments are included with the FrameNet data mapped into the common relation inventory developed for definition analysis, illustrating how preposition disambiguation might be applied in lexical acquisition.

[1]  Dan Roth,et al.  Generalized Inference with Multiple Semantic Role Labeling Systems , 2005, CoNLL.

[2]  Kenneth C. Litkowski,et al.  SemEval-2007 Task 06: Word-Sense Disambiguation of Prepositions , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[3]  Stanley Starosta,et al.  Valency and case in computational linguistics , 1990, Machine Translation.

[4]  Rohini K. Srihari,et al.  A Hybrid Approach for Named Entity and Sub-Type Tagging , 2000, ANLP.

[5]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[6]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[7]  Ken Litkowski,et al.  Senseval-3 task: Automatic labeling of semantic roles , 2004, SENSEVAL@ACL.

[8]  Martha Stone Palmer Semantic Processing for Finite Domains (Studies in Natural Language Processing) , 2005 .

[9]  Timothy Baldwin,et al.  Prepositions in Applications: A Survey and Introduction to the Special Issue , 2009, CL.

[10]  P. Cassidy An Investigation of the Semantic Relations in the Roget ’ s Thesaurus : Preliminary Results , 2010 .

[11]  J. Moake,et al.  This article has been cited by other articles , 2003 .

[12]  Janyce Wiebe,et al.  Mapping Collocational Properties into Machine Learning Features , 1998, VLC@COLING/ACL.

[13]  Martha Palmer,et al.  Semantic Processing for Finite Domains , 1990, CL.

[14]  Von-Wun Soo,et al.  An Empirical Study on Thematic Knowledge Acquisition Based on Syntactic Clues and Heuristics , 1993, ACL.

[15]  John Lyons,et al.  语义学引论 = Linguistic Semantics , 2000 .

[16]  Xavier Carreras,et al.  Introduction to the CoNLL-2005 Shared Task: Semantic Role Labeling , 2005, CoNLL.

[17]  Ann Bies,et al.  Bracketing Guidelines for Treebank II Style , 2002 .

[18]  John F. Sowa,et al.  Conceptual Structures: Information Processing in Mind and Machine , 1983 .

[19]  Ann Bies,et al.  Bracketing Guidelines For Treebank II Style Penn Treebank Project , 1995 .

[20]  Stan Matwin,et al.  Text Classification Using WordNet Hypernyms , 1998, WordNet@ACL/COLING.

[21]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[22]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[23]  Stan Szpakowicz,et al.  Semiautomatic recognition of semantic relationships in english technical texts , 1998 .

[24]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[25]  Janyce Wiebe,et al.  Decomposable Modeling in Natural Language Processing , 1999, CL.

[26]  Douglas B. Lenat,et al.  CYC: a large-scale investment in knowledge infrastructure , 1995, CACM.

[27]  Fritz Lehmann,et al.  Big Posets of Participatings and Thematic Roles , 1996, ICCS.

[28]  Janyce Wiebe,et al.  Classifying Functional Relations in Factotum via WordNet Hypernym Associations , 2003, CICLing.

[29]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[30]  Daniel Jurafsky,et al.  Support Vector Learning for Semantic Argument Classification , 2005, Machine Learning.

[31]  Rada Mihalcea,et al.  Instance Based Learning with Automatic Feature Selection Applied to Word Sense Disambiguation , 2002, COLING.

[32]  Xavier Carreras,et al.  Introduction to the CoNLL-2004 Shared Task: Semantic Role Labeling , 2004, CoNLL.

[33]  Adam Kilgarriff,et al.  SENSEVAL: an exercise in evaluating world sense disambiguation programs , 1998, LREC.

[34]  Janyce Wiebe,et al.  Class-based collocations for Word Sense Disambiguation , 2004, SENSEVAL@ACL.

[35]  Ann Bies,et al.  The Penn Treebank: Annotating Predicate Argument Structure , 1994, HLT.

[36]  Cristian Grozea,et al.  Finding optimal parameter settings for high performance word sense disambiguation , 2004 .

[37]  Adam Kilgarriff,et al.  The Senseval-3 English lexical sample task , 2004, SENSEVAL@ACL.

[38]  Daniel Gildea,et al.  Automatic Labeling of Semantic Roles , 2000, ACL.

[39]  Srini Narayanan,et al.  Semantic Extraction with Wide-Coverage Lexical Resources , 2003, HLT-NAACL.

[40]  Janyce Wiebe,et al.  Empirical acquisition of conceptual distinctions via dictionary definitions , 2005 .

[41]  Bertram C. Bruce Case Systems for Natural Language , 1975, Artif. Intell..

[42]  Stuart C. Shapiro Review of Knowledge representation: logical, philosophical, and computational foundations by John F. Sowa. Brooks/Cole 2000. , 2001 .

[43]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[44]  Yukiko Sasaki Alam,et al.  Decision Trees for Sense Disambiguation of Prepositions: Case of Over , 2004, HLT-NAACL 2004.

[45]  Collin F. Baker,et al.  Building a Large Lexical Databank Which Provides Deep Semantics , 2001, PACLIC.

[46]  Timothy Baldwin,et al.  MELB-YB: Preposition Sense Disambiguation Using Rich Semantic Features , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[47]  Eugene Charniak,et al.  Assigning Function Tags to Parsed Text , 2000, ANLP.

[48]  Christiane Fellbaum,et al.  Modifiers in WordNet , 1998 .

[49]  Kenneth C. Litkowski Digraph Analysis of Dictionary Preposition definition , 2002, SENSEVAL.

[50]  Timothy Baldwin,et al.  Semantic role labeling of prepositional phrases , 2006, TALIP.

[51]  Kenneth C. Litkowski,et al.  Coverage and Inheritance in The Preposition Project , 2006, ACL 2006.

[52]  Chutima Boonthum-Denecke,et al.  Preposition Senses: Generalized Disambiguation Model , 2006, CICLing.

[53]  John F. Sowa,et al.  Knowledge representation: logical, philosophical, and computational foundations , 2000 .

[54]  Jason Eisner,et al.  Lexical Semantics , 2020, The Handbook of English Linguistics.