Automatic Labeling of Semantic Roles

We present a system for identifying the semantic relationships, or semantic roles, filled by constituents of a sentence within a semantic frame. Given an input sentence and a target word and frame, the system labels constituents with either abstract semantic roles, such as Agent or Patient, or more domain-specific semantic roles, such as Speaker, Message, and Topic. The system is based on statistical classifiers trained on roughly 50,000 sentences that were hand-annotated with semantic roles by the FrameNet semantic labeling project. We then parsed each training sentence into a syntactic tree and extracted various lexical and syntactic features, including the phrase type of each constituent, its grammatical function, and its position in the sentence. These features were combined with knowledge of the predicate verb, noun, or adjective, as well as information such as the prior probabilities of various combinations of semantic roles. We used various lexical clustering algorithms to generalize across possible fillers of roles. Test sentences were parsed, were annotated with these features, and were then passed through the classifiers. Our system achieves 82 accuracy in identifying the semantic role of presegmented constituents. At the more difficult task of simultaneously segmenting constituents and identifying their semantic role, the system achieved 65 precision and 61 recall. Our study also allowed us to compare the usefulness of different features and feature combination methods in the semantic role labeling task. We also explore the integration of role labeling with statistical syntactic parsing and attempt to generalize to predicates unseen in the training data.

[1]  S. Siegel,et al.  Nonparametric Statistics for the Behavioral Sciences , 2022, The SAGE Encyclopedia of Research Design.

[2]  Rosane Rocher "Agent" et "Objet" chez Pāṇini@@@"Agent" et "Objet" chez Panini , 1964 .

[3]  C. Fillmore Some Problems for Case Grammar , 1971 .

[4]  Roger C. Schank,et al.  Conceptual dependency: A theory of natural language understanding , 1972 .

[5]  Ray Jackendoff,et al.  Semantic Interpretation in Generative Grammar , 1972 .

[6]  Terry Winograd,et al.  Understanding natural language , 1974 .

[7]  C. Fillmore FRAME SEMANTICS AND THE NATURE OF LANGUAGE * , 1976 .

[8]  Frederick Jelinek,et al.  Interpolated estimation of Markov source parameters from sparse data , 1980 .

[9]  C. Pollard,et al.  Center for the Study of Language and Information , 2022 .

[10]  Stuart M. Shieber,et al.  An Introduction to Unification-Based Approaches to Grammar , 1986, CSLI Lecture Notes.

[11]  Charles J. Fillmore,et al.  Pragmatically Controlled Zero Anaphora , 1986 .

[12]  Harold L. Somers,et al.  Valency and case in computational linguistics , 1987 .

[13]  John Cocke,et al.  A Statistical Approach to Machine Translation , 1990, CL.

[14]  David R. Dowty Thematic proto-roles and argument selection , 1991 .

[15]  R. Valin A synopsis of role and reference grammar , 1992 .

[16]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[17]  Naftali Tishby,et al.  Distributional Clustering of English Words , 1993, ACL.

[18]  Beth Levin,et al.  English Verb Classes and Alternations: A Preliminary Investigation , 1993 .

[19]  Ellen Riloff,et al.  Automatically Constructing a Dictionary for Information Extraction Tasks , 1993, AAAI.

[20]  Ivan A. Sag,et al.  Book Reviews: Head-driven Phrase Structure Grammar and German in Head-driven Phrase-structure Grammar , 1996, CL.

[21]  Ann Bies,et al.  The Penn Treebank: Annotating Predicate Argument Structure , 1994, HLT.

[22]  Yajan Veer Pāṇini as a linguist: Ideas and patterns , 1995 .

[23]  David Yarowsky,et al.  Unsupervised Word Sense Disambiguation Rivaling Supervised Methods , 1995, ACL.

[24]  Mats Rooth,et al.  Two-dimensional clusters in grammatical relations , 1995 .

[25]  David M. Magerman Statistical Decision-Tree Models for Parsing , 1995, ACL.

[26]  B. Levin,et al.  From Lexical Semantics to Argument Realization , 1996 .

[27]  Richard M. Schwartz,et al.  A Fully Statistical Approach to Natural Language Interfaces , 1996, ACL.

[28]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[29]  Eugene Charniak,et al.  Statistical Parsing with a Context-Free Grammar and Word Statistics , 1997, AAAI/IAAI.

[30]  Adwait Ratnaparkhi,et al.  A Linear Observed Time Statistical Parser Based on Maximum Entropy Models , 1997, EMNLP.

[31]  Steven P. Abney Stochastic Attribute-Value Grammars , 1996, CL.

[32]  John D. Lafferty,et al.  Inducing Features of Random Fields , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  Emmanuel Roche,et al.  Finite-State Language Processing , 1997 .

[34]  Douglas E. Appelt,et al.  FASTUS: A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text , 1997, ArXiv.

[35]  Thomas Hofmann,et al.  Statistical Models for Co-occurrence Data , 1998 .

[36]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[37]  Jerome R. Bellegarda,et al.  Exploiting both local and global constraints for multi-span statistical language modeling , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[38]  Ellen Riloff,et al.  An Empirical Approach to Conceptual Case Frame Acquisition , 1998, VLC@COLING/ACL.

[39]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[40]  Mats Rooth,et al.  Valence Induction with a Head-Lexicalized PCFG , 1998, EMNLP.

[41]  Marti A. Hearst Untangling Text Data Mining , 1999, ACL.

[42]  Thomas Hofmann,et al.  Topic-based language models using EM , 1999, EUROSPEECH.

[43]  Mats Rooth,et al.  Inducing a Semantically Annotated Lexicon via EM-Based Clustering , 1999, ACL.

[44]  Mirella Lapata,et al.  Using Subcategorization to Resolve Verb Class Ambiguity , 1999, EMNLP.

[45]  Frederick Jelinek,et al.  Recognition performance of a structured language model , 2000, EUROSPEECH.

[46]  David Stallard,et al.  Talk'N'Travel: A Conversational System For Air Travel Planning , 2000, ANLP.

[47]  Diana McCarthy,et al.  Using Semantic Preferences to Identify Verbal Participation in Role Switching Alternations , 2000, ANLP.

[48]  Martha Palmer,et al.  Building a class-based verb lexicon using TAGs , 2000, TAG+.

[49]  Eugene Charniak,et al.  Assigning Function Tags to Parsed Text , 2000, ANLP.

[50]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[51]  Scott Miller,et al.  A Novel Use of Statistical Parsing to Extract Information from Text , 2000, ANLP.

[52]  Josef Ruppenhofer,et al.  The framenet project: tools for lexicon building , 2001 .

[53]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.