LIARc: Labeling Implicit ARguments in Spanish Deverbal Nominalizations

This paper deals with the automatic identification and annotation of the implicit arguments of deverbal nominalizations in Spanish. We present the first version of the LIAR system, focusing on its classifier component. We have built a supervised, feature-based Machine Learning model that uses a subset of AnCora-Es as a training corpus. We have built four different models, and the overall F-Measure is 89.9%, an increase of approximately 35 points over the baseline (55%). However, a detailed analysis of the feature performance is still needed. Future work will focus on using LIAR to automatically annotate the implicit arguments in the whole of AnCora-Es.
