Extraction non supervisée de relations sémantiques lexicales

Nous presentons une base de connaissances comportant des triplets de paires de verbes associes avec une relation semantique/discursive, extraits du corpus francais frWaC par une methode s’appuyant sur la presence d’un connecteur discursif reliant deux verbes. Nous detaillons plusieurs mesures visant a evaluer la pertinence des triplets et la force d’association entre la relation semantique/discursive et la paire de verbes. L’evaluation intrinseque est realisee par rapport a des annotations manuelles. Une evaluation de la couverture de la ressource est egalement realisee par rapport au corpus Annodis annote discursivement. Cette etude produit des resultats prometteurs demontrant l’utilite potentielle de notre ressource pour les tâches d’analyse discursive mais aussi des tâches de nature semantique.

[1]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[2]  Gregory Grefenstette,et al.  Explorations in automatic thesaurus discovery , 1994 .

[3]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[4]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[5]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[6]  Patrick Pantel,et al.  Concept Discovery from Text , 2002, COLING.

[7]  Daniel Marcu,et al.  An Unsupervised Approach to Recognizing Discourse Relations , 2002, ACL.

[8]  Patrick Pantel,et al.  VerbOcean: Mining the Web for Fine-Grained Semantic Verb Relations , 2004, EMNLP.

[9]  Alex Lascarides,et al.  Logics of Conversation , 2005, Studies in natural language processing.

[10]  Joakim Nivre,et al.  MaltParser: A Language-Independent System for Data-Driven Dependency Parsing , 2007, Natural Language Engineering.

[11]  James Pustejovsky,et al.  Classification of Discourse Coherence Relations: An Exploratory Study using Multiple Knowledge Sources , 2006, SIGDIAL Workshop.

[12]  Qin Lu,et al.  Extractive Summarization Based on Event Term Clustering , 2007, ACL.

[13]  Nathanael Chambers,et al.  Unsupervised Learning of Narrative Event Chains , 2008, ACL.

[14]  Alex Lascarides,et al.  Edinburgh Research Explorer Using automatically labelled examples to classify rhetorical relations: an assessment , 2022 .

[15]  Livio Robaldo,et al.  The Penn Discourse TreeBank 2.0. , 2008, LREC.

[16]  Silvia Bernardini,et al.  The WaCky wide web: a collection of very large linguistically processed web-crawled corpora , 2009, Lang. Resour. Evaluation.

[17]  Masaki Murata,et al.  Large-Scale Verb Entailment Acquisition from the Web , 2009, EMNLP.

[18]  Mitsuru Ishizuka,et al.  HILDA: A Discourse Parser Using Support Vector Machine Classification , 2010, Dialogue Discourse.

[19]  Laurence Danlos,et al.  LEXCONN: A French Lexicon of Discourse Connectives , 2010 .

[20]  Pascal Denis,et al.  Statistical French Dependency Parsing: Treebank Conversion and First Results , 2010, LREC.

[21]  Benoît Sagot,et al.  The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French , 2010, LREC.

[22]  Dan Roth,et al.  Minimally Supervised Event Causality Identification , 2011, EMNLP.

[23]  Zornitsa Kozareva Cause-Effect Relation Learning , 2012, TextGraphs@ACL.

[24]  Graeme Hirst,et al.  Text-level Discourse Parsing with Rich Linguistic Features , 2012, ACL.

[25]  Hwee Tou Ng,et al.  A PDTB-styled end-to-end discourse parser , 2012, Natural Language Engineering.

[26]  Ludovic Tanguy,et al.  An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus , 2012, LREC.

[27]  Pascal Denis,et al.  Coupling an annotated corpus and a lexicon for state-of-the-art POS tagging , 2012, Lang. Resour. Evaluation.

[28]  James Pustejovsky,et al.  SemEval-2013 Task 1: TempEval-3: Evaluating Time Expressions, Events, and Temporal Relations , 2013, *SEMEVAL.

[29]  P. Denis,et al.  Identification automatique des relations discursives « implicites » à partir de données annotées et de corpus bruts , 2013 .

[30]  Anette Frank,et al.  A Discriminative Analysis of Fine-Grained Semantic Relations including Presupposition: Annotation and Classification , 2013, Dialogue Discourse.

[31]  Alexis Nasr,et al.  Enforcing Subcategorization Constraints in a Parser Using Sub-parses Recombining , 2013, HLT-NAACL.