Unsupervised extraction of semantic relations using discourse cues

This paper presents a knowledge base containing triples involving pairs of verbs associated with semantic or discourse relations. The relations in these triples are marked by discourse connectors between two adjacent instances of the verbs in the triple in the large French corpus, frWaC. We detail several measures that evaluate the relevance of the triples and the strength of their association. We use manual annotations to evaluate our method, and also study the coverage of our resource with respect to the discourse annotated corpus Annodis. Our positive results show the potential impact of our resource for discourse analysis tasks as well as other semantically oriented tasks like temporal and causal information extraction

[1]  Gregory Grefenstette,et al.  Explorations in automatic thesaurus discovery , 1994 .

[2]  Pascal Denis,et al.  Constrained Decoding for Text-Level Discourse Parsing , 2012, COLING.

[3]  Silvia Bernardini,et al.  The WaCky wide web: a collection of very large linguistically processed web-crawled corpora , 2009, Lang. Resour. Evaluation.

[4]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[5]  Pascal Denis,et al.  Statistical French Dependency Parsing: Treebank Conversion and First Results , 2010, LREC.

[6]  Daniel Marcu,et al.  An Unsupervised Approach to Recognizing Discourse Relations , 2002, ACL.

[7]  P. Denis,et al.  Identification automatique des relations discursives « implicites » à partir de données annotées et de corpus bruts , 2013 .

[8]  Dan Roth,et al.  Minimally Supervised Event Causality Identification , 2011, EMNLP.

[9]  Masaki Murata,et al.  Large-Scale Verb Entailment Acquisition from the Web , 2009, EMNLP.

[10]  Graeme Hirst,et al.  Text-level Discourse Parsing with Rich Linguistic Features , 2012, ACL.

[11]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[12]  Livio Robaldo,et al.  The Penn Discourse TreeBank 2.0. , 2008, LREC.

[13]  Alexis Nasr,et al.  Enforcing Subcategorization Constraints in a Parser Using Sub-parses Recombining , 2013, HLT-NAACL.

[14]  Benoît Sagot,et al.  The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French , 2010, LREC.

[15]  Alex Lascarides,et al.  Logics of Conversation , 2005, Studies in natural language processing.

[16]  Hwee Tou Ng,et al.  A PDTB-styled end-to-end discourse parser , 2012, Natural Language Engineering.

[17]  Patrick Pantel,et al.  Concept Discovery from Text , 2002, COLING.

[18]  Zornitsa Kozareva Cause-Effect Relation Learning , 2012, TextGraphs@ACL.

[19]  Pascal Denis,et al.  Coupling an annotated corpus and a lexicon for state-of-the-art POS tagging , 2012, Lang. Resour. Evaluation.

[20]  Nathanael Chambers,et al.  Unsupervised Learning of Narrative Event Chains , 2008, ACL.

[21]  Alex Lascarides,et al.  Edinburgh Research Explorer Using automatically labelled examples to classify rhetorical relations: an assessment , 2022 .

[22]  Laurence Danlos,et al.  LEXCONN: A French Lexicon of Discourse Connectives , 2010 .

[23]  James Pustejovsky,et al.  Classification of Discourse Coherence Relations: An Exploratory Study using Multiple Knowledge Sources , 2006, SIGDIAL Workshop.

[24]  Ludovic Tanguy,et al.  An empirical resource for discovering cognitive principles of discourse organisation: the ANNODIS corpus , 2012, LREC.

[25]  Qin Lu,et al.  Extractive Summarization Based on Event Term Clustering , 2007, ACL.

[26]  Patrick Pantel,et al.  VerbOcean: Mining the Web for Fine-Grained Semantic Verb Relations , 2004, EMNLP.

[27]  James Pustejovsky,et al.  SemEval-2013 Task 1: TempEval-3: Evaluating Time Expressions, Events, and Temporal Relations , 2013, *SEMEVAL.

[28]  Jean Carletta,et al.  Assessing Agreement on Classification Tasks: The Kappa Statistic , 1996, CL.

[29]  Anette Frank,et al.  A Discriminative Analysis of Fine-Grained Semantic Relations including Presupposition: Annotation and Classification , 2013, Dialogue Discourse.

[30]  Yuji Matsumoto MaltParser: A language-independent system for data-driven dependency parsing , 2005 .

[31]  Mitsuru Ishizuka,et al.  HILDA: A Discourse Parser Using Support Vector Machine Classification , 2010, Dialogue Discourse.