Modeling Preconditions in Text with a Crowd-sourced Dataset

Preconditions provide a form of logical connection between events that explains why some events occur together, and they carry information complementary to more widely studied relations such as causation, temporal ordering, entailment, and discourse relations. Modeling preconditions in text has been hampered in part by the lack of large-scale labeled data grounded in text. This paper introduces PeKo, a crowd-sourced annotation of preconditions between event pairs in newswire, an order of magnitude larger than prior text annotations. To complement this new corpus, we also introduce two challenge tasks aimed at modeling preconditions: (i) Precondition Identification, a standard classification task defined over pairs of event mentions, and (ii) Precondition Generation, a generative task aimed at testing a more general ability to reason about a given event. Evaluation on both tasks shows that modeling preconditions is challenging even for today's large language models (LMs), which suggests that precondition knowledge is not easily accessible in LM-derived representations alone. Our generation results show that fine-tuning an LM on PeKo yields better conditional relations than training on raw text or temporally ordered corpora.

Niranjan Balasubramanian | Nathanael Chambers | Mahnaz Koupaee | Pratyush Singh | Heeyoung Kwon | Gargi Sawhney | Anmol Shukla | Keerthi Kumar Kallur
