Harry Potter and the Action Prediction Challenge from Natural Language

We explore the challenge of action prediction from textual descriptions of scenes, a testbed for approximating whether text inference can be used to predict upcoming actions. As a case study, we consider the world of the Harry Potter fantasy novels and the task of inferring which spell will be cast next given a fragment of a story. Spells act as keywords that abstract actions (e.g. 'Alohomora' to open a door) and denote a response to the environment. We use this idea to automatically build HPAC, a corpus containing 82,836 samples and 85 actions, and then evaluate a range of baselines. Among the tested models, an LSTM-based approach obtains the best performance on frequent actions and long scene descriptions, while approaches such as logistic regression perform well on infrequent actions.
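
The corpus-construction idea is simple to illustrate. The sketch below is a rough approximation rather than the actual HPAC pipeline: it scans a tokenized story for known spell keywords and pairs each occurrence with the preceding text as a (scene description, action) sample. The spell list and context window here are illustrative placeholders, not the paper's settings.

```python
# Minimal sketch of keyword-based sample extraction (assumed pipeline,
# not the actual HPAC construction code).

SPELLS = {"alohomora", "expelliarmus", "lumos"}  # illustrative subset

def extract_samples(tokens, window=128):
    """Return (scene_description, action) pairs from a token list."""
    samples = []
    for i, token in enumerate(tokens):
        if token.lower() in SPELLS:
            # The fragment the reader saw before the spell is the input;
            # the spell itself is the action label to be predicted.
            context = tokens[max(0, i - window):i]
            if context:  # skip spells with no preceding context
                samples.append((" ".join(context), token.lower()))
    return samples

text = "The door was locked . Harry raised his wand and said Alohomora".split()
print(extract_samples(text, window=16))
# [('The door was locked . Harry raised his wand and said', 'alohomora')]
```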

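Of the evaluated baselines, the LSTM classifier can be sketched as follows. This is a minimal, hypothetical PyTorch version written only to show the shape of the task (sequence in, one of 85 action labels out), not the paper's implementation; the embedding size, hidden size, and vocabulary size are placeholders.

```python
import torch
import torch.nn as nn

class LSTMActionPredictor(nn.Module):
    """Hedged sketch: encode a scene description with an LSTM and
    classify it into one of the 85 HPAC actions."""

    def __init__(self, vocab_size, num_actions=85, emb_dim=100, hidden=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden, batch_first=True)
        self.out = nn.Linear(hidden, num_actions)

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) integer-encoded scene description
        embedded = self.embed(token_ids)
        _, (h_n, _) = self.lstm(embedded)  # final hidden state
        return self.out(h_n[-1])           # logits over the action set

model = LSTMActionPredictor(vocab_size=20000)
logits = model(torch.randint(0, 20000, (4, 50)))  # 4 scenes, 50 tokens each
print(logits.shape)  # torch.Size([4, 85])
```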