What Happens Next? Future Subevent Prediction Using Contextual Hierarchical LSTM

Events are typically composed of a sequence of subevents. Predicting a future subevent of an ongoing event is of great importance for many real-world applications. Most previous work on event prediction relies on hand-crafted features and can only predict events that already exist in the training data. In this paper, we develop an end-to-end model that directly takes the texts describing previous subevents as input and automatically generates a short text describing a possible future subevent. Our model captures the two-level sequential structure of a subevent sequence: the word sequence within each subevent and the temporal order of the subevents. In addition, our model incorporates the topics of past subevents to make context-aware predictions of future subevents. Extensive experiments on a real-world dataset demonstrate the superiority of our model over several state-of-the-art methods.
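The two-level encoding described above can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: plain tanh RNN cells stand in for the LSTM units, the topic vector is assumed to come from an external topic model, all parameters are random, and the decoder that would generate the next subevent's text is omitted. Every name and dimension below is a hypothetical choice for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

def rnn_encode(inputs, W_x, W_h, b):
    """Run a simple tanh RNN over a sequence and return the final hidden
    state (a stand-in for the LSTM cells used in the paper)."""
    h = np.zeros(W_h.shape[0])
    for x in inputs:
        h = np.tanh(W_x @ x + W_h @ h + b)
    return h

emb_dim, word_hid, event_hid, topic_dim = 8, 16, 16, 4

# Word-level encoder parameters (encodes each subevent's word sequence).
Wx_w = rng.normal(scale=0.1, size=(word_hid, emb_dim))
Wh_w = rng.normal(scale=0.1, size=(word_hid, word_hid))
b_w = np.zeros(word_hid)

# Subevent-level encoder parameters; its input is a subevent vector
# concatenated with the topic (context) vector.
Wx_e = rng.normal(scale=0.1, size=(event_hid, word_hid + topic_dim))
Wh_e = rng.normal(scale=0.1, size=(event_hid, event_hid))
b_e = np.zeros(event_hid)

# Three past subevents, each a sequence of word embeddings (random here).
subevents = [rng.normal(size=(n_words, emb_dim)) for n_words in (5, 7, 4)]
topic = rng.normal(size=topic_dim)  # context vector from a topic model (assumed)

# Level 1: encode each subevent's word sequence into a fixed vector.
subevent_vecs = [rnn_encode(words, Wx_w, Wh_w, b_w) for words in subevents]

# Level 2: encode the temporal order of subevents, conditioning each step
# on the topic vector to make the representation context-aware.
event_state = rnn_encode(
    [np.concatenate([v, topic]) for v in subevent_vecs], Wx_e, Wh_e, b_e
)

print(event_state.shape)  # (16,) -- this state would seed the decoder
```

In the full model, `event_state` would initialize a word-level decoder LSTM that generates the text of the predicted future subevent token by token.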
