Story Understanding with External Knowledge Based Attention

How we use commonsense knowledge to guide the learning of the model is critical for natural language understanding. In this paper, we introduce a keywords extraction weight based word-level attention and a sentiment analysis based sentence-level attention network so as to achieve a better understanding of the story. We train on the publicly available ROC story cloze task data. We mainly focus on two aspects of the narrative: topic correlation and sentiment polar. Since the train data set contains a lot of distractor, we train our model on the validation set and achieve the state of art accuracy 80.5%. We mainly provide a way to use external knowledge to guide training process.

[1]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[2]  Jason Weston,et al.  Translating Embeddings for Modeling Multi-relational Data , 2013, NIPS.

[3]  Lifu Tu,et al.  Pay Attention to the Ending:Strong Neural Baselines for the ROC Story Cloze Task , 2017, ACL.

[4]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[5]  Eric Gilbert,et al.  VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text , 2014, ICWSM.

[6]  Yejin Choi,et al.  Story Cloze Task: UW NLP System , 2017, LSDSem@EACL.

[7]  Dan Roth,et al.  Two Discourse Driven Language Models for Semantics , 2016, ACL.

[8]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[9]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[10]  Nathanael Chambers,et al.  A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories , 2016, NAACL.

[11]  Dan Roth,et al.  Story Comprehension for Predicting What Happens Next , 2017, EMNLP.

[12]  Alex Graves,et al.  Recurrent Models of Visual Attention , 2014, NIPS.

[13]  Snigdha Chaturvedi,et al.  Unsupervised Learning of Evolving Relationships Between Literary Characters , 2017, AAAI.

[14]  Hongyu Lin,et al.  Reasoning with Heterogeneous Knowledge for Commonsense Machine Comprehension , 2017, EMNLP.

[15]  Brendan T. O'Connor,et al.  Learning Latent Personas of Film Characters , 2013, ACL.