Paper information - Stance Detection for the Fake News Challenge with Attention and Conditional Encoding

The in-progress Fake News Challenge is a public competition that tasks participants with developing a stance detection tool that could ultimately be incorporated into a larger automatic fact-checking pipeline. The dataset contains 49,972 body-headline pairs, each labeled "Unrelated", "Discusses", "Agrees", or "Disagrees", and the goal of the stance detection task is to predict these labels. We applied neural attention and conditional encoding to long short-term memory (LSTM) networks, ultimately achieving a preliminary competition score of 0.808, an improvement over the competition baseline of 0.795, which relies on several hand-crafted linguistic features. Four models were evaluated: Bag of Words (BOW), a basic LSTM, an LSTM with attention, and a conditional encoding LSTM with attention (CEA LSTM). The attention models outperformed the simpler models on all performance metrics on the test set. In particular, the models with neural attention achieved significantly higher F1 scores when predicting the infrequent stances "Agrees" and "Disagrees".
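
The abstract reports competition scores (0.808 against a baseline of 0.795) without saying how they are computed. The sketch below shows one plausible reading. It assumes the weighting used by the publicly released FNC-1 scorer: 0.25 credit for getting the related/unrelated distinction right, plus a further 0.75 for the exact stance on a related pair, with the total divided by the score of a perfect prediction. The function names `fnc_score` and `relative_score` are illustrative and do not come from the paper.

```python
# Hypothetical helper names; the weights are an assumption based on the
# publicly released FNC-1 scorer, not something stated in the abstract.
RELATED = {"Agrees", "Disagrees", "Discusses"}


def fnc_score(gold, predicted):
    """Return the raw weighted score for parallel lists of stance labels."""
    score = 0.0
    for g, p in zip(gold, predicted):
        if g == p:
            # Exact match: 0.25 for any label, plus 0.50 more if it is related.
            score += 0.25
            if g != "Unrelated":
                score += 0.50
        if g in RELATED and p in RELATED:
            # Both labels are on the "related" side, even if the stance differs.
            score += 0.25
    return score


def relative_score(gold, predicted):
    """Normalize the raw score by the score of a perfect prediction."""
    best = fnc_score(gold, gold)
    return fnc_score(gold, predicted) / best if best else 0.0


if __name__ == "__main__":
    gold = ["Unrelated", "Agrees", "Disagrees", "Discusses"]
    pred = ["Unrelated", "Discusses", "Disagrees", "Unrelated"]
    print(fnc_score(gold, pred), relative_score(gold, pred))
```

Under this weighting, a correct "Unrelated" prediction earns 0.25 while a correct "Agrees", "Disagrees", or "Discusses" prediction earns 1.0, which is why gains on the infrequent related stances matter for the overall score.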

Stephen Pfohl