Tweet Stance Detection Using an Attention based Neural Ensemble Model

Stance detection in twitter aims at mining user stances expressed in a tweet towards a single or multiple target entities. To tackle this problem, most of the prior studies have been explored the traditional deep learning models, e.g., LSTM and GRU. However, in compared to these traditional approaches, recently proposed densely connected Bi-LSTM and nested LSTMs architectures effectively address the vanishing-gradient and overfitting problems as well as dealing with long-term dependencies. In this paper, we propose a neural ensemble model that adopts the strengths of these two LSTM variants to learn better long-term dependencies, where each module coupled with an attention mechanism that amplifies the contribution of important elements in the final representation. We also employ a multi-kernel convolution on top of them to extract the higher-level tweet representations. Results of extensive experiments on single and multi-target stance detection datasets show that our proposed method achieves substantial improvement over the current state-of-the-art deep learning based methods.

[1]  Xiao Zhang,et al.  pkudblab at SemEval-2016 Task 6 : A Specific Convolutional Neural Network System for Effective Stance Detection , 2016, *SEMEVAL.

[2]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[3]  Joel Ruben Antony Moniz,et al.  Nested LSTMs , 2018, ACML.

[4]  Junjie Lin,et al.  Multi-Target Stance Detection via a Dynamic Memory-Augmented Network , 2018, SIGIR.

[5]  Nikos Pelekis,et al.  DataStories at SemEval-2017 Task 4: Deep LSTM with Attention for Message-level and Topic-based Sentiment Analysis , 2017, *SEMEVAL.

[6]  Zhiyuan Liu,et al.  A C-LSTM Neural Network for Text Classification , 2015, ArXiv.

[7]  Saif Mohammad,et al.  Stance and Sentiment in Tweets , 2016, ACM Trans. Internet Techn..

[8]  Lei Shi,et al.  Connecting Targets to Tweets: Semantic Attention-Based Model for Target-Specific Stance Detection , 2017, WISE.

[9]  Daniel P. W. Ellis,et al.  Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems , 2015, ArXiv.

[10]  Saroj Kaushik,et al.  Topical Stance Detection for Twitter: A Two-Phase LSTM Model Using Attention , 2018, ECIR.

[11]  Chuhan Wu,et al.  THU_NGN at IJCNLP-2017 Task 2: Dimensional Sentiment Analysis for Chinese Phrases with Deep LSTM , 2017, IJCNLP.

[12]  Diana Inkpen,et al.  A Dataset for Multi-Target Stance Detection , 2017, EACL.

[13]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[14]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[15]  Wenji Mao,et al.  A Target-Guided Neural Memory Model for Stance Detection in Twitter , 2018, 2018 International Joint Conference on Neural Networks (IJCNN).

[16]  Yoon Kim,et al.  Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[17]  Saif Mohammad,et al.  SemEval-2016 Task 6: Detecting Stance in Tweets , 2016, *SEMEVAL.

[18]  Yulan He,et al.  Stance Classification with Target-Specific Neural Attention Networks , 2017 .

[19]  Chen Huang,et al.  Multimodal Gesture Recognition Using Densely Connected Convolution and BLSTM , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[20]  Xiang Li,et al.  Densely Connected Bidirectional LSTM with Applications to Sentence Classification , 2018, NLPCC.

[21]  Guido Zarrella,et al.  MITRE at SemEval-2016 Task 6: Transfer Learning for Stance Detection , 2016, *SEMEVAL.