Extract, Attend, Predict: Aspect-Based Sentiment Analysis with Deep Self-Attention Network

Aspect-based sentiment analysis aims to predict sentiment polarities for given aspect terms in a sentence. Previous work typically encodes the aspect and the sentence separately, with either RNNs or CNNs along with sophisticated attention mechanisms. However, CNNs and RNNs suffer from problems such as restricted local receptive field and long-term dependency, respectively. Besides, separately encoding aspects and sentences also results in problems such as the aspect has no context information and neighboring aspects are not considered. To address these problems, we propose a novel approach that conducts an extract-attend-predict process with deep self-attention for aspect-based sentiment analysis. Unlike previous methods that use either RNNs or CNNs as the basic encoder, we utilizes a pre-trained deep self-attention encoder to avoid the difficulty in capturing long-distance words. Moreover, instead of performing separately encoding, our model directly extracts the aspect representation from contextualized sentence representations based on the span boundary of target aspect. A multi-granularity attending mechanism is further applied to capture the interaction between aspects and sentences, which is later used to predict the sentiment polarity. We conduct experiments on two benchmark datasets and the results show that our approach outperforms previous state-of-the-art models.

[1]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[2]  S. C. Kremer,et al.  Gradient Flow in Recurrent Nets: the Difficulty of Learning Long-Term Dependencies , 2001 .

[3]  Houfeng Wang,et al.  Interactive Attention Networks for Aspect-Level Sentiment Classification , 2017, IJCAI.

[4]  Jason Weston,et al.  End-To-End Memory Networks , 2015, NIPS.

[5]  Asif Ekbal,et al.  IARM: Inter-Aspect Relation Modeling with Memory Networks in Aspect-Based Sentiment Analysis , 2018, EMNLP.

[6]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[7]  Suresh Manandhar,et al.  SemEval-2014 Task 4: Aspect Based Sentiment Analysis , 2014, *SEMEVAL.

[8]  Ting Liu,et al.  Aspect Level Sentiment Classification with Deep Memory Network , 2016, EMNLP.

[9]  Tao Li,et al.  Aspect Based Sentiment Analysis with Gated Convolutional Networks , 2018, ACL.

[10]  Chao Yang,et al.  Aspect-based sentiment analysis with alternating coattention networks , 2019, Inf. Process. Manag..

[11]  Lidong Bing,et al.  Recurrent Attention Network on Memory for Aspect Sentiment Analysis , 2017, EMNLP.

[12]  Yue Zhang,et al.  Attention Modeling for Targeted Sentiment , 2017, EACL.

[13]  Kathleen M. Carley,et al.  Parameterized Convolutional Neural Networks for Aspect Level Sentiment Classification , 2019, EMNLP.

[14]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[15]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[16]  Siu Cheung Hui,et al.  Dyadic Memory Networks for Aspect-based Sentiment Analysis , 2017, CIKM.

[17]  Tiejun Zhao,et al.  Target-dependent Twitter Sentiment Classification , 2011, ACL.

[18]  Dongyan Zhao,et al.  Multi-grained Attention Network for Aspect-Level Sentiment Classification , 2018, EMNLP.

[19]  Xiaocheng Feng,et al.  Effective LSTMs for Target-Dependent Sentiment Classification , 2015, COLING.

[20]  Sebastian Ruder,et al.  Universal Language Model Fine-tuning for Text Classification , 2018, ACL.

[21]  Li Zhao,et al.  Attention-based LSTM for Aspect-level Sentiment Classification , 2016, EMNLP.

[22]  Siu Cheung Hui,et al.  Learning to Attend via Word-Aspect Associative Fusion for Aspect-based Sentiment Analysis , 2017, AAAI.

[23]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[24]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[25]  Joachim Wagner,et al.  DCU: Aspect-based Polarity Classification for SemEval Task 4 , 2014, *SEMEVAL.

[26]  Kathleen M. Carley,et al.  Aspect Level Sentiment Classification with Attention-over-Attention Neural Networks , 2018, SBP-BRiMS.

[27]  Quoc V. Le,et al.  Semi-supervised Sequence Learning , 2015, NIPS.

[28]  Yue Zhang,et al.  Target-Dependent Twitter Sentiment Classification with Rich Automatic Features , 2015, IJCAI.

[29]  Saif Mohammad,et al.  NRC-Canada-2014: Detecting Aspects and Sentiment in Customer Reviews , 2014, *SEMEVAL.