A Simple and Effective biLSTM Approach to Aspect-Based Sentiment Analysis in Social Media Customer Feedback

This paper describes a system for aspect-based sentiment analysis (ABSA) using a straightforward supervised sequence labeling approach. Specifically, we apply a bidirectional recurrent long short-term memory (biLSTM) architecture with a multilayer perceptron on top that predicts the labels token by token. We deal with the issue of rare words by dynamically switching between character-level and token-level representations depending on an occurrence threshold. A simple encoding of the aspects and their sentiments, careful preprocessing of the data, and a generous ensemble of 24 single models together beat the published state-of-the-art results on the GermEval 2017 data set for aspect-based sentiment analysis at the document level (joint prediction of aspect and sentiment, task C). For task D, the opinion target expression (OTE) detection task, our approach improves on the current state of the art by 2.7 to 14.3 percentage points.
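The rare-word handling mentioned above can be illustrated with a minimal sketch. It assumes that "occurrence threshold" means a token's frequency in the training data, and it only shows the switching decision, not the embedding layers or the biLSTM itself. The function names, the toy sentences, and the threshold value of 2 are hypothetical and are not taken from the paper.

```python
from collections import Counter


def build_counts(sentences):
    """Count how often each token occurs in the (tokenized) training data."""
    counts = Counter()
    for sentence in sentences:
        counts.update(sentence)
    return counts


def choose_units(tokens, counts, threshold=2):
    """Pick a representation per token.

    Tokens seen at least `threshold` times keep a token-level unit;
    rarer (or unseen) tokens fall back to their character sequence.
    The threshold value here is an assumption for illustration only.
    """
    units = []
    for tok in tokens:
        if counts[tok] >= threshold:
            units.append(("token", tok))
        else:
            units.append(("chars", tuple(tok)))
    return units


# Toy training data (hypothetical, not from GermEval 2017).
train = [
    ["die", "bahn", "ist", "zu", "spät"],
    ["die", "bahn", "kommt", "pünktlich"],
]
counts = build_counts(train)
units = choose_units(["die", "bahn", "streikt"], counts, threshold=2)
```

In this sketch, frequent tokens such as "die" and "bahn" would be looked up in a token embedding table, while the unseen "streikt" would be passed to a character-level encoder; how the two representations are combined downstream is left open here.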
