Exploiting BERT with Global-Local Context and Label Dependency for Aspect Term Extraction

Aspect term extraction (ATE) is a subtask of aspect-based sentiment analysis (ABSA), which aims to extract all aspect-specific words in a sentence. Recent neural network methods ignore the problem that word may play different semantic roles in different sentences and have limitation in handling dependencies between labels. In this work, we first exploit BERT as embedding layer to obtain word-level representations and utilize BERT architecture to capture global sequence features. Then, a position-aware attention is proposed to extract local context information. Global-local context representations of words are built by merging the global sequence features and local context information, which can select related information from both sides: global sequence and local context. Finally, to model the label dependency, we construct a label dependency module based on RNN and CRF, where the previous label features are introduced as additional information for label relationship modeling. Experimental results on four benchmark datasets show that our proposed model obtains the state-of-the-art performance.

[1]  Eduard H. Hovy,et al.  End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF , 2016, ACL.

[2]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[3]  Maryna Chernyshevich,et al.  IHS R&D Belarus: Cross-domain extraction of product features using CRF , 2014, *SEMEVAL.

[4]  Luke S. Zettlemoyer,et al.  Deep Contextualized Word Representations , 2018, NAACL.

[5]  Shengyi Jiang,et al.  Recurrent Neural CRF for Aspect Term Extraction with Dependency Transmission , 2018, NLPCC.

[6]  Alec Radford,et al.  Improving Language Understanding by Generative Pre-Training , 2018 .

[7]  Ilya Sutskever,et al.  Language Models are Unsupervised Multitask Learners , 2019 .

[8]  Xing Xie,et al.  Exploring Sequence-to-Sequence Learning in Aspect Term Extraction , 2019, ACL.

[9]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[10]  Qing Wang,et al.  MTNA: A Neural Multi-task Model for Aspect Category Classification and Aspect Term Extraction On Restaurant Reviews , 2017, IJCNLP.

[11]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[12]  Jian Su,et al.  NLANGP at SemEval-2016 Task 5: Improving Aspect Based Sentiment Analysis using Neural Network Features , 2016, *SEMEVAL.

[13]  Jianfei Yu,et al.  Recurrent Neural Networks with Auxiliary Labels for Cross-Domain Opinion Target Extraction , 2017, AAAI.

[14]  P. Deepa Shenoy,et al.  Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier , 2016, World Wide Web.

[15]  Xin Li,et al.  Deep Multi-Task Learning for Aspect Term Extraction with Memory Interaction , 2017, EMNLP.

[16]  Claudiu Musat,et al.  Unsupervised Aspect Term Extraction with B-LSTM & CRF using Automatically Labelled Datasets , 2017, WASSA@EMNLP.

[17]  Xiaojun Wan,et al.  Representation Learning for Aspect Category Detection in Online Reviews , 2015, AAAI.

[18]  Guillaume Lample,et al.  Cross-lingual Language Model Pretraining , 2019, NeurIPS.

[19]  Erik Cambria,et al.  Aspect extraction for opinion mining with a deep convolutional neural network , 2016, Knowl. Based Syst..

[20]  Hwee Tou Ng,et al.  An Unsupervised Neural Attention Model for Aspect Extraction , 2017, ACL.

[21]  Claire Cardie,et al.  Investigating LSTMs for Joint Extraction of Opinion Entities and Relations , 2016, ACL.

[22]  Xin Li,et al.  Aspect Term Extraction with History Attention and Selective Transformation , 2018, IJCAI.

[23]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[24]  Philip S. Yu,et al.  Double Embeddings and CNN-based Sequence Labeling for Aspect Extraction , 2018, ACL.

[25]  Yiming Yang,et al.  XLNet: Generalized Autoregressive Pretraining for Language Understanding , 2019, NeurIPS.

[26]  Xiaokui Xiao,et al.  Recursive Neural Conditional Random Fields for Aspect-based Sentiment Analysis , 2016, EMNLP.

[27]  Suresh Manandhar,et al.  SemEval-2014 Task 4: Aspect Based Sentiment Analysis , 2014, *SEMEVAL.

[28]  Andrzej Zolnierek,et al.  The Empirical Study of the Naive Bayes Classifier in the Case of Markov Chain Recognition Task , 2005, CORES.

[29]  Ming Zhou,et al.  Unsupervised Word and Dependency Path Embeddings for Aspect Term Extraction , 2016, IJCAI.

[30]  Bin Wang,et al.  Improving Aspect Term Extraction With Bidirectional Dependency Tree Representation , 2018, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[31]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[32]  Zhiqiang Toh,et al.  DLIREC: Aspect Term Extraction and Term Polarity Classification System , 2014, *SEMEVAL.

[33]  Haris Papageorgiou,et al.  SemEval-2016 Task 5: Aspect Based Sentiment Analysis , 2016, *SEMEVAL.

[34]  Bing Liu,et al.  DOER: Dual Cross-Shared RNN for Aspect Term-Polarity Co-Extraction , 2019, ACL.

[35]  Xiaokui Xiao,et al.  Coupled Multi-Layer Attentions for Co-Extraction of Aspect and Opinion Terms , 2017, AAAI.

[36]  Sebastian Ruder,et al.  Universal Language Model Fine-tuning for Text Classification , 2018, ACL.