On Vietnamese Sentiment Analysis: A Transfer Learning Method

Sentiment analysis (also known as opinion mining or emotion AI) refers to the use of natural language processing (NLP) and text analysis to systematically identify, extract and quantify states and subjective information. Transfer learning (TL) [1] is a study in machine learning focusing on storing knowledge gained while solving one problem and applying it to a different but related problem. In NLP, recent results also demonstrated the effectiveness of models using pre-training on a language modeling task [2], [3]. The transfer learning based models help to rapidly increase understanding of words and sentences arrangement in which semantics and connections are easily grasped. In this paper, we present the results from applying BERT [16], a transfer learning method, in Vietnamese benchmark [17] for one of text classification problems, the Aspect Based Sentiment Analysis problem. The experiments were conducted on two data sets, named Hotel and Restaurant [17], in two task (A) Aspect Detection and (B) Aspect Polarity. The obtained results have outperformed some previous systems [18]–[20] in precision, recall, as well as the F1 measures.

[1]  Roland Vollgraf,et al.  Contextual String Embeddings for Sequence Labeling , 2018, COLING.

[2]  Xiaoning Qian,et al.  Bayesian multi-domain learning for cancer subtype discovery from next-generation sequencing count data , 2018, NeurIPS.

[3]  Hua Xu,et al.  Constrained LDA for Grouping Product Features in Opinion Mining , 2011, PAKDD.

[4]  Bing Liu,et al.  Mining Opinion Features in Customer Reviews , 2004, AAAI.

[5]  Flora D. Salim,et al.  DA-HOC: semi-supervised domain adaptation for room occupancy prediction using CO2 sensor data , 2017, BuildSys@SenSys.

[6]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[7]  Sebastian Ruder,et al.  Universal Language Model Fine-tuning for Text Classification , 2018, ACL.

[8]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[9]  Flora D. Salim,et al.  A Scalable Room Occupancy Prediction with Transferable Time Series Decomposition of CO2 Sensor Data , 2018, ACM Trans. Sens. Networks.

[10]  Grigorios Tsoumakas,et al.  Multi-Label Classification: An Overview , 2007, Int. J. Data Warehous. Min..

[11]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[12]  Rajat Raina,et al.  Constructing informative priors using transfer learning , 2006, ICML.

[13]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[14]  Kiet Van Nguyen,et al.  Deep Learning for Aspect Detection on Vietnamese Reviews , 2018, 2018 5th NAFOSTED Conference on Information and Computer Science (NICS).

[15]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[16]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.

[17]  Alec Radford,et al.  Improving Language Understanding by Generative Pre-Training , 2018 .

[18]  Bikramjit Banerjee,et al.  General Game Learning Using Knowledge Transfer , 2007, IJCAI.

[19]  Lorien Y. Pratt,et al.  Discriminability-Based Transfer between Neural Networks , 1992, NIPS.

[20]  Kaiming He,et al.  Exploring the Limits of Weakly Supervised Pretraining , 2018, ECCV.

[21]  Ilaria Tiddi,et al.  Good location, terrible food: detecting feature sentiment in user-generated reviews , 2013, Social Network Analysis and Mining.

[22]  Ujjwal Bhattacharya,et al.  CNN based common approach to handwritten character recognition of multiple scripts , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[23]  Erik Cambria,et al.  Aspect extraction for opinion mining with a deep convolutional neural network , 2016, Knowl. Based Syst..

[24]  Philip S. Yu,et al.  BERT Post-Training for Review Reading Comprehension and Aspect-based Sentiment Analysis , 2019, NAACL.

[25]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[26]  Andrew Y. Ng,et al.  Transfer learning for text classification , 2005, NIPS.