SA2SL: From Aspect-Based Sentiment Analysis to Social Listening System for Business Intelligence

In this paper, we present a process of building a social listening system based on aspect-based sentiment analysis in Vietnamese from creating a dataset to building a real application. Firstly, we create UITViSFD, a Vietnamese Smartphone Feedback Dataset as a new benchmark corpus built based on a strict annotation schemes for evaluating aspect-based sentiment analysis, consisting of 11,122 human-annotated comments for mobile e-commerce, which is freely available for research purposes. We also present a proposed approach based on the Bi-LSTM architecture with the fastText word embeddings for the Vietnamese aspectbased sentiment task. Our experiments show that our approach achieves the best performances with the F1-score of 84.48% for the aspect task and 63.06% for the sentiment task, which performs several conventional machine learning and deep learning systems. Last but not least, we build SA2SL, a social listening system based on the best performance model on our dataset, which will inspire more social listening systems in future.

[1]  Kiet Van Nguyen,et al.  UIT-VSFC: Vietnamese Students’ Feedback Corpus for Sentiment Analysis , 2018, 2018 10th International Conference on Knowledge and Systems Engineering (KSE).

[2]  Haris Papageorgiou,et al.  SemEval-2016 Task 5: Aspect Based Sentiment Analysis , 2016, *SEMEVAL.

[3]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[4]  Cícero Nogueira dos Santos,et al.  Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts , 2014, COLING.

[5]  Suresh Manandhar,et al.  SemEval-2014 Task 4: Aspect Based Sentiment Analysis , 2014, *SEMEVAL.

[6]  Meikang Qiu,et al.  Reinforcement Learning-based Content-Centric Services in Mobile Sensing , 2018, IEEE Network.

[7]  Hoai Bac Le,et al.  Aspect-Based Sentiment Analysis of Vietnamese Texts with Deep Learning , 2018, ACIIDS.

[8]  Tomas Mikolov,et al.  Enriching Word Vectors with Subword Information , 2016, TACL.

[9]  Dang Van Thin,et al.  A corpus for aspect-based sentiment analysis in Vietnamese , 2019, 2019 11th International Conference on Knowledge and Systems Engineering (KSE).

[10]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[11]  Nidhi Mishra,et al.  Aspect based opinion mining for mobile phones , 2016, 2016 2nd International Conference on Next Generation Computing Technologies (NGCT).

[12]  Suresh Manandhar,et al.  SemEval-2015 Task 12: Aspect Based Sentiment Analysis , 2015, *SEMEVAL.

[13]  John G. Breslin,et al.  A Hierarchical Model of Reviews for Aspect-based Sentiment Analysis , 2016, EMNLP.

[14]  Kiet Van Nguyen,et al.  Hate Speech Detection on Vietnamese Social Media Text using the Bi-GRU-LSTM-CNN Model , 2019, ArXiv.

[15]  Kiet Van Nguyen,et al.  Emotion Recognition for Vietnamese Social Media Text , 2019, PACLING.

[16]  Saumya Chaturvedi,et al.  Sentiment analysis using machine learning for business intelligence , 2017, 2017 IEEE International Conference on Power, Control, Signals and Instrumentation Engineering (ICPCSI).

[17]  Kiet Van Nguyen,et al.  Vietnamese Open-domain Complaint Detection in E-Commerce Websites , 2021, ArXiv.

[18]  Sangeet Srivastava,et al.  Aspect-based Sentiment Analysis on mobile phone reviews with LDA , 2019, ICMLT 2019.

[19]  Thomas J. Watson,et al.  An empirical study of the naive Bayes classifier , 2001 .

[20]  Kiet Van Nguyen,et al.  Hate Speech Detection on Vietnamese Social Media Text using the Bidirectional-LSTM Model , 2019, ArXiv.

[21]  Lei Hu,et al.  A study on the relationship between the rank of input data and the performance of random weight neural network , 2020, Neural Computing and Applications.

[22]  Anupam Basu,et al.  An Agreement Measure for Determining Inter-Annotator Reliability of Human Judgements on Affective Text , 2008, Proceedings of the Workshop on Human Judgements in Computational Linguistics - HumanJudge '08.

[23]  VLSP SHARED TASK: SENTIMENT ANALYSIS , 2019, Journal of Computer Science and Cybernetics.

[24]  Kiet Van Nguyen,et al.  Constructive and Toxic Speech Detection for Open-domain Social Media Comments in Vietnamese , 2021, IEA/AIE.

[25]  Mahmoud Al-Ayyoub,et al.  Deep Recurrent neural network vs. support vector machine for aspect-based sentiment analysis of Arabic hotels' reviews , 2017, J. Comput. Sci..

[26]  Anna Rumshisky,et al.  A Primer in BERTology: What We Know About How BERT Works , 2020, Transactions of the Association for Computational Linguistics.

[27]  Peng Zhou,et al.  Text Classification Improved by Integrating Bidirectional LSTM with Two-dimensional Max Pooling , 2016, COLING.

[28]  Kiet Van Nguyen,et al.  A Simple and Efficient Ensemble Classifier Combining Multiple Neural Network Models on Social Media Datasets in Vietnamese , 2020, ArXiv.

[29]  Thomas Wolf,et al.  Transfer Learning in Natural Language Processing , 2019, NAACL.