Domain Specific Sentiment Dictionary for Opinion Mining of Vietnamese Text

Knowing public opinions from subjective text messages vastly available on the Web is very useful for many different purposes. Technically, extracting efficiently and accurately the opinions from a huge amount of unstructured text messages is challenging. For English language, a common approach to this problem is using sentiment dictionaries. However, building a sentiment dictionary for less popular languages, such as Vietnamese, is difficult and time consuming. This paper proposes an approach to mining public opinions from Vietnamese text using a domain specific sentiment dictionary in order to improve the accuracy. The sentiment dictionary is built incrementally using statistical methods for a specific domain. The efficiency of the approach is demonstrated through an application which is built to extract public opinions on online products and services. Even though this approach is designed initially for Vietnamese text, we believe that it is also applicable to other languages.

[1]  Guoshi Wu,et al.  Aspect Opinion Mining on Customer Reviews , 2011 .

[2]  Mingliang Chen,et al.  Building emotional dictionary for sentiment analysis of online news , 2014, World Wide Web.

[3]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[4]  Akshi Kumar,et al.  Sentiment Analysis: A Perspective on its Past, Present and Future , 2012 .

[5]  Chao-Fu Hong,et al.  Semantic Methods for Knowledge Management and Communication , 2011 .

[6]  Sasha Blair-Goldensohn,et al.  Building a Sentiment Summarizer for Local Service Reviews , 2008 .

[7]  Liangzhong Jiang Proceedings of the 2011 International Conference on Informatics, Cybernetics, and Computer Engineering (ICCE2011) November 19-20, 2011, Melbourne , 2011 .

[8]  Alaa Hamouda,et al.  Building Machine Learning Based Senti-word Lexicon for Sentiment Analysis , 2011 .

[9]  Harith Alani,et al.  Semantic Sentiment Analysis of Twitter , 2012, SEMWEB.

[10]  Chi-Ching Lee,et al.  Unsupervised Opinion Phrase Extraction and Rating in Chinese Blog Posts , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[11]  Quang-Thuy Ha,et al.  A Feature-Based Opinion Mining Model on Product Reviews in Vietnamese , 2011 .

[12]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[13]  Jeff Heflin,et al.  The Semantic Web – ISWC 2012 , 2012, Lecture Notes in Computer Science.

[14]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.