Thailand -- Tourism and Conflict: Modeling Sentiment from Twitter Tweets Using Naïve Bayes and Unsupervised Artificial Neural Nets

In this paper we mine over 80 million twitter micro logs in order to explore whether data from this social media initiative can be used to identify sentiment about tourism and Thailand amid the unrest in that country during the early part of 2010 and further whether analysis of tweets can be used to discern the effect of that unrest on Phuket's tourism environment. It is proposed that this analysis can provide measurable insights through summarization, keyword analysis and clustering. We measure sentiment using a binary choice keyword algorithm. A multi-knowledge based approach is proposed using, Self-Organizing Maps along with sentiment polarity in order to model sentiment. We develop a visual model to express a sentiment concept vocabulary and then apply this model to maximums and minimums in the time series sentiment data. The results show actionable knowledge can be extracted in real time.

[1]  Ajit Narayanan,et al.  Unsupervised Artificial Neural Nets for Modelling the Effects of Climate Change on New Zealand Grape Wines , 2009 .

[2]  Xiaoyan Zhu,et al.  Movie review mining and summarization , 2006, CIKM '06.

[3]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[4]  Samuel Kaski,et al.  Mining massive document collections by the WEBSOM method , 2004, Inf. Sci..

[5]  Björn W. Schuller,et al.  “The Godfather” vs. “Chaos”: Comparing Linguistic Analysis Based on On-line Knowledge Sources and Bags-of-N-Grams for Movie Review Valence Estimation , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[6]  Bernard J. Jansen,et al.  Twitter power: Tweets as electronic word of mouth , 2009 .

[7]  Namhee Kwon,et al.  Multidimensional text analysis for eRulemaking , 2006, DG.O.

[8]  Alistair Kennedy,et al.  SENTIMENT CLASSIFICATION of MOVIE REVIEWS USING CONTEXTUAL VALENCE SHIFTERS , 2006, Comput. Intell..

[9]  Barry Smyth,et al.  Using twitter to recommend real-time topical news , 2009, RecSys '09.

[10]  Bing Liu,et al.  Identifying comparative sentences in text documents , 2006, SIGIR.

[11]  F. Scientific Online Shopping , 1998, Current Biology.

[12]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[13]  Claire Cardie,et al.  Using natural language processing to improve eRulemaking: project highlight , 2006, DG.O.

[14]  G. Shoba,et al.  Classification of Documents Using Kohonen’s Self-Organizing Map , 2009 .