An Approach of Semi-automatic Public Sentiment Analysis for Opinion and District

Content generated by netizens on the Web reflects public sentiment to a great extent, so analyzing it is very useful to government agencies in guiding their public information, propaganda programs, and decision support. Because of cultural diversity and economic differences, netizens living or working in different districts may hold different sentiments about the same topic or event. Analyzing how sentiment differs across districts helps government agencies make better-targeted decisions. However, existing research in this domain has paid little attention to how opinions are distributed across districts. In this paper, we propose an approach to semi-automatic public sentiment analysis for opinion and district. It combines automatic steps (data acquisition, sentiment modeling, opinion clustering, and district clustering) with manual steps (threshold setting and result analysis). Specifically, we first group public sentiment into opinion clusters by means of a clustering technique. We then partition every opinion cluster by district into district opinions and analyze the result. Experimental results on Tencent comments show the feasibility and validity of our approach.
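The two-stage idea (opinion clustering first, then a per-district partition of each cluster) can be illustrated with a minimal sketch. This is not the method of the paper, whose sentiment model and clustering algorithm are not specified in the abstract. It assumes whitespace-tokenized bag-of-words vectors, cosine similarity, and a single-pass clustering rule driven by a manually chosen threshold. The function names, the `text` and `district` fields, and the sample comments are all hypothetical.

```python
from collections import Counter, defaultdict
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def cluster_opinions(comments, threshold):
    """Single-pass clustering (an assumption, not the paper's algorithm):
    a comment joins the most similar cluster if the similarity reaches the
    manually set threshold, otherwise it starts a new cluster."""
    clusters = []  # each cluster: {"centroid": Counter, "members": [comment, ...]}
    for comment in comments:
        vec = Counter(comment["text"].lower().split())
        best, best_sim = None, threshold
        for cluster in clusters:
            sim = cosine(vec, cluster["centroid"])
            if sim >= best_sim:
                best, best_sim = cluster, sim
        if best is None:
            clusters.append({"centroid": Counter(vec), "members": [comment]})
        else:
            best["centroid"].update(vec)  # accumulate term counts as the centroid
            best["members"].append(comment)
    return clusters


def split_by_district(clusters):
    """For every opinion cluster, count how many comments come from each district."""
    result = []
    for cluster in clusters:
        counts = defaultdict(int)
        for comment in cluster["members"]:
            counts[comment["district"]] += 1
        result.append(dict(counts))
    return result


# Hypothetical sample data, for illustration only.
comments = [
    {"text": "fuel price increase is unfair", "district": "Beijing"},
    {"text": "the fuel price increase is unfair to drivers", "district": "Guangdong"},
    {"text": "support the new policy", "district": "Beijing"},
    {"text": "i support the new policy", "district": "Beijing"},
]
clusters = cluster_opinions(comments, threshold=0.5)
district_opinion = split_by_district(clusters)
```

With these four comments and a threshold of 0.5, the sketch yields two opinion clusters; the first is shared by Beijing and Guangdong, while the second comes from Beijing only. Raising or lowering the threshold changes how finely opinions are separated, which is where the manual setting step would enter.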
