Feature Based Sentiment Analysis of Tweets in Multiple Languages

Feature based sentiment analysis is normally conducted on review websites, since it is difficult to extract accurate product features from tweets. However, Twitter users express sentiment towards a large variety of products in many different languages. In addition, sentiment expressed on Twitter is more up to date and represents the sentiment of a larger population than review articles do. Therefore, we propose a method that identifies product features using review articles and then conducts sentiment analysis on tweets containing those features. In this way, we can increase the precision of feature extraction by up to 40% compared to extracting features directly from tweets. Moreover, our method translates and matches the features extracted for multiple languages and ranks them based on how frequently they are mentioned in the tweets of each language. By doing this, we can highlight the features that are most relevant for multilingual analysis.
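
The ranking step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `rank_features`, the data layout, and the sample tweets are all hypothetical, and it assumes the features have already been extracted from reviews and translated into each language. Matching is done with a plain case-insensitive substring test, which is a simplification.

```python
from collections import Counter


def rank_features(tweets_by_lang, features_by_lang):
    """Rank features per language by the number of tweets mentioning them.

    tweets_by_lang:   {language: [tweet text, ...]}
    features_by_lang: {language: {canonical feature: surface form in that language}}
    Both layouts are assumptions made for this sketch.
    """
    rankings = {}
    for lang, tweets in tweets_by_lang.items():
        counts = Counter()
        for tweet in tweets:
            text = tweet.lower()
            for canonical, surface in features_by_lang[lang].items():
                # Count each tweet at most once per feature.
                if surface.lower() in text:
                    counts[canonical] += 1
        # most_common() sorts by descending mention count.
        rankings[lang] = counts.most_common()
    return rankings


# Hypothetical features, already matched across languages via translation.
features_by_lang = {
    "en": {"battery": "battery", "screen": "screen"},
    "de": {"battery": "Akku", "screen": "Bildschirm"},
}

# Hypothetical tweets, invented for illustration only.
tweets_by_lang = {
    "en": [
        "The battery lasts all day",
        "Battery drains fast, but the screen is great",
    ],
    "de": [
        "Der Bildschirm ist super",
        "Bildschirm zu dunkel",
        "Akku hält lange",
    ],
}

rankings = rank_features(tweets_by_lang, features_by_lang)
for lang, ranked in rankings.items():
    print(lang, ranked)
```

Keying every language's counts by the same canonical feature name makes the rankings directly comparable, so a feature that is discussed often in one language but rarely in another stands out. A real system would likely need tokenization and morphological handling rather than substring matching.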
