Bias‐Sentiment‐Topic model for microblog sentiment analysis

Unified models of sentiment and topic have been widely employed in unsupervised sentiment analysis, where each word in text carries both sentiment and topic information. In fact, however, some words tend to express objective things while others prefer to express subjective sentiments. Based on this observation, the concept of word bias is put forward firstly, including objective bias and subjective bias. Considering the relations of bias, sentiment, and topic, a unified framework named Bias‐Sentiment‐Topic (BST) model is then presented to jointly model them for microblog sentiment analysis. After that, an improved Gibbs sampler is proposed for the inference of BST by introducing the general Pólya urn model, which incorporates word embedding as the general knowledge. Finally, experiments on standard test datasets illustrate major improvements of BST in sentiment classification and its effectiveness in separation of words with different biases.

[1]  Bing Liu,et al.  Mining Aspect-Specific Opinion using a Holistic Lifelong Topic Model , 2016, WWW.

[2]  Lun-Wei Ku,et al.  Using Polarity Scores of Words for Sentence-level Opinion Extraction , 2007 .

[3]  Yiqun Liu,et al.  Lexicon-Based Sentiment Analysis on Topical Chinese Microblog Messages , 2012, CSWS.

[4]  Andrew McCallum,et al.  Optimizing Semantic Coherence in Topic Models , 2011, EMNLP.

[5]  Stefan M. Rüger,et al.  Weakly Supervised Joint Sentiment-Topic Detection from Text , 2012, IEEE Transactions on Knowledge and Data Engineering.

[6]  Mike Thelwall,et al.  The Heart and Soul of the Web? Sentiment Strength Detection in the Social Web with SentiStrength , 2017 .

[7]  Maite Taboada,et al.  Lexicon-Based Methods for Sentiment Analysis , 2011, CL.

[8]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[9]  Bing Liu,et al.  Mining topics in documents: standing on the shoulders of big data , 2014, KDD.

[10]  Yulan He,et al.  A Comparative Study of Bayesian Models for Unsupervised Sentiment Detection , 2010, CoNLL.

[11]  Björn W. Schuller,et al.  New Avenues in Opinion Mining and Sentiment Analysis , 2013, IEEE Intelligent Systems.

[12]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[13]  J. Pennebaker,et al.  Psychological aspects of natural language. use: our words, our selves. , 2003, Annual review of psychology.

[14]  Alan F. Smeaton,et al.  Classifying sentiment in microblogs: is brevity an advantage? , 2010, CIKM.

[15]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Huan Liu,et al.  Exploiting social relations for sentiment analysis in microblogging , 2013, WSDM.

[17]  Yulan He,et al.  Joint sentiment/topic model for sentiment analysis , 2009, CIKM.

[18]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[19]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[20]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[21]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[22]  Xu Ling,et al.  Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[23]  Hosam M. Mahmoud,et al.  Polya Urn Models , 2008 .

[24]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[25]  Yiqun Liu,et al.  Microblog Sentiment Analysis with Emoticon Space Model , 2014, Journal of Computer Science and Technology.

[26]  Andrew McCallum,et al.  Rethinking LDA: Why Priors Matter , 2009, NIPS.

[27]  Ke Xu,et al.  MoodLens: an emoticon-based sentiment analysis system for chinese tweets , 2012, KDD.

[28]  Ming Zhou,et al.  Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification , 2014, ACL.

[29]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[30]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[31]  Huan Liu,et al.  Unsupervised sentiment analysis with emotional signals , 2013, WWW.

[32]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[33]  Dong-Hong Ji,et al.  A topic-enhanced word embedding for Twitter sentiment classification , 2016, Inf. Sci..