Sentiment Analysis of Microblog text based on joint sentiment-topic model

Sentiment Analysis of Microblog text is a challenging work. Microblog text is different with general user-generated text, because it often contains some special symbols such as “@ # //” and a lot of emotional symbols. Previous joint sentiment-topic models did not consider these features of Microblog texts and then could not well model them. In this paper, considering the structural features and content features of Microblog text, we present a semi-supervised joint sentiment-topic model (MB-PL-ASUM). This new model uses semi-structured information and emotional symbol information to classify sentiments of Microblog text without labeling them. The experiments of sentiment classification on real Sina Microblog texts show that MB-PL-ASUM outperforms word matching, JTS and ASUM model.

[1]  Hila Becker,et al.  Beyond Trending Topics: Real-World Event Identification on Twitter , 2011, ICWSM.

[2]  Bernard J. Jansen,et al.  Twitter power: Tweets as electronic word of mouth , 2009 .

[3]  Nicholas Diakopoulos,et al.  Cooooooooooooooollllllllllllll!!!!!!!!!!!!!! Using Word Lengthening to Detect Sentiment in Microblogs , 2011, EMNLP.

[4]  Xiaoyan Zhu,et al.  Sentiment Analysis with Global Topics and Local Dependency , 2010, AAAI.

[5]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[6]  Yulan He,et al.  Latent Sentiment Model for Weakly-Supervised Cross-Lingual Sentiment Classification , 2011, ECIR.

[7]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[8]  Minyi Guo,et al.  Emoticon Smoothed Language Models for Twitter Sentiment Analysis , 2012, AAAI.

[9]  Yulan He,et al.  Joint sentiment/topic model for sentiment analysis , 2009, CIKM.

[10]  Jianling Sun,et al.  Large scale microblog mining using distributed MB-LDA , 2012, WWW.

[11]  Daniele Quercia,et al.  The Social World of Twitter: Topics, Geography, and Emotions , 2012, ICWSM.

[12]  Danah Boyd,et al.  Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[13]  Yang Song,et al.  Identifying Event-related Bursts via Social Media Activities , 2012, EMNLP.

[14]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[15]  Mark Steedman,et al.  Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning , 2012 .

[16]  Houfeng Wang,et al.  Entity-centric topic-oriented opinion summarization in twitter , 2012, KDD.

[17]  Eugene Agichtein,et al.  TM-LDA: efficient online modeling of latent topic transitions in social media , 2012, KDD.

[18]  Jacob Eisenstein,et al.  What to do about bad language on the internet , 2013, NAACL.

[19]  Ke Xu,et al.  MoodLens: an emoticon-based sentiment analysis system for chinese tweets , 2012, KDD.