Sentiment Detection of Short Text via Probabilistic Topic Modeling

As an important medium used to describe events, the short text is effective to convey emotions and communicate affective states. In this paper, we proposed a classification method based on probabilistic topic model, which greatly improve the performance of sentimental categorization methods on short text. To solve the problems of sparsity and context-dependency, we extract hidden topics behind the text and associate different words by the same topic. Evaluation on sentiment detection of short text verified the effectiveness of the proposed method.

[1]  Somnath Banerjee,et al.  Clustering short texts using wikipedia , 2007, SIGIR.

[2]  Haoran Xie,et al.  Community-Aware Resource Profiling for Personalized Search in Folksonomy , 2012, Journal of Computer Science and Technology.

[3]  Mingliang Chen,et al.  Building emotional dictionary for sentiment analysis of online news , 2014, World Wide Web.

[4]  Wenyin Liu,et al.  Affective topic model for social emotion detection , 2014, Neural Networks.

[5]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[6]  Haoran Xie,et al.  Mining Latent User Community for Tag-Based and Content-Based Search in Social Media , 2014, Comput. J..

[7]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[8]  Rong Yan,et al.  Mining Social Emotions from Affective Text , 2012, IEEE Transactions on Knowledge and Data Engineering.

[9]  Zhenyu Wang,et al.  An Unsupervised Method for Short-Text Sentiment Analysis Based on Analysis of Massive Data , 2015, ICYCSEE.

[10]  Haoran Xie,et al.  Community-aware user profile enrichment in folksonomy , 2014, Neural Networks.

[11]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[12]  Diego Reforgiato Recupero,et al.  Frame-Based Detection of Opinion Holders and Topics: A Model and a Tool , 2014, IEEE Computational Intelligence Magazine.

[13]  Jun S. Liu,et al.  Monte Carlo strategies in scientific computing , 2001 .

[14]  Hae-Chang Rim,et al.  Some Effective Techniques for Naive Bayes Text Classification , 2006, IEEE Transactions on Knowledge and Data Engineering.

[15]  Richard Wicentowski,et al.  SWAT-MP:The SemEval-2007 Systems for Task 5 and Task 14 , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[16]  Gang Liu,et al.  Short text similarity based on probabilistic topics , 2009, Knowledge and Information Systems.

[17]  Thomas Hofmann,et al.  Probabilistic Latent Semantic Analysis , 1999, UAI.

[18]  Mehran Sahami,et al.  A web-based kernel function for measuring the similarity of short text snippets , 2006, WWW '06.

[19]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[20]  Yanghui Rao,et al.  Sentiment topic models for social emotion mining , 2014, Inf. Sci..

[21]  Yong Yao,et al.  A new text classification method based on HMM-SVM , 2007, 2007 International Symposium on Communications and Information Technologies.

[22]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.