Affect Computation of Chinese Short Text

Microblogs are a rising social network with distinguishing features such as simplicity and convenience and has already attracted a large number of users and triggered massive information explosion concerning individuals’ own statuses and opinions. While sentiment analysis of the messages in microblogs is of great value, most of present studies are on English microblogs and few are on Chinese microblogs. Compared to English, Chinese has its unique expression style, such as no spaces or other word delimiters. Furthermore, Chinese short text also has its own properties. Thus we are inspired to explore effective features for sentiment classification of Chinese short text. In this paper, we propose to study userrelated sentiment classification of Chinese microblogs in terms of the statistical and semantic characteristics, and deisgn the corresponding features: ratio of positive words and negative words (PNR), position feature (POS ), collocation of verbs (COL), auxiliary words (AU). Then we employ an SVM-based method to classify the sentiment. Experiments show that the features we design is effective in recognizing the sentiment of messages in microblogs. key words: microblog, sentiment classification, statistical feature, semantic feature

[1]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[2]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[3]  François Yvon,et al.  The Contribution of Low Frequencies to Multilingual Sub-sentential Alignment: a Differential Associative Approach , 2011 .

[4]  Ari Rappoport,et al.  Enhanced Sentiment Learning Using Twitter Hashtags and Smileys , 2010, COLING.

[5]  H. Varian,et al.  Predicting the Present with Google Trends , 2009 .

[6]  Ramanathan V. Guha,et al.  The predictive power of online chatter , 2005, KDD '05.

[7]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[8]  Xiaohui Yu,et al.  ARSA: a sentiment-aware model for predicting sales performance using blogs , 2007, SIGIR.

[9]  G. Fricchione Descartes’ Error: Emotion, Reason and the Human Brain , 1995 .

[10]  Tiejun Zhao,et al.  Target-dependent Twitter Sentiment Classification , 2011, ACL.

[11]  Bernardo A. Huberman,et al.  Predicting the Future with Social Media , 2010, Web Intelligence.

[12]  Qun Liu,et al.  HHMM-based Chinese Lexical Analyzer ICTCLAS , 2003, SIGHAN.

[13]  Gilad Mishne,et al.  Predicting Movie Sales from Blogger Sentiment , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[14]  Andrew Ortony,et al.  The Cognitive Structure of Emotions , 1988 .

[15]  Siddharth Patwardhan,et al.  Feature Subsumption for Opinion Analysis , 2006, EMNLP.

[16]  A. Damasio Descartes’ Error. Emotion, Reason and the Human Brain. New York (Grosset/Putnam) 1994. , 1994 .

[17]  Junlan Feng,et al.  Robust Sentiment Detection on Twitter from Biased and Noisy Data , 2010, COLING.

[18]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[19]  Mike Thelwall,et al.  A Study of Information Retrieval Weighting Schemes for Sentiment Analysis , 2010, ACL.

[20]  Ramesh C. Jain,et al.  Micro-blogging Sentiment Detection by Collaborative Online Learning , 2010, 2010 IEEE International Conference on Data Mining.

[21]  Kathleen R. McKeown,et al.  Predicting the semantic orientation of adjectives , 1997 .

[22]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.