Exploiting social relations for sentiment analysis in microblogging

Microblogging, like Twitter and Sina Weibo, has become a popular platform of human expressions, through which users can easily produce content on breaking news, public events, or products. The massive amount of microblogging data is a useful and timely source that carries mass sentiment and opinions on various topics. Existing sentiment analysis approaches often assume that texts are independent and identically distributed (i.i.d.), usually focusing on building a sophisticated feature space to handle noisy and short texts, without taking advantage of the fact that the microblogs are networked data. Inspired by the social sciences findings that sentiment consistency and emotional contagion are observed in social networks, we investigate whether social relations can help sentiment analysis by proposing a Sociological Approach to handling Noisy and short Texts (SANT) for sentiment classification. In particular, we present a mathematical optimization formulation that incorporates the sentiment consistency and emotional contagion theories into the supervised learning process; and utilize sparse learning to tackle noisy texts in microblogging. An empirical study of two real-world Twitter datasets shows the superior performance of our framework in handling noisy and short tweets.

[1]  R. Abelson Whatever Became of Consistency Theory? , 1983 .

[2]  D Voss,et al.  Frontiers of computer science. , 1991, Science.

[3]  J. Cacioppo,et al.  Emotional Contagion: Current implications and suggestions for future research , 1993 .

[4]  Noah E. Friedkin,et al.  Network Studies of Social Influence , 1993 .

[5]  E. Hatfield,et al.  Emotional Contagion , 1995 .

[6]  G. Bonanno,et al.  A study of laughter and dissociation: distinct correlates of laughter and smiling during bereavement. , 1997, Journal of personality and social psychology.

[7]  M. McPherson,et al.  Birds of a Feather: Homophily in Social Networks , 2001 .

[8]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[9]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[10]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[11]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[12]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[13]  Yurii Nesterov,et al.  Introductory Lectures on Convex Optimization - A Basic Course , 2014, Applied Optimization.

[14]  Claire Cardie,et al.  Annotating Expressions of Opinions and Emotions in Language , 2005, Lang. Resour. Evaluation.

[15]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[16]  Bing Liu,et al.  The utility of linguistic rules in opinion mining , 2007, SIGIR.

[17]  N. Christakis,et al.  The Dynamic Spread of Happiness in a Large Social Network , 2008 .

[18]  N. Christakis,et al.  Dynamic spread of happiness in a large social network: longitudinal analysis over 20 years in the Framingham Heart Study , 2008, BMJ : British Medical Journal.

[19]  Jieping Ye,et al.  An accelerated gradient method for trace norm minimization , 2009, ICML '09.

[20]  David A. Shamma,et al.  Tweet the debates: understanding community annotation of uncollected sources , 2009, WSM@MM.

[21]  Michael E. Jackson,et al.  Detecting Sadness in 140 Characters: Sentiment Analysis and Mourning Michael Jackson on Twitter , 2009 .

[22]  Karen Rose,et al.  What is Twitter , 2009 .

[23]  Nan Sun,et al.  Exploiting internal and external semantics for the clustering of short texts using world knowledge , 2009, CIKM.

[24]  Jieping Ye,et al.  Multi-Task Feature Learning Via Efficient l2, 1-Norm Minimization , 2009, UAI.

[25]  Brendan T. O'Connor,et al.  From Tweets to Polls: Linking Text Sentiment to Public Opinion Time Series , 2010, ICWSM.

[26]  Junlan Feng,et al.  Robust Sentiment Detection on Twitter from Biased and Noisy Data , 2010, COLING.

[27]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[28]  Johan Bollen,et al.  Modeling Public Mood and Emotion: Twitter Sentiment and Socio-Economic Phenomena , 2009, ICWSM.

[29]  Long Jiang,et al.  User-level sentiment analysis incorporating social networks , 2011, KDD.

[30]  Jonathan Harris,et al.  We feel fine and searching the emotional web , 2011, WSDM '11.

[31]  Zhoujun Li,et al.  Integrating rich information for video recommendation with multi-task rank aggregation , 2011, ACM Multimedia.

[32]  Jure Leskovec,et al.  Sentiment Flow Through Hyperlink Networks , 2011, ICWSM.

[33]  Jason Baldridge,et al.  Twitter Polarity Classification with Label Propagation over Lexical Links and the Follower Graph , 2011, ULNLP@EMNLP.

[34]  Jiawei Han,et al.  Towards feature selection in network , 2011, CIKM '11.

[35]  Huan Liu,et al.  Enhancing accessibility of microblogging messages using semantic knowledge , 2011, CIKM '11.

[36]  Kevin Lewis,et al.  Social selection and peer influence in an online social network , 2011, Proceedings of the National Academy of Sciences.

[37]  Huan Liu,et al.  Enriching short text representation in microblog for clustering , 2012, Frontiers of Computer Science.

[38]  Fei Wang,et al.  What Were the Tweets About? Topical Associations between Public Events and Twitter Feeds , 2012, ICWSM.

[39]  Huan Liu,et al.  Text Analytics in Social Media , 2012, Mining Text Data.

[40]  Huan Liu,et al.  Feature Selection with Linked Data in Social Media , 2012, SDM.

[41]  Fei Wang,et al.  ET-LDA: Joint Topic Modeling for Aligning Events and their Twitter Feedback , 2012, AAAI.

[42]  Rizal Setya Perdana What is Twitter , 2013 .