Sentiment lexicon adaptation with context and semantics for the social web

Sentiment analysis over social streams offers governments and organisations a fast and effective way to monitor the publics' feelings towards policies, brands, business, etc. General purpose sentiment lexicons have been used to compute sentiment from social streams, since they are simple and effective. They calculate the overall sentiment of texts by using a general collection of words, with predetermined sentiment orientation and strength. However, words' sentiment often vary with the contexts in which they appear, and new words might be encountered that are not covered by the lexicon, particularly in social media environments where content emerges and changes rapidly and constantly. In this paper, we propose a lexicon adaptation approach that uses contextual as well as semantic information extracted from DBPedia to update the words' weighted sentiment orientations and to add new words to the lexicon. We evaluate our approach on three different Twitter datasets, and show that enriching the lexicon with contextual and semantic information improves sentiment computation by 3.4% in average accuracy, and by 2.8% in average F1 measure.

[1]  Hiroshi Kanayama,et al.  Fully Automatic Lexicon Expansion for Domain-oriented Sentiment Analysis , 2006, EMNLP.

[2]  Preslav Nakov,et al.  SemEval-2013 Task 2: Sentiment Analysis in Twitter , 2013, *SEMEVAL.

[3]  Fabrício Benevenuto,et al.  Comparing and combining sentiment analysis methods , 2013, COSN '13.

[4]  Harith Alani,et al.  Semantic Sentiment Analysis of Twitter , 2012, SEMWEB.

[5]  Viswa Mani Kiran Peddinti,et al.  Domain Adaptation in Sentiment Analysis of Twitter , 2011, Analyzing Microtext.

[6]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[7]  S. Siegel,et al.  Nonparametric Statistics for the Behavioral Sciences , 2022, The SAGE Encyclopedia of Research Design.

[8]  Erik Cambria,et al.  An Introduction to Concept-Level Sentiment Analysis , 2013, MICAI.

[9]  Huan Liu,et al.  Unsupervised sentiment analysis with emotional signals , 2013, WWW.

[10]  Qiang Yang,et al.  Cross-Domain Co-Extraction of Sentiment and Topic Lexicons , 2012, ACL.

[11]  Natalia V. Loukachevitch,et al.  Two-Step Model for Sentiment Lexicon Extraction from Twitter Streams , 2014, WASSA@ACL.

[12]  Johan Bollen,et al.  Twitter mood predicts the stock market , 2010, J. Comput. Sci..

[13]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[14]  Masnizah Mohd,et al.  Sentiment Lexicon Interpolation and Polarity Estimation of Objective and Out-Of-Vocabulary Words to Improve Sentiment Classification on Microblogging , 2014, PACLIC.

[15]  Erik Cambria,et al.  SenticNet 2: A Semantic and Affective Resource for Opinion Mining and Sentiment Analysis , 2012, FLAIRS.

[16]  Patrick Pantel,et al.  From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..

[17]  J. Shaffer Multiple Hypothesis Testing , 1995 .

[18]  Harith Alani,et al.  Contextual semantics for sentiment analysis of Twitter , 2016, Inf. Process. Manag..

[19]  Andrea Esuli,et al.  SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining , 2010, LREC.

[20]  Harith Alani,et al.  Adapting Sentiment Lexicons Using Contextual Semantics for Sentiment Analysis of Twitter , 2014, ESWC.

[21]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[22]  Songbo Tan,et al.  Adapting information bottleneck method for automatic construction of domain-oriented sentiment lexicon , 2010, WSDM '10.

[23]  Mike Thelwall,et al.  Sentiment strength detection for the social web , 2012, J. Assoc. Inf. Sci. Technol..

[24]  Miriam Fernández,et al.  Sentiment Analysis in Social Streams , 2016, Emotions and Personality in Personalized Services.

[25]  Betsy Jane Becker,et al.  Combining significance levels. , 1994 .

[26]  Rada Mihalcea,et al.  A Bootstrapping Method for Building Subjectivity Lexicons for Languages with Scarce Resources , 2008, LREC.

[27]  Stefan M. Rüger,et al.  Weakly Supervised Joint Sentiment-Topic Detection from Text , 2012, IEEE Transactions on Knowledge and Data Engineering.

[28]  Harith Alani,et al.  SentiCircles for Contextual and Conceptual Semantic Sentiment Analysis of Twitter , 2014, ESWC.

[29]  ThelwallMike,et al.  Sentiment strength detection in short informal text , 2010 .

[30]  Yue Lu,et al.  Automatic construction of a context-aware sentiment lexicon: an optimization approach , 2011, WWW.

[31]  Daling Wang,et al.  A word-emoticon mutual reinforcement ranking model for building sentiment lexicon from massive collection of microblogs , 2014, World Wide Web.

[32]  Harith Alani,et al.  Evaluation Datasets for Twitter Sentiment Analysis: A survey and a new dataset, the STS-Gold , 2013, ESSEM@AI*IA.

[33]  Wiltrud Kessler Turney: Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classication of Reviews , 2012 .

[34]  Finn Årup Nielsen,et al.  A New ANEW: Evaluation of a Word List for Sentiment Analysis in Microblogs , 2011, #MSM.

[35]  Yoshua Bengio,et al.  Domain Adaptation for Large-Scale Sentiment Classification: A Deep Learning Approach , 2011, ICML.

[36]  Hinrich Schütze,et al.  Bootstrapping Sentiment Labels For Unannotated Documents With Polarity PageRank , 2012, LREC.

[37]  Claire Cardie,et al.  Adapting a Polarity Lexicon using Integer Linear Programming for Domain-Specific Sentiment Classification , 2009, EMNLP.

[38]  Guillermo Sapiro,et al.  If you are happy and you know it... tweet , 2012, CIKM '12.

[39]  Giuseppe Pirrò,et al.  Explaining and Suggesting Relatedness in Knowledge Graphs , 2015, SEMWEB.

[40]  K. Thompson,et al.  If You're Happy and You Know It , 2012 .

[41]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.