Leveraging semantics for sentiment polarity detection in social media

With the increase use of microblogs and social media platforms as forms of on-line communication, we now have a huge amount of opinionated data reflecting people’s opinions and attitudes in form of reviews, forum discussions, blogs and tweets. This has recently brought great interest to sentiment analysis and opinion mining field that analyzes people’s feelings and attitudes from written language. Most of the existing approaches on sentiment analysis rely mainly on the presence of affect words that explicitly reflect sentiment. However, these approaches are semantically weak, that is, they do not take into account the semantics of words when detecting their sentiment in text. Only recently a few approaches (e.g. sentic computing) started investigating towards this direction. Following this trend, this paper investigates the role of semantics in sentiment analysis of social media. To this end, frame semantics and lexical resources such as BabelNet are employed to extract semantic features from social media that lead to more accurate sentiment analysis models. Experiments are conducted with different types of semantic information by assessing their impact in four social media datasets which incorporate tweets, blogs and movie reviews. A tenfold cross-validation shows that F1 measure increases significantly when using semantics in sentiment analysis in social media. Results show that the proposed approach considering word’s semantics for sentiment analysis surpasses non-semantic approaches for the considered datasets.

[1]  Simone Paolo Ponzetto,et al.  BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network , 2012, Artif. Intell..

[2]  Harith Alani,et al.  SentiCircles for Contextual and Conceptual Semantic Sentiment Analysis of Twitter , 2014, ESWC.

[3]  Harith Alani,et al.  Semantic Sentiment Analysis of Twitter , 2012, SEMWEB.

[4]  Diego Reforgiato Recupero,et al.  Semantic Web Evaluation Challenge: SemWebEval 2014 at ESWC 2014, Anissaras, Crete, Greece, May 25-29, 2014, Revised Selected Papers , 2014 .

[5]  Erik Cambria,et al.  Sentic patterns: Dependency-based rules for concept-level sentiment analysis , 2014, Knowl. Based Syst..

[6]  Bin Gu,et al.  Incremental learning for ν-Support Vector Regression , 2015, Neural Networks.

[7]  Diego Reforgiato Recupero,et al.  Framester: A Wide Coverage Linguistic Linked Data Hub , 2016, EKAW.

[8]  Vladimir Ceperic,et al.  Integration of Neural Network-Based Symbolic Regression in Deep Learning for Scientific Discovery , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[9]  D. V. L. N. Somayajulu,et al.  Classification of Movie Reviews Using Complemented Naive Bayesian Classifier , 2011 .

[10]  Erik Cambria,et al.  A Deeper Look into Sarcastic Tweets Using Deep Convolutional Neural Networks , 2016, COLING.

[11]  Li Chen,et al.  News impact on stock price return via sentiment analysis , 2014, Knowl. Based Syst..

[12]  Eniafe Festus Ayetiran,et al.  A Data Mining-Based Response Model for Target Selection in Direct Marketing , 2012 .

[13]  Christian Biemann,et al.  IIT-TUDA at SemEval-2016 Task 5: Beyond Sentiment Lexicon: Combining Domain Dependency and Distributional Semantics Features for Aspect Based Sentiment Analysis , 2016, *SEMEVAL.

[14]  Junlan Feng,et al.  Robust Sentiment Detection on Twitter from Biased and Noisy Data , 2010, COLING.

[15]  Chris Tseng,et al.  Classifying twitter data with Naïve Bayes Classifier , 2012, 2012 IEEE International Conference on Granular Computing.

[16]  Xingming Sun,et al.  Structural Minimax Probability Machine , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[17]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[18]  Erik Cambria,et al.  Sentic Computing for social media marketing , 2012, Multimedia Tools and Applications.

[19]  Erik Cambria,et al.  Convolutional MKL Based Multimodal Emotion Recognition and Sentiment Analysis , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[20]  Patrick Paroubek,et al.  Twitter as a Corpus for Sentiment Analysis and Opinion Mining , 2010, LREC.

[21]  Pushpak Bhattacharyya,et al.  WikiSent : Weakly Supervised Sentiment Analysis Through Extractive Summarization With Wikipedia , 2012, ECML/PKDD.

[22]  Diego Reforgiato Recupero,et al.  ESWC 15 Challenge on Concept-Level Sentiment Analysis , 2015, SemWebEval@ESWC.

[23]  Saeedeh Momtazi,et al.  Fine-grained German Sentiment Analysis on Social Media , 2012, LREC.

[24]  Diego Reforgiato Recupero,et al.  Frame-Based Detection of Opinion Holders and Topics: A Model and a Tool , 2014, IEEE Computational Intelligence Magazine.

[25]  Diana Maynard,et al.  Automatic Detection of Political Opinions in Tweets , 2011, #MSM.

[26]  Johanna D. Moore,et al.  Twitter Sentiment Analysis: The Good the Bad and the OMG! , 2011, ICWSM.

[27]  Björn W. Schuller,et al.  SenticNet 4: A Semantic Resource for Sentiment Analysis Based on Conceptual Primitives , 2016, COLING.

[28]  Tao Wang,et al.  Personalized search for social media via dominating verbal context , 2016, Neurocomputing.

[29]  Harith Alani,et al.  SentiCircles: A Platform for Contextual and Conceptual Sentiment Analysis , 2016, ESWC.

[30]  John Carroll,et al.  Automatic Seed Word Selection for Unsupervised Sentiment Classification of Chinese Text , 2008, COLING.

[31]  Christopher Potts,et al.  Learning Word Vectors for Sentiment Analysis , 2011, ACL.

[32]  Harith Alani,et al.  Semantic Patterns for Sentiment Analysis of Twitter , 2014, SEMWEB.

[33]  Diego Reforgiato Recupero,et al.  Sentilo: Frame-Based Sentiment Analysis , 2014, Cognitive Computation.

[34]  Owen Rambow,et al.  Sentiment Analysis of Twitter Data , 2011 .

[35]  Harith Alani,et al.  Contextual semantics for sentiment analysis of Twitter , 2016, Inf. Process. Manag..

[36]  Estevam R. Hruschka,et al.  Tweet sentiment analysis with classifier ensembles , 2014, Decis. Support Syst..