Sentiment analysis algorithms and applications: A survey

Abstract Sentiment Analysis (SA) is an ongoing field of research in text mining field. SA is the computational treatment of opinions, sentiments and subjectivity of text. This survey paper tackles a comprehensive overview of the last update in this field. Many recently proposed algorithms' enhancements and various SA applications are investigated and presented briefly in this survey. These articles are categorized according to their contributions in the various SA techniques. The related fields to SA (transfer learning, emotion detection, and building resources) that attracted researchers recently are discussed. The main target of this survey is to give nearly full image of SA techniques and the related fields with brief details. The main contributions of this paper include the sophisticated categorizations of a large number of recent articles and the illustration of the recent trend of research in the sentiment analysis and its related areas.

[1]  David E. Losada,et al.  Sentiment-Based Ranking of Blog Posts Using Rhetorical Structure Theory , 2013, NLDB.

[2]  Ling Liu,et al.  Manipulation of online reviews: An analysis of ratings, readability, and sentiments , 2012, Decis. Support Syst..

[3]  Andrés Montoyo,et al.  Detecting implicit expressions of emotion in text: A comparative analysis , 2012, Decis. Support Syst..

[4]  Sheng-Tun Li,et al.  A fuzzy conceptualization model for text mining with application in opinion polarity classification , 2013, Knowl. Based Syst..

[5]  Van De CampMatje,et al.  The socialist network , 2012, DSS 2012.

[6]  Patricio Martínez-Barco,et al.  Subjectivity and sentiment analysis: An overview of the current state of the area and envisaged developments , 2012, Decis. Support Syst..

[7]  Uzay Kaymak,et al.  Polarity analysis of texts using discourse structure , 2011, CIKM '11.

[8]  Swapna Somasundaran,et al.  Discourse Level Opinion Interpretation , 2008, COLING.

[9]  Angela Fahrni,et al.  Old Wine or Warm Beer : Target-Specific Sentiment Analysis of Adjectives , .

[10]  Mitsuru Ishizuka,et al.  EmoHeart: Automation of Expressive Communication of Emotions in Second Life , 2009, HCI.

[11]  Sheng-Tun Li,et al.  Noise control in document classification based on fuzzy formal concept analysis , 2011, 2011 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE 2011).

[12]  Bing Qin,et al.  Integrating Intra-and Inter-document Evidences for Improving Sentence Sentiment Classification , 2010 .

[13]  Stephen Shaoyi Liao,et al.  Mining comparative opinions from customer reviews for Competitive Intelligence , 2011, Decis. Support Syst..

[14]  John Carroll,et al.  Weakly supervised techniques for domain-independent sentiment classification , 2009, TSA@CIKM.

[15]  I. Jolliffe Principal Component Analysis , 2002 .

[16]  Swapna Somasundaran,et al.  Recognizing Stances in Online Debates , 2009, ACL.

[17]  Qiong Wu,et al.  A random walk algorithm for automatic construction of domain-oriented sentiment lexicon , 2011, Expert Syst. Appl..

[18]  Björn W. Schuller,et al.  New Avenues in Opinion Mining and Sentiment Analysis , 2013, IEEE Intelligent Systems.

[19]  Erik Cambria,et al.  Sentic PROMs: Application of sentic computing to the development of a novel unified framework for measuring health-care quality , 2012, Expert Syst. Appl..

[20]  Mitsuru Ishizuka,et al.  Recognition of Affect, Judgment, and Appreciation in Text , 2010, COLING.

[21]  Josef Steinberger,et al.  Creating Sentiment Dictionaries via Triangulation , 2011, Decis. Support Syst..

[22]  Mitsuru Ishizuka,et al.  Recognition of Affect Conveyed by Text Messaging in Online Communication , 2007, HCI.

[23]  Heiner Stuckenschmidt,et al.  Fine-Grained Sentiment Analysis with Structural Features , 2011, IJCNLP.

[24]  Erik Cambria,et al.  SenticNet 2: A Semantic and Affective Resource for Opinion Mining and Sentiment Analysis , 2012, FLAIRS.

[25]  Charu C. Aggarwal,et al.  Mining Text Data , 2012 .

[26]  Svetha Venkatesh,et al.  Regularized nonnegative shared subspace learning , 2011, Data Mining and Knowledge Discovery.

[27]  Dirk Neumann,et al.  Automated news reading: Stock price prediction based on financial news using context-capturing features , 2013, Decis. Support Syst..

[28]  Fei Song,et al.  Feature Selection for Sentiment Analysis Based on Content and Syntax Models , 2011, Decis. Support Syst..

[29]  Qing Cao,et al.  Exploring determinants of voting for the "helpfulness" of online user reviews: A text mining approach , 2011, Decis. Support Syst..

[30]  M. Aizerman,et al.  Theoretical Foundations of the Potential Function Method in Pattern Recognition Learning , 1964 .

[31]  Yung-Ming Li,et al.  Deriving market intelligence from microblogs , 2013, Decis. Support Syst..

[32]  Patricio Martínez-Barco,et al.  Using EmotiBlog to annotate and analyse subjectivity in the new textual genres , 2012, Data Mining and Knowledge Discovery.

[33]  Luigi Di Caro,et al.  Sentiment analysis via dependency parsing , 2013, Comput. Stand. Interfaces.

[34]  Uta Priss Formal concept analysis in information science , 2006 .

[35]  Kenji Araki,et al.  Affect analysis in context of characters in narratives , 2013, Expert Syst. Appl..

[36]  Songbo Tan,et al.  Weighted SCL model for adaptation of sentiment classification , 2011, Expert Syst. Appl..

[37]  Hwee Tou Ng,et al.  Feature selection, perceptron learning, and a usability case study for text categorization , 1997, SIGIR '97.

[38]  R. Plutchik A GENERAL PSYCHOEVOLUTIONARY THEORY OF EMOTION , 1980 .

[39]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[40]  Daoud Clarke,et al.  On developing robust models for favourability analysis: Model choice, feature sets and imbalanced data , 2012, Decis. Support Syst..

[41]  Erik Cambria,et al.  Towards Crowd Validation of the UK National Health Service , 2010 .

[42]  Tao Xu,et al.  Identifying the semantic orientation of terms using S-HAL for sentiment analysis , 2012, Knowl. Based Syst..

[43]  Diana Inkpen,et al.  A BOOTSTRAPPING METHOD FOR EXTRACTING PARAPHRASES OF EMOTION EXPRESSIONS FROM TEXTS , 2013, Comput. Intell..

[44]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[45]  William C. Mann,et al.  Rhetorical Structure Theory: Toward a functional theory of text organization , 1988 .

[46]  Marilyn A. Walker,et al.  That is your evidence?: Classifying stance in online political debate , 2012, Decis. Support Syst..

[47]  Curt Burgess,et al.  Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[48]  Mark Levene,et al.  Combining lexicon and learning based approaches for concept-level sentiment analysis , 2012, WISDOM '12.

[49]  Iñaki Inza,et al.  Approaching Sentiment Analysis by using semi-supervised learning of multi-dimensional classifiers , 2012, Neurocomputing.

[50]  Janyce Wiebe,et al.  Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis , 2005, HLT.

[51]  Pei-Chann Chang,et al.  Using a contextual entropy model to expand emotion words and their intensity for the sentiment classification of stock market news , 2013, Knowl. Based Syst..

[52]  Mitsuru Ishizuka,et al.  Emerging system for affectively charged interpersonal communication , 2009, 2009 ICCAS-SICE.

[53]  Juan Luis Castro,et al.  Lexicon-based Comments-oriented News Sentiment Analyzer system , 2012, Expert Syst. Appl..

[54]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[55]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[56]  Jen-Shin Hong,et al.  Automatic event-level textual emotion sensing using mutual action histogram between entities , 2010, Expert Syst. Appl..

[57]  Shlomo Argamon,et al.  Using appraisal groups for sentiment analysis , 2005, CIKM '05.

[58]  Saif Mohammad,et al.  Generating High-Coverage Semantic Orientation Lexicons From Overtly Marked Words and a Thesaurus , 2009, EMNLP.

[59]  Yi Hu,et al.  Document sentiment classification by exploring description model of topical terms , 2011, Comput. Speech Lang..

[60]  Max Kaufmann JMaxAlign: A Maximum Entropy Parallel Sentence Alignment Tool , 2012, COLING.

[61]  Chien Chin Chen,et al.  Quality evaluation of product reviews using an information quality framework , 2011, Decis. Support Syst..

[62]  Saif Mohammad,et al.  From once upon a time to happily ever after: Tracking emotions in mail and books , 2012, Decis. Support Syst..

[63]  Rudolf Wille,et al.  Restructuring Lattice Theory: An Approach Based on Hierarchies of Concepts , 2009, ICFCA.

[64]  Ben Taskar,et al.  Multi-View Learning over Structured and Non-Identical Outputs , 2008, UAI.

[65]  Deyu Zhou,et al.  Self-training from labeled features for sentiment analysis , 2011, Inf. Process. Manag..

[66]  Seong Joon Yoo,et al.  Senti-lexicon and improved Naïve Bayes algorithms for sentiment analysis of restaurant reviews , 2012, Expert Syst. Appl..

[67]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[68]  Isa Maks,et al.  A lexicon model for deep sentiment analysis and opinion mining applications , 2012, Decis. Support Syst..

[69]  Walaa Medhat,et al.  Sentiment analysis algorithms and applications: A survey , 2014 .

[70]  Youngjoong Ko,et al.  Automatic Text Categorization by Unsupervised Learning , 2000, COLING.

[71]  Hua Xu,et al.  Weakness Finder: Find product weakness from Chinese reviews by using aspects based sentiment analysis , 2012, Expert Syst. Appl..

[72]  Anil K. Jain,et al.  Classification of text documents , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[73]  Nick Bassiliades,et al.  Ontology-based sentiment analysis of twitter posts , 2013, Expert Syst. Appl..

[74]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[75]  Vasileios Hatzivassiloglou,et al.  Predicting the Semantic Orientation of Adjectives , 1997, ACL.

[76]  Chia-Hui Chang,et al.  Blogger-Centric Contextual Advertising , 2011, Expert Syst. Appl..

[77]  Wei Gao,et al.  Unsupervised Discovery of Discourse Relations for Eliminating Intra-sentence Polarity Ambiguities , 2011, EMNLP.

[78]  Tong Zhang,et al.  A Robust Risk Minimization based Named Entity Recognition System , 2003, CoNLL.

[79]  Qiong Wu,et al.  A two-stage framework for cross-domain sentiment classification , 2011, Expert Syst. Appl..

[80]  Thorsten Joachims,et al.  A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization , 1997, ICML.

[81]  Navneet Kaur,et al.  Opinion mining and sentiment analysis , 2016, 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom).

[82]  Jong C. Park,et al.  Identifying helpful reviews based on customer's mentions about experiences , 2012, Expert Syst. Appl..

[83]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[84]  João Francisco Valiati,et al.  Document-level sentiment classification: An empirical comparison between SVM and ANN , 2013, Expert Syst. Appl..

[85]  Charu C. Aggarwal,et al.  Mining Text Data , 2012, Springer US.

[86]  Lise Getoor,et al.  Supervised and Unsupervised Methods in Employing Discourse Relations for Improving Opinion Polarity Classification , 2009, EMNLP.

[87]  Padmini Srinivasan,et al.  Hierarchical neural networks for text categorization , 1999, SIGIR 1999.

[88]  Luis Alfonso Ureña López,et al.  Sentiment polarity detection in Spanish reviews combining supervised and unsupervised approaches , 2013, Expert Syst. Appl..

[89]  Xiaojun Wan,et al.  Co-Training for Cross-Lingual Sentiment Classification , 2009, ACL.

[90]  Thomas L. Griffiths,et al.  Integrating Topics and Syntax , 2004, NIPS.

[91]  Gaël Dias,et al.  Merged Agreement Algorithms for Domain Independent Sentiment Analysis , 2011 .

[92]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[93]  Chun Chen,et al.  DASA: Dissatisfaction-oriented Advertising based on Sentiment Analysis , 2010, Expert Syst. Appl..

[94]  Shourya Roy,et al.  Fast and accurate text classification via multiple linear discriminant projections , 2003, The VLDB Journal.

[95]  Thorsten Joachims,et al.  Learning to classify text using support vector machines - methods, theory and algorithms , 2002, The Kluwer international series in engineering and computer science.

[96]  Yuh-Min Chen,et al.  Electronic word of mouth analysis for service experience , 2013, Expert Syst. Appl..

[97]  Fermín L. Cruz,et al.  'Long autonomy or long delay?' The importance of domain in opinion mining , 2013, Expert Syst. Appl..

[98]  Soo-Min Kim,et al.  Determining the Sentiment of Opinions , 2004, COLING.

[99]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[100]  Chia-Hui Chang,et al.  Blogger-centric contextual advertising , 2009, Expert Syst. Appl..

[101]  Mitsuru Ishizuka,et al.  Compositionality Principle in Recognition of Fine-Grained Emotions from Text , 2009, ICWSM.

[102]  Diana Maynard,et al.  Automatic Detection of Political Opinions in Tweets , 2011, #MSM.

[103]  Ronen Feldman,et al.  Techniques and applications for sentiment analysis , 2013, CACM.

[104]  Adwait Ratnaparkhi,et al.  A Maximum Entropy Model for Part-Of-Speech Tagging , 1996, EMNLP.

[105]  Xianghua Fu,et al.  Multi-aspect sentiment analysis for Chinese online social reviews based on topic modeling and HowNet lexicon , 2013, Knowl. Based Syst..

[106]  Walaa Medhat,et al.  Combined Algorithm for Data Mining using Association rules , 2014, ArXiv.

[107]  Xue Bai,et al.  Predicting consumer sentiments from online text , 2011, Decis. Support Syst..

[108]  Andrew B. Whinston,et al.  Whose and what chatter matters? The effect of tweets on movie sales , 2013, Decis. Support Syst..

[109]  Nicholas Asher,et al.  Distilling Opinion in Discourse: A Preliminary Study , 2008, COLING.

[110]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[111]  David D. Lewis,et al.  A comparison of two learning algorithms for text categorization , 1994 .

[112]  Themis Palpanas,et al.  Survey on mining subjective data on the web , 2011, Data Mining and Knowledge Discovery.

[113]  Andrei Popescu-Belis,et al.  8th International Conference on Applications of Natural Language to Information Systems , 2003 .

[114]  Padmini Srinivasan,et al.  Exploring Feature Definition and Selection for Sentiment Classifiers , 2011, ICWSM.

[115]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[116]  Yanquan Zhou,et al.  Sentiment Polarity Analysis based multi-dictionary , 2011 .

[117]  Paolo Rosso,et al.  Making objective decisions from subjective data: Detecting irony in customer reviews , 2012, Decis. Support Syst..

[118]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[119]  Antal van den Bosch,et al.  The socialist network , 2012, Decis. Support Syst..