A faceted characterization of the opinion mining landscape

Increasing amounts of user generated content (UGC) on the Internet, is creating research interest in opinion mining. This involves automatic detection of opinions about products, services, political parties, celebrities and events from user generated content. Research efforts in opinion mining are thus far, fragmented and have followed several approaches. However, most of them require understanding of domain-specific opinion words and their polarity, and language-specific opinion rules. In addition, semantic constructs like sarcasm pose open challenges. In this paper, we try to characterize the opinion-mining landscape by proposing a faceted taxonomy of the different aspects of opinion mining. We then survey literature and place these in appropriate places in the proposed model. We also propose a general purpose workflow required from any opinion mining engine. Finally, we speculate on specific challenges in the opinion mining landscape.

[1]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[2]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[3]  Carlo Strapparava,et al.  WordNet Affect: an Affective Extension of WordNet , 2004, LREC.

[4]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[5]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[6]  Claire Cardie,et al.  Finding Deceptive Opinion Spam by Any Stretch of the Imagination , 2011, ACL.

[7]  Junhui Wang,et al.  Detecting group review spam , 2011, WWW.

[8]  Bing Liu,et al.  Opinion spam and analysis , 2008, WSDM '08.

[9]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[10]  Richard Colbaugh,et al.  Estimating the sentiment of social media content for security informatics applications , 2011, Proceedings of 2011 IEEE International Conference on Intelligence and Security Informatics.

[11]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[12]  Bu-Sung Lee,et al.  TwiNER: named entity recognition in targeted twitter stream , 2012, SIGIR '12.

[13]  John R. Anderson ACT: A simple theory of complex cognition. , 1996 .

[14]  Qiang Yang,et al.  Cross-domain sentiment classification via spectral feature alignment , 2010, WWW '10.

[15]  Xiaojin Zhu,et al.  A Topic Model for Word Sense Disambiguation , 2007, EMNLP.

[16]  Hong Yu,et al.  Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences , 2003, EMNLP.

[17]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[18]  Kathleen R. McKeown,et al.  Predicting the semantic orientation of adjectives , 1997 .

[19]  Gerhard Weikum,et al.  The Bag-of-Opinions Method for Review Rating Prediction from Sparse Text Patterns , 2010, COLING.

[20]  Ee-Peng Lim,et al.  Detecting product review spammers using rating behaviors , 2010, CIKM.

[21]  Yanjun Qi,et al.  Sentiment classification based on supervised latent n-gram analysis , 2011, CIKM '11.

[22]  Carlo Strapparava,et al.  The Lie Detector: Explorations in the Automatic Recognition of Deceptive Language , 2009, ACL.

[23]  D. Kahneman Thinking, Fast and Slow , 2011 .

[24]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[25]  H. Grice Logic and conversation , 1975 .

[26]  Arjun Mukherjee,et al.  Spotting fake reviewer groups in consumer reviews , 2012, WWW.

[27]  Wiltrud Kessler Turney: Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classication of Reviews , 2012 .

[28]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[29]  Claire Cardie,et al.  Partially Supervised Coreference Resolution for Opinion Summarization through Structured Rule Learning , 2006, EMNLP.

[30]  Hang Li,et al.  Named entity recognition in query , 2009, SIGIR.

[31]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[32]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[33]  Jeffrey T. Hancock,et al.  On Lying and Being Lied To: A Linguistic Analysis of Deception in Computer-Mediated Communication , 2007 .

[34]  Xu Ling,et al.  Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[35]  Paolo Rosso,et al.  Mining Subjective Knowledge from Customer Reviews: A Specific Case of Irony Detection , 2011, WASSA@ACL.

[36]  Cindy K. Chung,et al.  The development and psychometric properties of LIWC2007 , 2007 .

[37]  Hsinchun Chen,et al.  AI and Opinion Mining , 2010, IEEE Intelligent Systems.

[38]  Michael Gamon,et al.  Sentiment classification on customer feedback data: noisy data, large feature vectors, and the role of linguistic analysis , 2004, COLING.

[39]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[40]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[41]  Akira Utsumi,et al.  A Unified Theory of Irony and Its Computational Formalization , 1996, COLING.

[42]  Yi Yang,et al.  Learning to Identify Review Spam , 2011, IJCAI.

[43]  J. Keziya Rani,et al.  Mining Opinion Features in Customer Reviews. , 2016 .

[44]  Nina Wacholder,et al.  Identifying Sarcasm in Twitter: A Closer Look , 2011, ACL.

[45]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[46]  John Blitzer,et al.  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification , 2007, ACL.

[47]  Rui Xia,et al.  A POS-based Ensemble Model for Cross-domain Sentiment Classification , 2011, IJCNLP.

[48]  Bing Liu Sentiment Analysis , 2020 .

[49]  R. Kreuz,et al.  Why Do People Use Figurative Language? , 1994 .

[50]  Stephanie Seneff,et al.  Review Sentiment Scoring via a Parse-and-Paraphrase Paradigm , 2009, EMNLP.

[51]  Ronald R. Yager,et al.  WebPET: An Online Tool for Lexicographic Decision Making , 2010, IEEE Intelligent Systems.