Opinion mining: reviewed from word to document level

Opinion mining is one of the most challenging tasks in the field of information retrieval. The research community has published many articles on this topic, and interest has increased significantly during the past decade, especially after the launch of several online social networks. In this paper, we provide a detailed overview of the related work on opinion mining. The following features distinguish our review from similar works: (1) it discusses the work at different granularity levels (word, sentence, and document), a perspective that is both unusual and much needed; (2) it discusses the related work in terms of the challenges of the field of opinion mining; (3) its document-level discussion gives an overview of the opinion mining task in the blogosphere, one of the most popular online social networks; and (4) it highlights the importance of online social networks for the opinion mining task and other related sub-tasks.
