Comprehensive Review of Opinion Summarization

The abundance of opinions on the web has kindled the study of opinion summarization over the last few years. People have introduced various techniques and paradigms to solving this special task. This survey attempts to systematically investigate the different techniques and approaches used in opinion summarization. We provide a multi-perspective classification of the approaches used and highlight some of the key weaknesses of these approaches. This survey also covers evaluation techniques and data sets used in studying the opinion summarization problem. Finally, we provide insights into some of the challenges that are left to be addressed as this will help set the trend for future research in this area.

[1]  Timothy W. Finin,et al.  Modeling Trust and Influence in the Blogosphere Using Link Polarity , 2007, ICWSM.

[2]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[3]  Julien Velcin,et al.  A combination of opinion mining and social network techniques for discussion analysis , 2009, Fouille de Données d'Opinions.

[4]  Xiaoyan Zhu,et al.  Movie review mining and summarization , 2006, CIKM '06.

[5]  Shengrui Wang,et al.  Identifying authoritative actors in question-answering forums: the case of Yahoo! answers , 2008, KDD.

[6]  Panagiotis G. Ipeirotis,et al.  Show me the money!: deriving the pricing power of product features by mining consumer reviews , 2007, KDD '07.

[7]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[8]  Yue Lu,et al.  Rated aspect summarization of short comments , 2009, WWW '09.

[9]  Guillaume Bouchard,et al.  Opinion mining in social media: Modeling, simulating, and forecasting political opinions in the web , 2012, Gov. Inf. Q..

[10]  ChengXiang Zhai,et al.  Micropinion generation: an unsupervised approach to generating ultra-concise summaries of opinions , 2012, WWW.

[11]  Janyce Wiebe,et al.  Just How Mad Are You? Finding Strong and Weak Opinion Clauses , 2004, AAAI.

[12]  Bing Liu,et al.  Opinion Extraction and Summarization on the Web , 2006, AAAI.

[13]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[14]  Hsin-Hsi Chen,et al.  Opinion Extraction, Summarization and Tracking in News and Blog Corpora , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[15]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[16]  Xiaoqiang Luo,et al.  On Coreference Resolution Performance Metrics , 2005, HLT.

[17]  Hong Yu,et al.  Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences , 2003, EMNLP.

[18]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[19]  Mikolaj Morzy,et al.  Opinion Mining and Social Networks: A Promising Match , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[20]  Bei Yu,et al.  A cross-collection mixture model for comparative text mining , 2004, KDD.

[21]  Bing Liu,et al.  Sentiment Analysis and Subjectivity , 2010, Handbook of Natural Language Processing.

[22]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[23]  Claire Cardie,et al.  Toward Opinion Summarization: Linking the Sources , 2006 .

[24]  Andrés Montoyo,et al.  Multilingual Feature-Driven Opinion Extraction and Summarization from Customer Reviews , 2008, NLDB.

[25]  Doug Downey,et al.  Web-scale information extraction in knowitall: (preliminary results) , 2004, WWW '04.

[26]  Breck Baldwin,et al.  Algorithms for Scoring Coreference Chains , 1998 .

[27]  J. Kamps,et al.  Words with attitude , 2002 .

[28]  Rebecca J. Passonneau Computing Reliability for Coreference Annotation , 2004, LREC.

[29]  Bo Pang,et al.  Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales , 2005, ACL.

[30]  Michael Granitzer,et al.  Blog credibility ranking by exploiting verified content , 2009, WICOW.

[31]  Jiawei Han,et al.  Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions , 2010, COLING.

[32]  Yue Lu,et al.  Opinion integration through semi-supervised topic modeling , 2008, WWW.

[33]  Bing Liu,et al.  Opinion spam and analysis , 2008, WSDM '08.

[34]  Nigel Collier,et al.  Sentiment Analysis using Support Vector Machines with Diverse Information Sources , 2004, EMNLP.

[35]  Regina Barzilay,et al.  Multiple Aspect Ranking Using the Good Grief Algorithm , 2007, NAACL.

[36]  M. Thelwall,et al.  Data mining emotion in social network communication: Gender differences in MySpace , 2010 .

[37]  Julien Velcin,et al.  Definition and Measures of an Opinion Model for Mining Forums , 2009, 2009 International Conference on Advances in Social Network Analysis and Mining.

[38]  Gilad Mishne,et al.  MoodViews: Tools for Blog Mood Analysis , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[39]  ChengXiang Zhai,et al.  Generating comparative summaries of contradictory opinions in text , 2009, CIKM.

[40]  S. Rosen Hedonic Prices and Implicit Markets: Product Differentiation in Pure Competition , 1974, Journal of Political Economy.

[41]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[42]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[43]  András A. Benczúr,et al.  SpamRank - fully automatic link spam detection. Work in progress , 2005 .

[44]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[45]  Jaideep Srivastava,et al.  Predicting trusts among users of online communities: an epinions case study , 2008, EC '08.

[46]  Claire Cardie,et al.  Partially Supervised Coreference Resolution for Opinion Summarization through Structured Rule Learning , 2006, EMNLP.

[47]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[48]  Mark S. Ackerman,et al.  Expertise networks in online communities: structure and algorithms , 2007, WWW '07.

[49]  Chaomei Chen,et al.  Visual Analysis of Conflicting Opinions , 2006, 2006 IEEE Symposium On Visual Analytics Science And Technology.

[50]  Chris D. Paice,et al.  Constructing literature abstracts by computer: Techniques and prospects , 1990, Inf. Process. Manag..

[51]  Ellen Riloff,et al.  Learning subjective nouns using extraction pattern bootstrapping , 2003, CoNLL.

[52]  Eduard Hovy,et al.  Automated Text Summarization in SUMMARIST , 1997, ACL 1997.

[53]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.

[54]  J. M. Kittross The measurement of meaning , 1959 .

[55]  C. Osgood,et al.  The Measurement of Meaning , 1958 .

[56]  Bing Liu,et al.  Mining Opinion Features in Customer Reviews , 2004, AAAI.

[57]  Ramakrishnan Srikant,et al.  Mining newsgroups using networks arising from social behavior , 2003, WWW '03.

[58]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[59]  Xu Ling,et al.  Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[60]  Ellen Riloff,et al.  Learning Extraction Patterns for Subjective Expressions , 2003, EMNLP.

[61]  Ming Zhou,et al.  Low-Quality Product Review Detection in Opinion Summarization , 2007, EMNLP.

[62]  Gilad Mishne,et al.  Finding high-quality content in social media , 2008, WSDM '08.

[63]  Dekang Lin,et al.  Dependency-Based Evaluation of Minipar , 2003 .

[64]  Philip S. Yu,et al.  Identifying the influential bloggers in a community , 2008, WSDM '08.

[65]  Ani Nenkova,et al.  The Pyramid Method: Incorporating human content selection variation in summarization evaluation , 2007, TSLP.

[66]  Klaus Krippendorff,et al.  Content Analysis: An Introduction to Its Methodology , 1980 .

[67]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[68]  R. Passonneau Computing Reliability for Co-Reference Annotation , 2004 .

[69]  Claire Cardie,et al.  Topic Identification for Fine-Grained Opinion Analysis , 2008, COLING.

[70]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[71]  Steven W. Zucker,et al.  On the Foundations of Relaxation Labeling Processes , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[72]  Kathleen R. McKeown,et al.  Predicting the semantic orientation of adjectives , 1997 .

[73]  Bo Pang,et al.  A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts , 2004, ACL.

[74]  Janyce Wiebe,et al.  Learning Subjective Adjectives from Corpora , 2000, AAAI/IAAI.