Sentiment diversification for short review summarization

With the abundance of reviews published on the Web about a given product, consumers are looking for ways to view major opinions that can be presented in a quick and succinct way. Reviews contain many different opinions, making the ability to show a diversified review summary that focus on coverage and diversity a major goal. Most review summarization work focuses on showing salient reviews as a summary which might ignore diversity in summaries. In this paper, we present a graph-based algorithm that is capable of producing extractive summaries that are both diversified from a sentiment point of view and topically well-covered. First, we use statistical measures to find topical words. Then we split the dataset based on the sentiment class of the reviews and perform the ranking on each sentiment graph. When compared with different baselines, our approach scores best in most ROUGE metrics. Specifically, our approach shows improvements of 3.9% in ROUGE-1 and 1.8% in ROUGE-L in comparison with the best competing baseline.

[1]  Yue Lu,et al.  Rated aspect summarization of short comments , 2009, WWW '09.

[2]  Michael J. Paul,et al.  Summarizing Contrastive Viewpoints in Opinionated Text , 2010, EMNLP.

[3]  Takaaki Hasegawa,et al.  Optimizing Informativeness and Readability for Sentiment Summarization , 2010, ACL.

[4]  Haizhou Li,et al.  Graph-based informative-sentence selection for opinion summarization , 2013, 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2013).

[5]  Abdelghani Bellaachia,et al.  Short text keyphrase extraction with hypergraphs , 2014, Progress in Artificial Intelligence.

[6]  Noriko Kando,et al.  Opinion-focused Summarization and its Analysis at DUC 2006 , 2006 .

[7]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[8]  Claire Cardie,et al.  Partially Supervised Coreference Resolution for Opinion Summarization through Structured Rule Learning , 2006, EMNLP.

[9]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[10]  Ralf Krestel,et al.  Diversifying Product Review Rankings: Getting the Full Picture , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[11]  Claire Cardie,et al.  Query-Focused Opinion Summarization for User-Generated Content , 2014, COLING.

[12]  ChengXiang Zhai,et al.  Micropinion generation: an unsupervised approach to generating ultra-concise summaries of opinions , 2012, WWW.

[13]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[14]  Jiawei Han,et al.  Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions , 2010, COLING.

[15]  James Caverlee,et al.  Creating Diverse Product Review Summaries: A Graph Approach , 2015, WISE.

[16]  Hsin-Hsi Chen,et al.  Opinion Extraction, Summarization and Tracking in News and Blog Corpora , 2006, AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs.

[17]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[18]  Ani Nenkova,et al.  Beyond SumBasic: Task-focused summarization with sentence simplification and lexical expansion , 2007, Information Processing & Management.

[19]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[20]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[21]  M. de Rijke,et al.  Summarizing Contrastive Themes via Hierarchical Non-Parametric Processes , 2015, SIGIR.

[22]  Eduard H. Hovy,et al.  The Automated Acquisition of Topic Signatures for Text Summarization , 2000, COLING.

[23]  Panayiotis Tsaparas,et al.  Review Synthesis for Micro-Review Summarization , 2015, WSDM.

[24]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[25]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.