Review mining for estimating users' ratings and weights for product aspects

Fine-grained opinions are often buried in user reviews. The opinionated aspects may also be associated with different weights by reviewers to represent the aspects’ relative importance. As the opinions and weights provide valuable information about users’ preferences for products, they can facilitate the generation of personalised recommendations. However, few studies to date have investigated the three inter-connected tasks in a unified framework: aspect identification, aspect-based rating inference and weight estimation. In this paper, we propose a unified framework for performing the three tasks, which involves 1) identifying the product aspects mentioned in a review, 2) inferring the reviewer’s ratings for these aspects from the opinions s/he expressed in a review, and 3) estimating the reviewer’s weights for these aspects. The relationship among these three tasks is inherently dependent in that the output of one task adjusts the accuracy of another task. We particularly develop an unsupervised model to Collectively estimate Aspect Ratings and Weights (shorted as CARW), which performs all of the three tasks by enhancing each other mutually. We conduct experiments on three real-life datasets to evaluate the CARW model. Experimental results show that the proposed model can achieve better performance than the related methods regarding each task.

[1]  Yue Lu,et al.  Exploiting Structured Ontology to Organize Scattered Online Opinions , 2010, COLING.

[2]  Yue Lu,et al.  Latent aspect rating analysis on review text data: a rating regression approach , 2010, KDD.

[3]  Rohini K. Srihari,et al.  OpinionMiner: a novel machine learning system for web opinion mining and extraction , 2009, KDD.

[4]  Regina Barzilay,et al.  Multiple Aspect Ranking Using the Good Grief Algorithm , 2007, NAACL.

[5]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.

[6]  Claire Cardie,et al.  Multi-aspect Sentiment Analysis with Topic Models , 2011, 2011 IEEE 11th International Conference on Data Mining Workshops.

[7]  Chun Chen,et al.  Opinion Word Expansion and Target Extraction through Double Propagation , 2011, CL.

[8]  Xu Ling,et al.  Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[9]  Xuanjing Huang,et al.  Phrase Dependency Parsing for Opinion Mining , 2009, EMNLP.

[10]  Andrea Esuli,et al.  Multi-Faceted Rating of Product Reviews , 2009, ERCIM News.

[11]  Panagiotis G. Ipeirotis,et al.  Show me the money!: deriving the pricing power of product features by mining consumer reviews , 2007, KDD '07.

[12]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[13]  Marie-Francine Moens,et al.  A machine learning approach to sentiment analysis in multilingual Web texts , 2009, Information Retrieval.

[14]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[15]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[16]  Jure Leskovec,et al.  Learning Attitudes and Attributes from Multi-aspect Reviews , 2012, 2012 IEEE 12th International Conference on Data Mining.

[17]  Li Chen,et al.  Recommending Inexperienced Products via Learning from Consumer Reviews , 2012, 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[18]  Yue Lu,et al.  Latent aspect rating analysis without aspect keyword supervision , 2011, KDD.

[19]  Li Chen,et al.  Comparison of Model-Based Learning Methods for Feature-Level Opinion Mining , 2011, 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[20]  Andrea Esuli,et al.  SENTIWORDNET: A Publicly Available Lexical Resource for Opinion Mining , 2006, LREC.

[21]  Martin Ester,et al.  The FLDA model for aspect-based opinion mining: addressing the cold start problem , 2013, WWW.

[22]  Amélie Marian,et al.  Beyond the Stars: Improving Rating Predictions using Review Text Content , 2009, WebDB.

[23]  Yue Lu,et al.  Opinion integration through semi-supervised topic modeling , 2008, WWW.

[24]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[25]  Nazli Goharian,et al.  Semi-supervised probabilistic sentiment analysis: Merging labeled sentences with unlabeled reviews to identify sentiment , 2013, ASIST.

[26]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[27]  Xianghua Fu,et al.  Multi-aspect sentiment analysis for Chinese online social reviews based on topic modeling and HowNet lexicon , 2013, Knowl. Based Syst..

[28]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[29]  Ralph Weischedel,et al.  PERFORMANCE MEASURES FOR INFORMATION EXTRACTION , 2007 .

[30]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[31]  Hao Yu,et al.  Structure-Aware Review Mining and Summarization , 2010, COLING.

[32]  Eric Chang,et al.  Red Opal: product-feature scoring from reviews , 2007, EC '07.

[33]  Hongfei Yan,et al.  Jointly Modeling Aspects and Opinions with a MaxEnt-LDA Hybrid , 2010, EMNLP.

[34]  Martin Ester,et al.  ILDA: interdependent LDA model for learning latent aspects and their ratings from online product reviews , 2011, SIGIR.

[35]  Nazli Goharian,et al.  Efficient estimation of aspect weights , 2012, SIGIR '12.

[36]  Meng Wang,et al.  Aspect Ranking: Identifying Important Product Aspects from Online Consumer Reviews , 2011, ACL.

[37]  Jaideep Srivastava,et al.  Impact of social influence in e-commerce decision making , 2007, ICEC.

[38]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[39]  Noémie Elhadad,et al.  An Unsupervised Aspect-Sentiment Model for Online Reviews , 2010, NAACL.

[40]  Yue Lu,et al.  Rated aspect summarization of short comments , 2009, WWW '09.

[41]  Li Chen,et al.  Preference-based clustering reviews for augmenting e-commerce recommendation , 2013, Knowl. Based Syst..