A local LDA based method for Latent Aspect Rating Analysis on reviews

The expanding volume of online reviews has made it an important and challenging task to mine detailed information of opinions in those reviews. In many cases, along with the comment, a user also gives an overall rating on the target entity, which in fact could not reflect the detailed opinions on each aspect of the entity. Therefore, Latent Aspect Rating Analysis (LARA) came into being. The goal of LARA is to infer a latent rating and weight for each aspect based on the overall rating and the review content. Although some methods have been applied to solve this problem, they rely too much on the predefinition of aspects with keywords, which needs supervision and may hence introduce some biases. In this paper, we propose a Local LDA based method for LARA, which includes two stages. In the first stage, we employ Local LDA to discover aspects automatically. In the second stage, we use LRR model to infer the latent rating and weight for each of the discovered aspects. The experimental results on the review dataset demonstrate the advantages of the proposed method over the state-of-the-art methods.

[1]  Peter D. Turney Thumbs Up or Thumbs Down? Semantic Orientation Applied to Unsupervised Classification of Reviews , 2002, ACL.

[2]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[3]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[4]  Eytan Domany,et al.  Resampling Method for Unsupervised Estimation of Cluster Validity , 2001, Neural Computation.

[5]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[6]  Bo Pang,et al.  Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales , 2005, ACL.

[7]  Dong-Hong Ji,et al.  I2R: Three Systems for Word Sense Discrimination, Chinese Word Sense Disambiguation, and English Word Sense Disambiguation , 2007, SemEval@ACL.

[8]  Joachim M. Buhmann,et al.  Stability-Based Validation of Clustering Solutions , 2004, Neural Computation.

[9]  Bing Liu,et al.  Mining Opinion Features in Customer Reviews , 2004, AAAI.

[10]  Chun Chen,et al.  Opinion Word Expansion and Target Extraction through Double Propagation , 2011, CL.

[11]  Vibhu O. Mittal,et al.  Comparative Experiments on Sentiment Classification for Online Product Reviews , 2006, AAAI.

[12]  Lei Zhang,et al.  Entity discovery and assignment for opinion mining applications , 2009, KDD.

[13]  David M. Pennock,et al.  Mining the peanut gallery: opinion extraction and semantic classification of product reviews , 2003, WWW '03.

[14]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[15]  Noémie Elhadad,et al.  An Unsupervised Aspect-Sentiment Model for Online Reviews , 2010, NAACL.

[16]  Ivan Titov,et al.  A Joint Model of Text and Aspect Ratings for Sentiment Summarization , 2008, ACL.

[17]  Yue Lu,et al.  Rated aspect summarization of short comments , 2009, WWW '09.

[18]  Lei Zhang,et al.  Identifying Noun Product Features that Imply Opinions , 2011, ACL.

[19]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[20]  Xiaojin Zhu,et al.  Seeing stars when there aren’t many stars: Graph-based semi-supervised learning for sentiment categorization , 2006 .

[21]  Yue Lu,et al.  Latent aspect rating analysis on review text data: a rating regression approach , 2010, KDD.