Incorporating appraisal expression patterns into topic modeling for aspect and sentiment word identification

With the considerable growth of user-generated content, online reviews have become an extremely valuable source for mining customers' opinions on products and services. However, most traditional opinion mining methods are coarse-grained and do not capture the fine-grained opinions expressed in natural language. Aspect-based opinion mining and summarization are therefore of great interest in both academic and industrial research. In this paper, we study an approach for automatically extracting product and service aspect words, as well as sentiment words, from reviews. We present an unsupervised approach based on dependency analysis that extracts Appraisal Expression Patterns (AEPs) from reviews. AEPs represent the manner in which people express opinions about products or services, and can be regarded as a condensed representation of the syntactic relationship between aspect and sentiment words. They are high-level, domain-independent information and therefore adapt well across domains. We also propose an AEP-based Latent Dirichlet Allocation (AEP-LDA) model. This is a sentence-level, probabilistic generative model which assumes that all words in a sentence are drawn from one topic, an assumption that, in our observation, generally holds. The model also assumes that every review corpus is composed of several mutually corresponding aspect and sentiment topics, as well as a background word topic. The AEP information is incorporated into the AEP-LDA model so that aspect and sentiment words are mined simultaneously. Experimental results on reviews of restaurants, hotels, MP3 players, and cameras show that the AEP-LDA model outperforms other approaches in identifying aspect and sentiment words.
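
The sentence-level assumption can be illustrated with a toy generative sketch. This is not the AEP-LDA model itself: the topic names, vocabularies, probabilities, and the fixed word-type switch `WORD_TYPE_PROBS` are all hypothetical, and the sketch does not model how AEP information enters the model. It only shows a review in which each sentence is assigned a single topic, and each word in that sentence comes from the aspect or sentiment distribution of that topic or from a shared background distribution.

```python
import random

# Hypothetical toy distributions; none of these values come from the paper.
TOPICS = {
    "food": {
        "aspect": {"pizza": 0.5, "pasta": 0.3, "dessert": 0.2},
        "sentiment": {"delicious": 0.6, "bland": 0.4},
    },
    "service": {
        "aspect": {"waiter": 0.6, "staff": 0.4},
        "sentiment": {"friendly": 0.7, "slow": 0.3},
    },
}
BACKGROUND = {"the": 0.5, "we": 0.3, "restaurant": 0.2}
# Assumed fixed switch between word types (a placeholder, not the paper's mechanism).
WORD_TYPE_PROBS = {"aspect": 0.4, "sentiment": 0.3, "background": 0.3}


def sample_dirichlet(alphas, rng):
    """Draw from a Dirichlet by normalizing independent Gamma samples."""
    draws = [rng.gammavariate(a, 1.0) for a in alphas]
    total = sum(draws)
    return [d / total for d in draws]


def draw(dist, rng):
    """Draw one key from a dict mapping keys to probabilities."""
    keys = list(dist)
    return rng.choices(keys, weights=[dist[k] for k in keys], k=1)[0]


def generate_review(num_sentences, words_per_sentence, alpha, rng):
    """Generate a toy review in which every sentence has exactly one topic."""
    names = list(TOPICS)
    # Review-level mixture over topics.
    theta = sample_dirichlet([alpha] * len(names), rng)
    review = []
    for _ in range(num_sentences):
        # One topic per sentence: the sentence-level assumption.
        topic = rng.choices(names, weights=theta, k=1)[0]
        sentence = []
        for _ in range(words_per_sentence):
            word_type = draw(WORD_TYPE_PROBS, rng)
            if word_type == "background":
                dist = BACKGROUND
            else:
                dist = TOPICS[topic][word_type]
            sentence.append((draw(dist, rng), word_type))
        review.append((topic, sentence))
    return review


if __name__ == "__main__":
    for topic, sentence in generate_review(3, 5, 0.5, random.Random(0)):
        print(topic, sentence)
```

Pairing each aspect distribution with a sentiment distribution under the same topic key is one simple way to picture "mutually corresponding" aspect and sentiment topics; inference over real reviews would run this process in reverse and is not shown here.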
