Parametric and Non-parametric User-aware Sentiment Topic Models

The popularity of Web 2.0 has resulted in a large number of publicly available online consumer reviews created by a demographically diverse user base. Information about the authors of these reviews, such as age, gender and location, provided by many on-line consumer review platforms may allow companies to better understand the preferences of different market segments and improve their product design, manufacturing processes and marketing campaigns accordingly. However, previous work in sentiment analysis has largely ignored these additional user meta-data. To address this deficiency, in this paper, we propose parametric and non-parametric User-aware Sentiment Topic Models (USTM) that incorporate demographic information of review authors into topic modeling process in order to discover associations between market segments, topical aspects and sentiments. Qualitative examination of the topics discovered using USTM framework in the two datasets collected from popular online consumer review platforms as well as quantitative evaluation of the methods utilizing those topics for the tasks of review sentiment classification and user attribute prediction both indicate the utility of accounting for demographic information of review authors in opinion mining.

[1]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[2]  Xu Ling,et al.  Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[3]  Thomas L. Griffiths,et al.  The nested chinese restaurant process and bayesian nonparametric inference of topic hierarchies , 2007, JACM.

[4]  Susan T. Dumais,et al.  Partially labeled topic models for interpretable text mining , 2011, KDD.

[5]  Sheng Wang,et al.  SUIT: A Supervised User-Item Based Topic Model for Sentiment Analysis , 2014, AAAI.

[6]  Eugene Agichtein,et al.  Geographical Latent Variable Models for Microblog Retrieval , 2015, ECIR.

[7]  Xiaojun Wan,et al.  Opinion Target Extraction in Chinese News Comments , 2010, COLING.

[8]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[9]  Arjun Mukherjee,et al.  Aspect Extraction through Semi-Supervised Modeling , 2012, ACL.

[10]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[11]  Regina Barzilay,et al.  Content Models with Attitude , 2011, ACL.

[12]  Long Jiang,et al.  User-level sentiment analysis incorporating social networks , 2011, KDD.

[13]  Ivan Titov,et al.  A Joint Model of Text and Aspect Ratings for Sentiment Summarization , 2008, ACL.

[14]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[15]  Yulan He,et al.  Joint sentiment/topic model for sentiment analysis , 2009, CIKM.

[16]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.

[17]  Masaru Kitsuregawa,et al.  Modeling User Leniency and Product Popularity for Sentiment Classification , 2013, IJCNLP.

[18]  Iryna Gurevych,et al.  Extracting Opinion Targets in a Single and Cross-Domain Setting with Conditional Random Fields , 2010, EMNLP.

[19]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[20]  Dongwoo Kim,et al.  Modeling topic hierarchies with the recursive chinese restaurant process , 2012, CIKM.

[21]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[22]  Hao Yu,et al.  Structure-Aware Review Mining and Summarization , 2010, COLING.

[23]  Martin Ester,et al.  ILDA: interdependent LDA model for learning latent aspects and their ratings from online product reviews , 2011, SIGIR.

[24]  Alice H. Oh,et al.  A Hierarchical Aspect-Sentiment Model for Online Reviews , 2013, AAAI.

[25]  David M. Blei,et al.  Supervised Topic Models , 2007, NIPS.

[26]  Himabindu Lakkaraju,et al.  Exploiting Coherence for the Simultaneous Discovery of Latent Facets and associated Sentiments , 2011, SDM.

[27]  Li Li,et al.  User-sentiment topic model: refining user's topics with sentiment information , 2012, MDS '12.

[28]  Stefan M. Rüger,et al.  Weakly Supervised Joint Sentiment-Topic Detection from Text , 2012, IEEE Transactions on Knowledge and Data Engineering.

[29]  Yue Lu,et al.  Latent aspect rating analysis on review text data: a rating regression approach , 2010, KDD.

[30]  Eugene Agichtein,et al.  Leveraging geographical metadata to improve search over social media , 2013, WWW.

[31]  Eugene Agichtein,et al.  The importance of being socially-savvy: quantifying the influence of social networks on microblog retrieval , 2013, CIKM.