Complementary Aspect-Based Opinion Mining

Aspect-based opinion mining is finding elaborate opinions towards a subject such as a product or an event. With explosive growth of opinionated texts on the Web, mining aspect-level opinions has become a promising means for online public opinion analysis. In particular, the boom of various types of online media provides diverse yet complementary information, bringing unprecedented opportunities for cross media aspect-opinion mining. Along this line, we propose CAMEL, a novel topic model for complementary aspect-based opinion mining across asymmetric collections. CAMEL gains information complementarity by modeling both common and specific aspects across collections, while keeping all the corresponding opinions for contrastive study. An auto-labeling scheme called AME is also proposed to help discriminate between aspect and opinion words without elaborative human labeling, which is further enhanced by adding word embedding-based similarity as a new feature. Moreover, CAMEL-DP, a nonparametric alternative to CAMEL is also proposed based on coupled Dirichlet Processes. Extensive experiments on real-world multi-collection reviews data demonstrate the superiority of our methods to competitive baselines. This is particularly true when the information shared by different collections becomes seriously fragmented. Finally, a case study on the public event “2014 Shanghai Stampede” demonstrates the practical value of CAMEL for real-world applications.

[1]  Martin Ester,et al.  Aspect-based opinion mining from product reviews , 2012, SIGIR '12.

[2]  Jiawei Han,et al.  Mining Multi-aspect Reflection of News Events in Twitter: Discovery, Linking and Presentation , 2015, 2015 IEEE International Conference on Data Mining.

[3]  Martin Ester,et al.  Opinion digger: an unsupervised opinion miner from unstructured product reviews , 2010, CIKM.

[4]  Wei Chen,et al.  ASEM: Mining Aspects and Sentiment of Events from Microblog , 2015, CIKM.

[5]  Thomas Hofmann,et al.  Probabilistic latent semantic indexing , 1999, SIGIR '99.

[6]  Yulan He,et al.  Joint sentiment/topic model for sentiment analysis , 2009, CIKM.

[7]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[8]  Luo Si,et al.  Mining contrastive opinions on political texts using cross-perspective topic model , 2012, WSDM '12.

[9]  Nigel Collier,et al.  A partially supervised cross-collection topic model for cross-domain text classification , 2013, CIKM.

[10]  David Buttler,et al.  Exploring Topic Coherence over Many Models and Many Topics , 2012, EMNLP.

[11]  Wray L. Buntine,et al.  Topic Model : Extracting Product Opinions from Tweets by Leveraging Hashtags and Sentiment Lexicon , 2014 .

[12]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[13]  Noémie Elhadad,et al.  An Unsupervised Aspect-Sentiment Model for Online Reviews , 2010, NAACL.

[14]  John W. Fisher,et al.  Coupling Nonparametric Mixtures via Latent Dirichlet Processes , 2012, NIPS.

[15]  Peng Li,et al.  Joint topic modeling for event summarization across news and social media streams , 2012, CIKM.

[16]  Fei Wang,et al.  Complementary Aspect-Based Opinion Mining Across Asymmetric Collections , 2015, 2015 IEEE International Conference on Data Mining.

[17]  Bei Yu,et al.  A cross-collection mixture model for comparative text mining , 2004, KDD.

[18]  Martin Ester,et al.  On the design of LDA models for aspect-based opinion mining , 2012, CIKM.

[19]  Zhong Su,et al.  Product feature categorization with multilevel latent semantic association , 2009, CIKM.

[20]  Xu Ling,et al.  Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[21]  Bing Liu,et al.  Mining and summarizing customer reviews , 2004, KDD.

[22]  Martin Ester,et al.  FLAME: A Probabilistic Model Combining Aspect Based Opinion Mining and Collaborative Filtering , 2015, WSDM.

[23]  Michael J. Paul,et al.  Cross-Cultural Analysis of Blogs and Forums with Mixed-Collection Topic Models , 2009, EMNLP.

[24]  Andrew McCallum,et al.  Optimizing Semantic Coherence in Topic Models , 2011, EMNLP.

[25]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.

[26]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[27]  P. Müller,et al.  A method for combining inference across related nonparametric Bayesian models , 2004 .

[28]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[29]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[30]  Hongfei Yan,et al.  Jointly Modeling Aspects and Opinions with a MaxEnt-LDA Hybrid , 2010, EMNLP.

[31]  Xuanjing Huang,et al.  Phrase Dependency Parsing for Opinion Mining , 2009, EMNLP.

[32]  Sang Jeong Lee,et al.  Aspect-level news browsing: understanding news events from multiple viewpoints , 2010, IUI '10.

[33]  Hiroshi Nakagawa,et al.  The Hybrid Nested/Hierarchical Dirichlet Process and its Application to Topic Modeling with Word Differentiation , 2015, AAAI.

[34]  Rohini K. Srihari,et al.  OpinionMiner: a novel machine learning system for web opinion mining and extraction , 2009, KDD.