Mining contrastive opinions on political texts using cross-perspective topic model

This paper presents a novel opinion mining research problem, which is called Contrastive Opinion Modeling (COM). Given any query topic and a set of text collections from multiple perspectives, the task of COM is to present the opinions of the individual perspectives on the topic, and furthermore to quantify their difference. This general problem subsumes many interesting applications, including opinion summarization and forecasting, government intelligence and cross-cultural studies. We propose a novel unsupervised topic model for contrastive opinion modeling. It simulates the generative process of how opinion words occur in the documents of different collections. The ad hoc opinion search process can be efficiently accomplished based on the learned parameters in the model. The difference of perspectives can be quantified in a principled way by the Jensen-Shannon divergence among the individual topic-opinion distributions. An extensive set of experiments have been conducted to evaluate the proposed model on two datasets in the political domain: 1) statement records of U.S. senators; 2) world news reports from three representative media in U.S., China and India, respectively. The experimental results with both qualitative and quantitative analysis have shown the effectiveness of the proposed model.

[1]  Craig MacDonald,et al.  Overview of the TREC 2006 Blog Track , 2006, TREC.

[2]  Yulan He,et al.  Joint sentiment/topic model for sentiment analysis , 2009, CIKM.

[3]  Daniel Kifer,et al.  What Is an Opinion About? Exploring Political Standpoints Using Opinion Scoring Model , 2010, AAAI.

[4]  Koji Eguchi,et al.  Sentiment Retrieval using Generative Models , 2006, EMNLP.

[5]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[6]  Zhoujun Li,et al.  Comparable Entity Mining from Comparative Questions , 2010, ACL.

[7]  Alice H. Oh,et al.  Aspect and sentiment unification model for online review analysis , 2011, WSDM '11.

[8]  Bing Liu,et al.  Mining Opinion Features in Customer Reviews , 2004, AAAI.

[9]  Hsin-Hsi Chen,et al.  Overview of Opinion Analysis Pilot Task at NTCIR-6 , 2007, NTCIR.

[10]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[11]  Lillian Lee,et al.  Opinion Mining and Sentiment Analysis , 2008, Found. Trends Inf. Retr..

[12]  Min Zhang,et al.  A generation model to unify topic relevance and lexicon-based sentiment for opinion retrieval , 2008, SIGIR '08.

[13]  Ivan Titov,et al.  A Joint Model of Text and Aspect Ratings for Sentiment Summarization , 2008, ACL.

[14]  Michael J. Paul,et al.  Summarizing Contrastive Viewpoints in Opinionated Text , 2010, EMNLP.

[15]  Bei Yu,et al.  A cross-collection mixture model for comparative text mining , 2004, KDD.

[16]  Michael J. Paul,et al.  Cross-Cultural Analysis of Blogs and Forums with Mixed-Collection Topic Models , 2009, EMNLP.

[17]  Noah A. Smith,et al.  Predicting Response to Political Blog Posts with Topic Models , 2009, NAACL.

[18]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Xu Ling,et al.  Topic sentiment mixture: modeling facets and opinions in weblogs , 2007, WWW '07.

[20]  Jianhua Lin,et al.  Divergence measures based on the Shannon entropy , 1991, IEEE Trans. Inf. Theory.

[21]  Stefan Kaufmann,et al.  Language and Ideology in Congress , 2011, British Journal of Political Science.

[22]  Michael I. Jordan,et al.  Modeling annotated data , 2003, SIGIR.

[23]  Ryoji Kataoka,et al.  Opinion Sentence Search Engine on Open-Domain Blog , 2007, IJCAI.

[24]  Claire Cardie,et al.  OpinionFinder: A System for Subjectivity Analysis , 2005, HLT.

[25]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[26]  Soo-Min Kim,et al.  Determining the Sentiment of Opinions , 2004, COLING.

[27]  Wei-Hao Lin,et al.  Which Side are You on? Identifying Perspectives at the Document and Sentence Levels , 2006, CoNLL.

[28]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.

[29]  Ryan T. McDonald,et al.  Contrastive Summarization: An Experiment with Consumer Reviews , 2009, NAACL.

[30]  Razvan C. Bunescu,et al.  Sentiment analyzer: extracting sentiments about a given topic using natural language processing techniques , 2003, Third IEEE International Conference on Data Mining.

[31]  Wei Zhang,et al.  Opinion retrieval from blogs , 2007, CIKM '07.

[32]  Philip S. Yu,et al.  A holistic lexicon-based approach to opinion mining , 2008, WSDM '08.

[33]  Bo Pang,et al.  Thumbs up? Sentiment Classification using Machine Learning Techniques , 2002, EMNLP.

[34]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[35]  Matt Thomas,et al.  Get out the vote: Determining support or opposition from Congressional floor-debate transcripts , 2006, EMNLP.

[36]  Kamal Nigam,et al.  Retrieving topical sentiments from online document collections , 2003, IS&T/SPIE Electronic Imaging.

[37]  Kathleen R. McKeown,et al.  Predicting the semantic orientation of adjectives , 1997 .

[38]  Wei-Hao Lin,et al.  Are These Documents Written from Different Perspectives? A Test of Different Perspectives Based on Statistical Distribution Divergence , 2006, ACL.