Exploiting Structured Ontology to Organize Scattered Online Opinions

We study the problem of integrating scattered online opinions. For this purpose, we propose to exploit structured ontology to obtain well-formed relevant aspects to a topic and use them to organize scattered opinions to generate a structured summary. Particularly, we focus on two main challenges in implementing this idea, (1) how to select the most useful aspects from a large number of aspects in the ontology and (2) how to order the selected aspects to optimize the readability of the structured summary. We propose and explore several methods for solving these challenges. Experimental results on two different data sets (US Presidents and Digital Cameras) show that the proposed methods are effective for selecting aspects that can represent the major opinions and for generating coherent ordering of aspects.

[1]  ChengXiang Zhai,et al.  Automatic labeling of multinomial topic models , 2007, KDD '07.

[2]  Jing Zhao,et al.  Learning to Generate Labels for Organizing Search Results from a Domain-Specified Corpus , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2006 Main Conference Proceedings)(WI'06).

[3]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[4]  John D. Lafferty,et al.  Model-based feedback in the language modeling approach to information retrieval , 2001, CIKM '01.

[5]  Bing Liu,et al.  Opinion observer: analyzing and comparing opinions on the Web , 2005, WWW '05.

[6]  Eric K. Ringger,et al.  Pulse: Mining Customer Opinions from Free Text , 2005, IDA.

[7]  Wei-Ying Ma,et al.  Learning to cluster web search results , 2004, SIGIR '04.

[8]  Susan T. Dumais,et al.  Bringing order to the Web: automatically categorizing search results , 2000, CHI.

[9]  Oren Etzioni,et al.  Extracting Product Features and Opinions from Reviews , 2005, HLT.

[10]  Mika Käki,et al.  Optimizing the number of search result categories , 2005, CHI Extended Abstracts.

[11]  Chao Liu,et al.  A probabilistic approach to spatiotemporal theme pattern mining on weblogs , 2006, WWW '06.

[12]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[13]  Yue Lu,et al.  Opinion integration through semi-supervised topic modeling , 2008, WWW.

[14]  W. Bruce Croft,et al.  An Evaluation of Techniques for Clustering Search Results , 2005 .

[15]  Regina Barzilay,et al.  Automatically Generating Wikipedia Articles: A Structure-Aware Approach , 2009, ACL.

[16]  Giuseppe Carenini,et al.  Extracting knowledge from evaluative text , 2005, K-CAP '05.

[17]  Ivan Titov,et al.  Modeling online reviews with multi-grain topic models , 2008, WWW.