Learning to Retrieve Opinions

Opinion retrieval, a relatively new information retrieval task, has attracted a considerable amount of attention in recent years. Most existing work first computes a topic relevance score and an opinion relevance score for each document and then merges the two into a final ranking score using a combination function. A major problem with these approaches is that the combination function is fixed in advance, regardless of domain, even though there is no evidence that the two scores should be combined in the same way everywhere. In this paper, we propose to learn the combination function automatically for retrieval tasks in different domains, using the popular Genetic Programming framework as the learner. To carry out the full opinion retrieval task, we also design a novel opinion retrieval system that computes the topic and opinion relevance scores and then learns the optimal combination function to integrate them. In experiments on a public dataset, we compare our system with other state-of-the-art approaches, and the results show that it performs comparably to them.
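The idea of learning a score combination function with genetic programming can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the paper's actual setup: the operator set (`add`, `mul`, `max`), the use of average precision as the fitness measure, the truncation selection scheme, the population and generation sizes, and the toy documents. Each candidate function is an expression tree over a document's topic score and opinion score, and the search keeps the trees that rank relevant documents highest.

```python
import operator
import random

# Hypothetical primitive set; the abstract does not list the GP operators used.
OPS = {"add": operator.add, "mul": operator.mul, "max": max}
TERMINALS = ("topic", "opinion")


def random_tree(rng, depth=2):
    """Build a random expression tree over the two relevance scores."""
    if depth == 0 or rng.random() < 0.3:
        return rng.choice(TERMINALS)
    op = rng.choice(sorted(OPS))
    return (op, random_tree(rng, depth - 1), random_tree(rng, depth - 1))


def evaluate(tree, topic, opinion):
    """Compute the combined ranking score for one document."""
    if tree == "topic":
        return topic
    if tree == "opinion":
        return opinion
    op, left, right = tree
    return OPS[op](evaluate(left, topic, opinion), evaluate(right, topic, opinion))


def average_precision(tree, docs):
    """Fitness (assumed): average precision of the ranking induced by the tree.

    docs is a list of (topic_score, opinion_score, is_relevant) triples.
    """
    ranked = sorted(docs, key=lambda d: evaluate(tree, d[0], d[1]), reverse=True)
    hits, total = 0, 0.0
    for rank, (_, _, relevant) in enumerate(ranked, start=1):
        if relevant:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def random_subtree(tree, rng):
    """Walk down the tree at random and return the subtree reached."""
    while isinstance(tree, tuple) and rng.random() < 0.5:
        tree = rng.choice(tree[1:])
    return tree


def graft(tree, donor, rng):
    """Replace a randomly chosen subtree of `tree` with `donor`."""
    if isinstance(tree, str) or rng.random() < 0.3:
        return donor
    op, left, right = tree
    if rng.random() < 0.5:
        return (op, graft(left, donor, rng), right)
    return (op, left, graft(right, donor, rng))


def evolve(docs, generations=20, pop_size=30, seed=0):
    """Evolve a combination function; the best half survives each generation."""
    rng = random.Random(seed)
    population = [random_tree(rng) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=lambda t: average_precision(t, docs), reverse=True)
        parents = population[: pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            child = graft(a, random_subtree(b, rng), rng)  # crossover
            if rng.random() < 0.2:
                child = graft(child, random_tree(rng), rng)  # mutation
            children.append(child)
        population = parents + children
    return max(population, key=lambda t: average_precision(t, docs))


# Toy training data (made up): (topic score, opinion score, relevant and opinionated?)
docs = [(0.9, 0.1, False), (0.6, 0.8, True), (0.5, 0.7, True), (0.2, 0.9, False)]
best = evolve(docs)
print(best, average_precision(best, docs))
```

On this toy data neither score alone ranks the relevant documents first, while their product does, which is the kind of domain-dependent combination the learning step is meant to discover. A real system would train on judged queries from the target domain and evaluate on held-out queries.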
