Discovering intermediate entities from two examples by using web search engine indices

We propose a system for finding intermediate entities from two examples by using web search engine indices. For example, a user wants to find recipients of the Nobel Peace Prize in the thirty years between Mother Teresa in 1979 and Barack Obama in 2009. In this example, the answer is, for example, Kofi Atta Annan. In this situation, the user wants to find something intermediate between two entities. We first describe the problem of finding entities between two examples. We then propose a system for extracting intermediate entities between two inputs by using a Web search engine indices. The system focuses on the positions of terms in Web pages and then extracts candidate terms that are likely to appear between the two inputs. Then, our system ranks candidate terms based on term frequencies and positions. Finally, we conducted experiments to show the usefulness of our system.

[1]  Philip S. Yu,et al.  Discovering unexpected information from your competitors' web sites , 2001, KDD '01.

[2]  Katherine A. Heller,et al.  Bayesian Sets , 2005, NIPS.

[3]  Neil Rubens,et al.  Commonly perceived order within a category , 2007 .

[4]  Sanda M. Harabagiu,et al.  Answering complex questions with random walk models , 2006, SIGIR '06.

[5]  Katsumi Tanaka,et al.  Unsupervised Discovery of Coordinate Terms for Multiple Aspects from Search Engine Query Logs , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[6]  Kentaro Torisawa,et al.  A simple WWW-based method for semantic word class acquisition , 2007 .

[7]  Zheng Chen,et al.  CWS: a comparative web search system , 2006, WWW '06.

[8]  Bei Yu,et al.  A cross-collection mixture model for comparative text mining , 2004, KDD.

[9]  ChengXiang Zhai,et al.  CTMS : A Comparative Text Mining System , 2005 .

[10]  Satoshi Nakamura,et al.  SyncRerank: Reranking Multi Search Results Based on Vertical and Horizontal Propagation of User Intention , 2008, WISE.

[11]  William W. Cohen,et al.  Language-Independent Set Expansion of Named Entities Using the Web , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[12]  Katsumi Tanaka,et al.  A comparative web browser (CWB) for browsing and comparing web pages , 2003, WWW '03.

[13]  Masashi Sugiyama,et al.  Order Retrieval , 2008, LKR.

[14]  Bing Liu,et al.  Visualizing web site comparisons , 2002, WWW '02.

[15]  Toshio Uchiyama,et al.  Ranking Entities Using Comparative Relations , 2008, DEXA.

[16]  Chunqiang Tang,et al.  Answering relationship queries on the web , 2007, WWW '07.

[17]  Daniel Mahler,et al.  Holistic Query Expansion Using Graphical Models , 2004, New Directions in Question Answering.

[18]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[19]  Katsumi Tanaka,et al.  Searching Coordinate Terms with Their Context from the Web , 2006, WISE.