Completing lists of entities

We consider the list completion task, an entity retrieval task where, in return to a topic statement and a number of example entities, systems have to return further examples. For this task, we propose and evaluate several algorithms. One of the core challenges is to overcome the very limited amount of information that serves as input --- to address this challenge we explore different representations of list descriptions. For evaluation purposes we make use of the lists and categories available in Wikipedia. Experimental results show that cluster-based contexts improve retrieval results.

[1]  Jennifer Chu-Carroll,et al.  IBM's PIQUANT II in TREC 2004 , 2004, TREC.

[2]  V. Zlatic,et al.  Wikipedias: collaborative web-based encyclopedias as complex networks. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Evgeniy Gabrilovich,et al.  Overcoming the Brittleness Bottleneck using Wikipedia: Enhancing Text Categorization with Encyclopedic Knowledge , 2006, AAAI.

[4]  Malvina Nissim,et al.  Question Answering with QED at TREC 2005 , 2005, TREC.

[5]  David Hawking,et al.  Panoptic Expert: Searching for experts not just for documents , 2001 .

[6]  Ellen M. Voorhees,et al.  Overview of TREC 2005 , 2005, TREC.

[7]  Leif Azzopardi,et al.  Incorporating context within the language modeling approach for ad hoc information retrieval , 2006, SIGF.

[8]  W. Bruce Croft,et al.  Hierarchical Language Models for Expert Finding in Enterprise Corpora , 2008, Int. J. Artif. Intell. Tools.

[9]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.

[10]  Maarten de Rijke,et al.  Finding experts and their eetails in e-mail corpora , 2006, WWW '06.

[11]  Katherine A. Heller,et al.  Bayesian Sets , 2005, NIPS.

[12]  M. de Rijke,et al.  Entity Retrieval , 2007 .

[13]  Ludovic Denoyer,et al.  The Wikipedia XML Corpus , 2006, INEX.

[14]  Nick Craswell,et al.  Overview of the TREC 2005 Enterprise Track , 2005, TREC.

[15]  M. de Rijke,et al.  Estimating Importance Features for Fact Mining (With a Case Study in Biography Mining) , 2007, RIAO.

[16]  W. Bruce Croft,et al.  Cluster-based retrieval using language models , 2004, SIGIR '04.

[17]  M. de Rijke,et al.  Discovering missing links in Wikipedia , 2005, LinkKDD '05.

[18]  C. J. van Rijsbergen,et al.  The use of hierarchic clustering in information retrieval , 1971, Inf. Storage Retr..

[19]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[20]  Valentin Jijkoun,et al.  WiQA: Evaluating Multi-lingual Focused Access to Wikipedia , 2007, EVIA@NTCIR.

[21]  W. Bruce Croft,et al.  Hierarchical Language Models for Expert Finding in Enterprise Corpora , 2006, 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'06).

[22]  W. Bruce Croft,et al.  Relevance Models in Information Retrieval , 2003 .