Category-Based Query Modeling for Entity Search

Users often search for entities instead of documents and in this setting are willing to provide extra input, in addition to a query, such as category information and example entities. We propose a general probabilistic framework for entity search to evaluate and provide insight in the many ways of using these types of input for query modeling. We focus on the use of category information and show the advantage of a category-based representation over a term-based representation, and also demonstrate the effectiveness of category-based expansion using example entities. Our best performing model shows very competitive performance on the INEX-XER entity ranking and list completion tasks.

[1]  Katherine A. Heller,et al.  Bayesian Sets , 2005, NIPS.

[2]  Wei Lu,et al.  Adapting Language Modeling Methods for Expert Search to Rank Wikipedia Entities , 2008, INEX.

[3]  Andrew Trotman,et al.  Advances in Focused Retrieval, 7th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2008, Dagstuhl Castle, Germany, December 15-18, 2008. Revised and Selected Papers , 2009, INEX.

[4]  Gilad Mishne,et al.  A Study of Blog Search , 2006, ECIR.

[5]  Stefan M. Rüger,et al.  Integrating Document Features for Entity Ranking , 2008, INEX.

[6]  Leif Azzopardi,et al.  An analysis on document length retrieval trends in language modeling smoothing , 2008, Information Retrieval.

[7]  James A. Thom,et al.  Using Wikipedia Categories and Links in Entity Ranking , 2007, INEX.

[8]  Azadeh Shakery,et al.  Toward Entity Retrieval over Structured and Text Data , 2004 .

[9]  Nick Craswell,et al.  L3S at INEX 2008: Retrieving Entities Using Structured Information , 2008, INEX.

[10]  Daniel E. Rose,et al.  Understanding user goals in web search , 2004, WWW '04.

[11]  Emine Yilmaz,et al.  A simple and efficient sampling method for estimating AP and NDCG , 2008, SIGIR '08.

[12]  Peter Ingwersen,et al.  Developing a Test Collection for the Evaluation of Integrated Search , 2010, ECIR.

[13]  W. Bruce Croft,et al.  A general language model for information retrieval , 1999, CIKM '99.

[14]  Jovan Pehcevski,et al.  Topic Difficulty Prediction in Entity Ranking , 2008, INEX.

[15]  M. de Rijke,et al.  A few examples go a long way: constructing query models from elaborate query formulations , 2008, SIGIR '08.

[16]  M. de Rijke,et al.  Entity Retrieval , 2007 .

[17]  James A. Thom,et al.  Entity ranking in Wikipedia , 2007, SAC '08.

[18]  Jack G. Conrad,et al.  A system for discovering relationships by feature extraction from text databases , 1994, SIGIR '94.

[19]  Wouter Weerkamp,et al.  A Generative Language Modeling Approach for Ranking Entities , 2008, INEX.

[20]  Giuseppe Attardi,et al.  Ranking very many typed entities on wikipedia , 2007, CIKM '07.

[21]  James Allan,et al.  An Exploration of Entity Models, Collective Classification and Relation Description , 2004 .

[22]  Peter Bailey,et al.  Overview of the TREC 2007 Enterprise Track | NIST , 2008 .

[23]  Paavo Arvola,et al.  Entity Ranking Based on Category Expansion , 2008, INEX.

[24]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[25]  Jaap Kamps,et al.  Finding Entities in Wikipedia Using Links and Categories , 2008, INEX.

[26]  Krisztian Balog,et al.  People search in the enterprise , 2007, SIGF.

[27]  Mounia Lalmas,et al.  Overview of the INEX 2007 Entity Ranking Track , 2008, INEX.

[28]  Djoerd Hiemstra,et al.  Structured Document Retrieval, Multimedia Retrieval, and Entity Ranking Using PF/Tijah , 2008, INEX.

[29]  Ellen M. Voorhees,et al.  Overview of the TREC 2004 Novelty Track. , 2005 .

[30]  Gianluca Demartini,et al.  Overview of the INEX 2008 Entity Ranking Track , 2009, INEX.

[31]  M. de Rijke,et al.  Formal models for expert finding in enterprise corpora , 2006, SIGIR.

[32]  Peter Bailey,et al.  Overview of the TREC 2008 Enterprise Track , 2008, TREC.

[33]  Andrew Trotman,et al.  Focused Access to XML Documents, 6th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2007, Dagstuhl Castle, Germany, December 17-19, 2007. Selected Papers , 2008, INEX.