Improving Context and Category Matching for Entity Search

Entity search retrieves a ranked list of named entities of the target types in response to a given query. In this paper, we propose an approach to entity search that formalizes both context matching and category matching. In addition, we propose a result re-ranking strategy that can be easily adapted to achieve a hybrid of two context matching strategies. Experiments on the INEX 2009 entity ranking task show that the proposed approach significantly improves entity search performance over existing solutions (xinfAP from 0.27 to 0.39).
