Example Based Entity Search in the Web of Data

The scale of today's Web of Data motivates the use of keyword search-based approaches to entity-oriented search tasks in addition to traditional structure-based approaches, which require users to have knowledge of the underlying schema. We propose an alternative structure-based approach that makes use of example entities and compare its effectiveness with a text-based approach in the context of an entity list completion task. We find that both the text-based and structure-based approaches are effective in retrieving relevant entities, but that they find different sets of entities. Additionally, we find that the performance of the structure-based approach depends on the quality and number of example entities given. We experiment with a number of hybrid techniques that balance the two approaches and find that a method that uses the example entities to determine the weights of the approaches in the combination on a per-query basis is most effective.
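To make the idea of per-query weighting concrete, the following is a minimal sketch rather than the method evaluated in the paper. It assumes each approach returns a dictionary of entity scores, that scores are min-max normalized, and that the weight of each approach is proportional to the mean normalized score it assigns to the example entities. The function names (`min_max_normalize`, `example_support`, `hybrid_rank`) and the weighting rule itself are hypothetical choices made for illustration.

```python
from typing import Dict, List

Scores = Dict[str, float]


def min_max_normalize(scores: Scores) -> Scores:
    """Rescale scores to [0, 1] so the two approaches are comparable."""
    if not scores:
        return {}
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        return {entity: 1.0 for entity in scores}
    return {entity: (s - lo) / (hi - lo) for entity, s in scores.items()}


def example_support(scores: Scores, examples: List[str]) -> float:
    """Mean normalized score an approach assigns to the example entities.

    Assumption: an approach that scores the examples highly is more
    trustworthy for this particular query.
    """
    if not examples:
        return 0.0
    return sum(scores.get(e, 0.0) for e in examples) / len(examples)


def hybrid_rank(text: Scores, struct: Scores, examples: List[str]) -> List[str]:
    """Rank candidates by a per-query linear interpolation of both approaches."""
    text_n = min_max_normalize(text)
    struct_n = min_max_normalize(struct)

    # Per-query weight of the text-based approach (hypothetical rule).
    t = example_support(text_n, examples)
    s = example_support(struct_n, examples)
    lam = t / (t + s) if (t + s) > 0 else 0.5

    # The examples are already known, so they are not returned as results.
    candidates = (set(text_n) | set(struct_n)) - set(examples)
    combined = {
        e: lam * text_n.get(e, 0.0) + (1 - lam) * struct_n.get(e, 0.0)
        for e in candidates
    }
    # Sort by descending score, breaking ties alphabetically for determinism.
    return sorted(combined, key=lambda e: (-combined[e], e))
```

With this rule, a query whose example entities are found only by the structure-based approach is ranked purely by structure-based scores, while a query with no examples falls back to equal weights.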
