A Diffusion-Based Method for Entity Search

Entity search has become an import task in the Web of Data recently. Most solutions developed so far have focused on modelling entity search using standard information retrieval model and adapting graph-based objects to multi-fielded pseudo-documents. Among the models proposed to this regard, we can found bag-of-words, multi-gram, and mixtures of language models. While these works have produced interesting findings, little attention has been put on the graph structure of the Web of data. In this work, we aim to fill this gap by introducing a two-stage method based on a standard information retrieval model combined with a diffusion-based approach. We implemented and tested several diffusion models finding that heat kernel diffusion processes have a competitive performance with state-of-the-art models.

[1]  Krisztian Balog,et al.  DBpedia-Entity v2: A Test Collection for Entity Search , 2017, SIGIR.

[2]  Conor Hayes,et al.  A Random Walk Model for Entity Relatedness , 2018, EKAW.

[3]  Alexander Kotov,et al.  Fielded Sequential Dependence Model for Ad-Hoc Entity Retrieval in the Web of Data , 2015, SIGIR.

[4]  Leo Katz,et al.  A new status index derived from sociometric analysis , 1953 .

[5]  Tamara G. Kolda,et al.  Link Prediction on Evolving Data Using Matrix and Tensor Factorizations , 2009, 2009 IEEE International Conference on Data Mining Workshops.

[6]  Jens Lehmann,et al.  DBpedia: A Nucleus for a Web of Open Data , 2007, ISWC/ASWC.

[7]  Giuseppe Pirrò,et al.  Explaining and Suggesting Relatedness in Knowledge Graphs , 2015, SEMWEB.

[8]  Michael R. Lyu,et al.  DiffusionRank: a possible penicillin for web spamming , 2007, SIGIR.

[9]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[10]  Peter Mika,et al.  Ad-hoc object retrieval in the web of data , 2010, WWW '10.

[11]  Ebrahim Bagheri,et al.  Document Retrieval Model Through Semantic Linking , 2017, WSDM.

[12]  Ioana Hulpus,et al.  Path-Based Semantic Relatedness on Linked Data and Its Use to Word and Entity Disambiguation , 2015, International Semantic Web Conference.

[13]  Michael R. Berthold,et al.  Node Similarities from Spreading Activation , 2010, 2010 IEEE International Conference on Data Mining.

[14]  Wai Lam,et al.  Entity Retrieval in the Knowledge Graph with Hierarchical Entity Type and Content , 2018, ICTIR.

[15]  Ladislav Hluchý,et al.  The SemSets model for ad-hoc semantic list search , 2012, WWW.

[16]  Stefan Dietze,et al.  Improving Entity Retrieval on Structured Data , 2015, SEMWEB.

[17]  Krisztian Balog,et al.  A test collection for entity search in DBpedia , 2013, SIGIR.

[18]  Jane Greenberg,et al.  Using BM25F for semantic search , 2010, SEMSEARCH '10.

[19]  Ulrik Brandes,et al.  Pure spreading activation is pointless , 2009, CIKM.

[20]  Krisztian Balog,et al.  Exploiting Entity Linking in Queries for Entity Retrieval , 2016, ICTIR.

[21]  Risi Kondor,et al.  Diffusion kernels on graphs and other discrete structures , 2002, ICML 2002.

[22]  Shing-Tung Yau,et al.  Coverings, Heat Kernels and Spanning Trees , 1998, Electron. J. Comb..