Distributed Name-based Entity Search

Internet can be seen as a network of peers that store digital representations of entities from the real world (e.g., person, locations, events). Different peers locally represent different “versions” (i.e., different points of view) of the same real world entity. In these different versions, entities are normally identified by multiple (possibly different) names. We propose a distributed entity search based on names that aims to (i) find all the different versions of an entity starting from any name used somewhere in the network to identify such entity; and (ii) allow peers to have full control over the privacy of their local representations. We evaluate our approach by setting up a network of 150 peers on Plan- etLab. The results show that the performance of our algorithms is stable with the network growth, which is promising in terms of scalability.

[1]  Edith Cohen,et al.  Associative search in peer to peer networks: harnessing latent semantics , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[2]  Gurmeet Singh Manku,et al.  SETS: search enhanced by topic segmentation , 2003, SIGIR.

[3]  Guillaume Urvoy-Keller,et al.  Hierarchical Peer-To-Peer Systems , 2003, Parallel Process. Lett..

[4]  Sam Joseph,et al.  NeuroGrid: Semantically Routing Queries in Peer-to-Peer Networks , 2002, NETWORKING Workshops.

[5]  Jianfeng Gao,et al.  A Supervised Learning Approach to Entity Search , 2006, AIRS.

[6]  Fausto Giunchiglia,et al.  Two-layered architecture for peer-to-peer concept search , 2011 .

[7]  Wolfgang Nejdl,et al.  PCIR: Combining DHTs and peer clusters for efficient full-text P2P indexing , 2010, Comput. Networks.

[8]  Claudia Niederée,et al.  Entity Name System: The Back-Bone of an Open and Scalable Web of Data , 2008, 2008 IEEE International Conference on Semantic Computing.

[9]  Jon Crowcroft,et al.  A survey and comparison of peer-to-peer overlay network schemes , 2005, IEEE Communications Surveys & Tutorials.

[10]  Bruce M. Maggs,et al.  Efficient content location using interest-based locality in peer-to-peer systems , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[11]  David R. Karger,et al.  On the Feasibility of Peer-to-Peer Web Indexing and Search , 2003, IPTPS.

[12]  Scott Shenker,et al.  Fixing the Embarrassing Slowness of OpenDHT on PlanetLab , 2005, WORLDS.

[13]  Themis Palpanas,et al.  Towards a general entity representation model , 2009, 2009 IEEE International Conference on Information Reuse & Integration.

[14]  Hector Garcia-Molina,et al.  Semantic Overlay Networks for P2P Systems , 2004, AP2PC.

[15]  Krishna P. Gummadi,et al.  Canon in G major: designing DHTs with hierarchical structure , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[16]  Tim Moors,et al.  Survey of research towards robust peer-to-peer networks: Search methods , 2006, Comput. Networks.

[17]  Kevin Chen-Chuan Chang,et al.  Entity Search Engine: Towards Agile Best-Effort Information Integration over the Web , 2007, CIDR.