P2P Web Search: Make It Light, Make It Fly (Demo)

We propose a live demonstration of MinervaLight, a P2P Web search engine. MinervaLight combines the (previously separate) focused crawler BINGO! (to harvest Web data), the local search engine TopX, and our P2P Web search system MINERVA under one common user interface. The crawler unattendedly downloads and indexes Web data, where the scope of the focused crawl can be tailored to the thematic interest profile of the user. The result of this process is a local search index, which is used by TopX to evaluate user queries. In the background, MinervaLight continuously computes compact statistical synopses that describe a user’s local search index and publishes that information to a conceptually global, but physically fully decentralized directory. MinervaLight o! ers a search interface where users can submit queries to MINERVA. Sophisticated query routing strategies are used to identify the most promising peers for each query based on the statistical synopses in the directory. The query is forwarded to those judiciously chosen peers and evaluated based on their local indexes. These results are sent back to the query initiator and merged into a single result list. We give a live demonstration of the fully functional system.

[1]  Scott Shenker,et al.  The Architecture of PIER: an Internet-Scale Query Processor , 2005, CIDR.

[2]  Gerhard Weikum,et al.  An Efficient and Versatile Query Engine for TopX Search , 2005, VLDB.

[3]  Jie Lu,et al.  Federated Search of Text-Based Digital Libraries in Hierarchical Peer-to-Peer Networks , 2005, Workshop on Peer-to-Peer Information Retrieval.

[4]  Gerhard Weikum,et al.  P2P Content Search: Give the Web Back to the People , 2006, IPTPS.

[5]  Larry L. Peterson,et al.  The design principles of PlanetLab , 2006, OPSR.

[6]  Gerhard Weikum,et al.  MINERVA: Collaborative P2P Search , 2005, VLDB.

[7]  Richard P. Martin,et al.  PlanetP: using gossiping to build content addressable peer-to-peer information sharing communities , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.

[8]  Gustavo Alonso,et al.  P2P Web Search with MINERVA: How do you want to search tomorrow? (Demo) , 2005 .

[9]  Gerhard Weikum,et al.  P2P Directories for Distributed Web Search: From Each According to His Ability, to Each According to His Needs , 2006, 22nd International Conference on Data Engineering Workshops (ICDEW'06).

[10]  D. DeWitt,et al.  GALANX: An Efficient Peer-to-Peer Search Engine System , 2004 .

[11]  Gerhard Weikum,et al.  Efficient and decentralized PageRank approximation in a peer-to-peer web search network , 2006, VLDB.

[12]  David R. Karger,et al.  On the Feasibility of Peer-to-Peer Web Indexing and Search , 2003, IPTPS.

[13]  Sandhya Dwarkadas,et al.  Hybrid Global-Local Indexing for Efficient Peer-to-Peer Information Retrieval , 2004, NSDI.

[14]  Gerhard Weikum,et al.  Discovering and exploiting keyword and attribute-value co-occurrences to improve P2P routing indices , 2006, CIKM '06.

[15]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[16]  Gerhard Weikum,et al.  Exploiting Community Behavior for Enhanced Link Analysis and Web Search , 2006, WebDB.

[17]  Gerhard Weikum,et al.  IQN Routing: Integrating Quality and Novelty in P2P Querying and Ranking , 2006, EDBT.

[18]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[19]  Gerhard Weikum,et al.  Query-Log Based Authority Analysis for Web Information Search , 2004, WISE.

[20]  Gerhard Weikum,et al.  Bookmark-driven Query Routing in Peer-to-Peer Web Search , 2005, Workshop on Peer-to-Peer Information Retrieval.

[21]  Hector Garcia-Molina,et al.  Routing indices for peer-to-peer systems , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[22]  Gerhard Weikum,et al.  Global Document Frequency Estimation in Peer-to-Peer Web Search , 2006, WebDB.

[23]  Amin Vahdat,et al.  Efficient Peer-to-Peer Keyword Searching , 2003, Middleware.

[24]  Henrik Nottelmann,et al.  An integrated approach for searching and browsing in heterogeneous peer-to-peer networks , 2005 .

[25]  Torsten Suel,et al.  ODISSEA: A Peer-to-Peer Architecture for Scalable Web Search and Information Retrieval , 2003, WebDB.