Semantic flooding

Classifications are trees in which a link between two nodes encodes the fact that the lower node describes a topic (and contains documents about that topic) that is more specific than the topic of its parent node. In turn, multiple classifications can be connected by semantic links, which represent mappings among them and which can be computed, e.g., by ontology matching. In this paper, we describe how these two types of links can be used to define a semantic overlay network that can cover any number of peers and that can be flooded to perform a semantic search on documents, i.e., to perform semantic flooding. We have evaluated our approach by simulating networks of 10, 100, 1,000 and 10,000 peers containing classifications that are fragments of the DMoz web directory. The results are promising and show that, in our approach, only a relatively small number of peers needs to be queried in order to achieve high accuracy.
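To make the idea concrete, the following is a minimal sketch of flooding over the two kinds of links described above. It is not the algorithm evaluated in the paper: the `Node` class, the `semantic_flood` function, the `max_hops` parameter and the toy peers and topics are all hypothetical names introduced for illustration, and the sketch assumes that the query has already been matched to a starting node and that every semantic link points to an equivalent or more specific topic.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Node:
    """A classification node hosted by one peer (hypothetical structure)."""
    peer: str
    topic: str
    documents: list = field(default_factory=list)
    # Tree links: child nodes describe more specific topics.
    children: list = field(default_factory=list)
    # Semantic links: mappings to nodes of other classifications
    # (assumed here to point to equivalent or more specific topics).
    semantic_links: list = field(default_factory=list)


def semantic_flood(start, max_hops=None):
    """Breadth-first flood from `start` along tree and semantic links.

    Returns the collected documents and the peers that were queried,
    both in visiting order.
    """
    seen = {id(start)}
    queue = deque([(start, 0)])
    documents, peers = [], []
    while queue:
        node, hops = queue.popleft()
        documents.extend(node.documents)
        if node.peer not in peers:
            peers.append(node.peer)
        if max_hops is not None and hops >= max_hops:
            continue  # hop budget exhausted: do not propagate further
        for nxt in node.children + node.semantic_links:
            if id(nxt) not in seen:
                seen.add(id(nxt))
                queue.append((nxt, hops + 1))
    return documents, peers


# Toy overlay: three peers, of which only A and B are semantically linked.
networks = Node("B", "Networks", documents=["b2"])
computing = Node("B", "Computing", documents=["b1"], children=[networks])
comp_sci = Node("A", "Computer Science", documents=["a1"],
                semantic_links=[computing])
science = Node("A", "Science", documents=["a0"], children=[comp_sci])
cooking = Node("C", "Cooking", documents=["c1"])  # unrelated peer

docs, queried = semantic_flood(science)
print(docs)     # ['a0', 'a1', 'b1', 'b2']
print(queried)  # ['A', 'B']
```

In this toy overlay the flood reaches peer B through the semantic link and never contacts peer C, which illustrates why only a subset of peers may need to be queried. Passing `max_hops=1` stops the flood after one link, so only peer A is queried.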
