A Framework for Efficient Query Answering on Semantically Heterogeneous Grids

With the rapid growth of Grid computing, more and more data are generated and stored across the grid. To fully utilize the data, efficient data search and query answering mechanism is becoming a very important issue. However, the sheer amount of data and their heterogeneity nature pose challenges that current technology cannot cope with efficiently. In this paper, we propose an efficient query answering solution that integrates topology adaptation, semantic query routing, and view-based caching techniques to reduce bandwidth cost of distributed query processing while allowing efficient evaluation of complex semantic queries over large-scale, fully decentralized, and semantically heterogeneous grids. Simulated experimentations illustrate that our comprehensive query strategies effectively reduce the cost of query evaluation and improves the query performance.

[1]  Dan Suciu,et al.  The Piazza peer data management system , 2004, IEEE Transactions on Knowledge and Data Engineering.

[2]  Beng Chin Ooi,et al.  PeerDB: a P2P-based system for distributed data sharing , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[3]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[4]  Diego Calvanese,et al.  The Description Logic Handbook: Theory, Implementation, and Applications , 2003, Description Logic Handbook.

[5]  Wolfgang Nejdl Design Issues and Challenges for RDF- and Schema-Based Peer-to-Peer Systems , 2003, DBISP2P.

[6]  Felix Naumann,et al.  A research agenda for query processing in large-scale peer data management systems , 2008, Inf. Syst..

[7]  Ben Y. Zhao,et al.  An Infrastructure for Fault-tolerant Wide-area Location and Routing , 2001 .

[8]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM '01.

[9]  Scott Shenker,et al.  Making gnutella-like P2P systems scalable , 2003, SIGCOMM '03.

[10]  Dan Brickley,et al.  Resource description framework (RDF) schema specification , 1998 .

[11]  Catriel Beeri,et al.  Rewriting queries using views in description logics , 1997, PODS '97.

[12]  Ian T. Foster,et al.  The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets , 2000, J. Netw. Comput. Appl..

[13]  Ora Lassila,et al.  W3c resource description framework (rdf) model and syntax specification , 1998 .

[14]  Karl Aberer,et al.  The chatty web: emergent semantics through gossiping , 2003, WWW '03.

[15]  Lada A. Adamic,et al.  Zipf's law and the Internet , 2002, Glottometrics.

[16]  Marcelo Arenas,et al.  Semantics and complexity of SPARQL , 2006, TODS.

[17]  A. Tversky Features of Similarity , 1977 .

[18]  Peter Druschel,et al.  Pastry: Scalable, distributed object location and routing for large-scale peer-to- , 2001 .

[19]  Juan Li,et al.  Efa: an efficient content routing algorithm in large peer-to-peer overlay networks , 2003, Proceedings Third International Conference on Peer-to-Peer Computing (P2P2003).

[20]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[21]  Myoung-Ho Kim,et al.  Information Retrieval Based on Conceptual Distance in is-a Hierarchies , 1993, J. Documentation.

[22]  Gerhard Weikum,et al.  Peer-to-Peer Information Search: Semantic, Social, or Spiritual? , 2007, IEEE Data Eng. Bull..

[23]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[24]  Min Cai,et al.  RDFPeers: a scalable distributed RDF repository based on a structured peer-to-peer network , 2004, WWW '04.

[25]  Fausto Giunchiglia,et al.  Data Management for Peer-to-Peer Computing : A Vision , 2002, WebDB.

[26]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[27]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[28]  Ami Marowka,et al.  The GRID: Blueprint for a New Computing Infrastructure , 2000, Parallel Distributed Comput. Pract..

[29]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[30]  Nicholas R. Jennings,et al.  The Semantic Grid: A Future e‐Science Infrastructure , 2003 .

[31]  Juan Li,et al.  SOON: A Scalable Self-organized Overlay Network for Distributed Information Retrieval , 2008, DSOM.

[32]  Max J. Egenhofer,et al.  Determining Semantic Similarity among Entity Classes from Different Ontologies , 2003, IEEE Trans. Knowl. Data Eng..

[33]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.