A DHT-based infrastructure for ad-hoc integration and querying of semantic data

A crucial prerequisite for the deployment and success of Peer-to-Peer data management applications is the availability of metadata in a way that makes it easy to access and combine data from different sources and domains. In this paper, we argue for a unified and distributed infrastructure providing a repository for semantic data by offering location transparency and advanced query services. After discussing the challenges of such an approach, we present our solution which applies extended SPARQL-like query features for dealing with large and possibly heterogeneous data sets. We focus on the integration into efficient distributed query processing and evaluate our approach in a series of experiments.

[1]  Ben Y. Zhao,et al.  Awarded Best Student Paper! - Pond: The OceanStore Prototype , 2003 .

[2]  Karl Aberer,et al.  GridVine: An Infrastructure for Peer Information Management , 2007, IEEE Internet Computing.

[3]  Manfred Hauswirth,et al.  Estimating the number of answers with guarantees for structured queries in p2p databases , 2008, CIKM '08.

[4]  Martin Richtarsky,et al.  UniStore: Querying a DHT-based Universal Storage , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[5]  Jayant Madhavan,et al.  Web-Scale Data Integration: You can afford to Pay as You Go , 2007, CIDR.

[6]  Manfred Hauswirth,et al.  Cost-Aware Processing of Similarity Queries in Structured Overlays , 2006, Sixth IEEE International Conference on Peer-to-Peer Computing (P2P'06).

[7]  G. Weikum Querying the Internet with PIER , 2005 .

[8]  Jayant Madhavan,et al.  Web-Scale Data Integration: You can afford to Pay as You Go , 2007, CIDR.

[9]  Karl Aberer,et al.  GridVine: Building Internet-Scale Semantic Overlay Networks , 2004, SEMWEB.

[10]  Min Cai,et al.  RDFPeers: a scalable distributed RDF repository based on a structured peer-to-peer network , 2004, WWW '04.

[11]  Donald Kossmann,et al.  The Skyline operator , 2001, Proceedings 17th International Conference on Data Engineering.

[12]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[13]  Karl Aberer,et al.  Probabilistic Message Passing in Peer Data Management Systems , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[14]  Ben Y. Zhao,et al.  Pond: The OceanStore Prototype , 2003, FAST.

[15]  Alon Y. Halevy,et al.  Piazza: data management infrastructure for semantic web applications , 2003, WWW '03.

[16]  Scott Shenker,et al.  Complex Queries in Dht-based Peer-to-peer Networks , 2002 .

[17]  Karl Aberer,et al.  Advanced Peer-to-Peer Networking: The P-Grid System and its Applications , 2003, PIK Prax. Informationsverarbeitung Kommun..

[18]  Marcel Karnstedt,et al.  Cost-Aware Skyline Queries in Structured Overlays , 2007, 2007 IEEE 23rd International Conference on Data Engineering Workshop.