Towards distributed processing of RDF path queries

A technical infrastructure for storing, querying and managing RDF data is a key element of current Semantic Web development. Systems such as Jena, Sesame and the ICS-FORTH RDF Suite are widely used for building Semantic Web applications. Currently, none of these systems supports integrated querying of distributed RDF repositories. We consider this a major shortcoming, since the Semantic Web is distributed by nature. In this paper we present an architecture for querying distributed RDF repositories that extends the existing Sesame system. We discuss the implications of our architecture and propose an index structure, as well as algorithms for query processing and optimisation, for such a distributed context.
