Paper information - Using an index of precomputed joins in order to speed up SPARQL processing

Using an index of precomputed joins in order to speed up SPARQL processing

SPARQL is a query language developed by the W3C for querying RDF data sets, which represent directed graphs. Many freely available and commercial products already support SPARQL processing. The index-based optimizations currently integrated in these products typically construct indices on the subject, predicate and object of an RDF triple, the single datum of RDF data, in order to reduce the execution time of SPARQL queries. Because they query a directed graph, SPARQL queries typically contain many joins over a set of triples. We propose to construct and use an index of precomputed joins that takes advantage of the homogeneous structure of RDF data. Furthermore, we present experimental results that demonstrate the speed-up factors achievable for SPARQL processing.
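The general idea of a precomputed join index can be illustrated with a minimal Python sketch. It is not the index structure used in the paper, which the abstract does not describe. It assumes a toy in-memory triple list and precomputes only object-subject joins, keyed by the pair of predicates involved. All names (`TRIPLES`, `build_subject_index`, `build_join_index`, `query_path`) are hypothetical.

```python
from collections import defaultdict

# Toy RDF data set (assumed for illustration): (subject, predicate, object).
TRIPLES = [
    ("alice", "knows", "bob"),
    ("bob", "knows", "carol"),
    ("bob", "worksAt", "acme"),
    ("carol", "worksAt", "initech"),
]


def build_subject_index(triples):
    """Conventional single-component index: subject -> triples with that subject."""
    index = defaultdict(list)
    for triple in triples:
        index[triple[0]].append(triple)
    return index


def build_join_index(triples):
    """Precompute object-subject joins once, at index construction time.

    Maps (p1, p2) to every pair (t1, t2) with t1 = (s, p1, x) and
    t2 = (x, p2, o), i.e. the object of t1 equals the subject of t2.
    """
    by_subject = build_subject_index(triples)
    join_index = defaultdict(list)
    for t1 in triples:
        # .get avoids inserting empty entries into the defaultdict.
        for t2 in by_subject.get(t1[2], []):
            join_index[(t1[1], t2[1])].append((t1, t2))
    return join_index


def query_path(join_index, p1, p2):
    """Answer the pattern { ?s p1 ?x . ?x p2 ?o } with one index lookup."""
    return [(t1[0], t2[2]) for t1, t2 in join_index.get((p1, p2), [])]


JOIN_INDEX = build_join_index(TRIPLES)
print(query_path(JOIN_INDEX, "knows", "worksAt"))
```

In this sketch the join work moves from query time to index-build time: the two-triple pattern is answered by a single dictionary lookup, at the cost of extra storage and of maintaining the index when triples change.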
