RDF ("Resource Description Framework") is now a widely used World Wide Web Consortium standard. However, methods to index large volumes of RDF data are still in their infancy. In this paper, we focus on providing a very lightweight indexing mechanism for certain kinds of RDF queries, namely graph-based queries where there is a need to traverse edges in the graph determined by an RDF database. Our approach uses the idea of drawing circles around selected "center" vertices in the graph where the circle would encompass those vertices in the graph that are within a given distance of the "center" vertex. We come up with methods of finding such "center" vertices and identifying the radius of the circles and then leverage this to build an index called GRIN. We compare GRIN with three existing RDF indexex: Jena, Sesame. and RDFBroker. We compared (i) the time to answer graph based queries, (ii) memory needed to store the index, and (iii) the time to build the index. GRIN outperforms Jena, Sesame and RDFBroker on all three measures for graph based queries (for other types of queries, it may be worth building one of these other indexes and using it), at the expense of using a larger amount of memory when answering queries.
[1]
Krys J. Kochut,et al.
BRAHMS: A WorkBench RDF Store and High Performance Memory System for Semantic Association Discovery
,
2005,
SEMWEB.
[2]
Mario Vento,et al.
A (sub)graph isomorphism algorithm for matching large graphs
,
2004,
IEEE Transactions on Pattern Analysis and Machine Intelligence.
[3]
Vassilis Christophides,et al.
Benchmarking Database Representations of RDF/S Stores
,
2005,
SEMWEB.
[4]
Kevin Wilkinson,et al.
Application-Specific Schema Design for Storing Large RDF Datasets
,
2003,
PSSS.
[5]
Peter J. Rousseeuw,et al.
Clustering by means of medoids
,
1987
.
[6]
Dave Reynolds,et al.
Efficient RDF Storage and Retrieval in Jena2
,
2003,
SWDB.
[7]
Dave J. Beckett,et al.
The design and implementation of the redland RDF application framework
,
2001,
WWW '01.
[8]
Vassilis Christophides,et al.
The ICS-FORTH RDFSuite: Managing Voluminous RDF Description Bases
,
2001,
SemWeb.
[9]
Michael Sintek,et al.
RDFBroker: A Signature-Based High-Performance RDF Store
,
2006,
ESWC.
[10]
Frank van Harmelen,et al.
Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema
,
2002,
SEMWEB.