Similarity-Based Query Caching

With the success of the Semantic Web, infrastructures for storing and querying RDF data are gaining importance. Several systems now provide basic database functionality for RDF data. Compared to modern database systems, however, RDF storage technology still lacks sophisticated optimization methods for query processing. Current work in this direction focuses mainly on index structures that speed up access at the triple level or for special classes of queries. In this paper, we discuss semantic query caching as a high-level optimization technique for RDF querying that supplements existing work on lower-level techniques. Our approach to semantic caching is based on the notion of similarity between RDF queries, determined by the cost of transforming the result of a previous query into the result of the current one. We discuss the problem of subsumption for RDF queries, present a cost model, and derive a similarity measure for RDF queries based on this cost model and the notion of graph edit distance.
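To make the idea of a cost-based similarity concrete, the following is a minimal sketch and not the measure derived in the paper. It assumes a query can be represented as a set of triple patterns, and it uses the number of patterns that must be removed from or added to a cached query as a crude stand-in for graph edit distance (variable renaming and subsumption are ignored). The names `edit_cost` and `similarity` and the unit costs are hypothetical.

```python
from typing import FrozenSet, Tuple

# Assumption: a query is a set of (subject, predicate, object) triple patterns;
# strings starting with "?" denote variables.
Triple = Tuple[str, str, str]
Query = FrozenSet[Triple]


def edit_cost(cached: Query, new: Query,
              add_cost: float = 1.0, remove_cost: float = 1.0) -> float:
    """Cost of turning the cached query into the new one by
    removing and adding whole triple patterns (hypothetical unit costs)."""
    removed = cached - new   # patterns only in the cached query
    added = new - cached     # patterns only in the new query
    return remove_cost * len(removed) + add_cost * len(added)


def similarity(cached: Query, new: Query) -> float:
    """Similarity in [0, 1]: 1 for identical queries, 0 for disjoint ones.
    Normalised by the cost of deleting the cached query entirely
    and building the new one from scratch."""
    worst = edit_cost(cached, frozenset()) + edit_cost(frozenset(), new)
    if worst == 0:           # both queries are empty
        return 1.0
    return 1.0 - edit_cost(cached, new) / worst


q_cached: Query = frozenset({
    ("?x", "rdf:type", "ex:Person"),
    ("?x", "ex:name", "?n"),
})
q_new: Query = q_cached | {("?x", "ex:age", "?a")}

print(edit_cost(q_cached, q_new))
print(similarity(q_cached, q_new))
```

Under these assumptions, a cache would answer a new query from the cached result whose query has the highest similarity, provided the modification cost is lower than evaluating the query from scratch.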
