A hybrid approach to storage scheme for semantic Web metadata

In the near future, it is anticipated that enormous semantic Web metadata described in RDF will be appearing. Thus, some researches on the management for the semantic Web metadata have been proposed. In most previous approaches, the semantic Web metadata as a form of triple is stored in a large relational table. Since it always requires the whole table to be scanned for processing a query, it may degrade retrieval performance. In this paper, we distinguish some frequently appeared properties in the semantic Web metadata and/or frequently used properties in a user query. RDF data with these distinguished properties are independently treated and stored in corresponding property-based tables respectively. For processing a query having a specific property, we can avoid full scanning of the whole data and only have to access a corresponding table. Finally, we partially implement and evaluate the proposed scheme. The initial results show that the proposed scheme achieves better performance for queries having a specific property.

[1]  Dave Reynolds,et al.  Efficient RDF Storage and Retrieval in Jena2 , 2003, SWDB.

[2]  Brian McBride,et al.  Jena: Implementing the RDF Model and Syntax Specification , 2001, SemWeb.

[3]  Rakesh Agrawal,et al.  Storage and Querying of E-Commerce Data , 2001, VLDB.

[4]  A. Swartz MusicBrainz: A Semantic Web Service , 2002, IEEE Intell. Syst..

[5]  Frank van Harmelen,et al.  Sesame: A Generic Architecture for Storing and Querying RDF and RDF Schema , 2002, SEMWEB.

[6]  Ahmad Ashari,et al.  Storing And Querying XML Data Using RDBMS , 2004, iiWAS.

[7]  Li Ding,et al.  How the Semantic Web is Being Used: An Analysis of FOAF Documents , 2005, Proceedings of the 38th Annual Hawaii International Conference on System Sciences.

[8]  Setrag Khoshafian,et al.  A decomposition storage model , 1985, SIGMOD Conference.