An Algebra For Basic Graph Patterns

Motivated by recent developments in the dataspaces, web, and personal information management communities, we outline research directions on query processing for SPARQL, the W3C-recommended language for querying RDF triple stores. The core of each SPARQL query is a basic graph pattern (BGP). The language of BGPs is a small logic for extracting subsets of related nodes from an RDF graph. In this paper we undertake a formal study of BGP with an eye towards efficient SPARQL query evaluation. Our main contributions are (1) an algebraization of BGP, and (2) first steps towards a framework for the design of structural indexes to accelerate the processing of queries in this algebra.
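To make the notion of a BGP concrete, the following is a minimal sketch of naive BGP evaluation over a small in-memory set of triples. It is an illustration only and is not the algebra or the indexing framework developed in the paper. The names `evaluate_bgp`, `match_pattern`, `GRAPH`, and `BGP`, the sample data, and the convention that a term starting with `?` is a variable are all assumptions made for this example; blank nodes and other SPARQL features are ignored.

```python
from typing import Dict, List, Optional, Tuple

Triple = Tuple[str, str, str]   # (subject, predicate, object)
Binding = Dict[str, str]        # variable name -> bound value


def is_var(term: str) -> bool:
    """Assumed convention for this sketch: variables start with '?'."""
    return term.startswith("?")


def match_pattern(pattern: Triple, triple: Triple,
                  binding: Binding) -> Optional[Binding]:
    """Try to extend `binding` so that `pattern` matches `triple`.

    Returns the extended binding, or None if the match fails.
    """
    extended = dict(binding)
    for p_term, value in zip(pattern, triple):
        if is_var(p_term):
            # A variable must be bound consistently across all positions.
            if extended.setdefault(p_term, value) != value:
                return None
        elif p_term != value:
            # A constant must match the triple's term exactly.
            return None
    return extended


def evaluate_bgp(graph: List[Triple], bgp: List[Triple]) -> List[Binding]:
    """Evaluate a BGP (a list of triple patterns) by nested-loop matching."""
    solutions: List[Binding] = [{}]
    for pattern in bgp:
        next_solutions: List[Binding] = []
        for binding in solutions:
            for triple in graph:
                extended = match_pattern(pattern, triple, binding)
                if extended is not None:
                    next_solutions.append(extended)
        solutions = next_solutions
    return solutions


# Hypothetical sample data: a tiny RDF-like graph.
GRAPH: List[Triple] = [
    ("alice", "knows", "bob"),
    ("bob", "knows", "carol"),
    ("alice", "worksAt", "acme"),
    ("carol", "worksAt", "acme"),
]

# BGP asking for two-step "knows" chains: ?x knows ?y, ?y knows ?z.
BGP: List[Triple] = [("?x", "knows", "?y"), ("?y", "knows", "?z")]

if __name__ == "__main__":
    for solution in evaluate_bgp(GRAPH, BGP):
        print(solution)
```

Each triple pattern in this sketch is checked against every triple in the graph, so the cost grows quickly with the number of patterns. Avoiding that kind of exhaustive scan is the sort of problem that structural indexes are meant to address.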
