OGSA-DQP: A Service-Based Distributed Query Processor for the Grid

The Grid is an emerging infrastructure that supports the discovery, access and use of distributed computational resources. The emergence of a serviceoriented view of hardware and software resources on the grid raises the question as to how database management systems and technologies can best be deployed or adapted for use in such an environment. We argue that distributed query processing (DQP) can provide effective declarative support for service orchestration, and we describe an approach to service-based DQP (OGSA-DQP) on the Grid that supports queries over Grid Data Services (GDS)s provided by OGSA-DAI project, and over other services available on the Grid, thereby combining data access with analysis; uses the facilities of the OGSA to dynamically obtain the resources necessary for efficient evaluation of a distributed query; adapts techniques from parallel databases to provide implicit parallelism for complex data-intensive requests; and uses the emerging standard for GDSs to provide consistent access to database metadata and to interact with databases on the Grid. The service-based Distributed Query Processor is itself cast as a service referred to here as Grid Distributed Query Service (GDQS). In addition, OGSADQP employs another service for query evaluation referred to here as Grid Query Evaluation Service (GQES). As such, OGSA-DQP implements a service orchestration framework in two sense: both in terms of the way its internal architecture handles the construction and execution of distributed query plans and in terms of being able to query over data and analysis resources made available as services.

[1]  Reagan Moore,et al.  Data-intensive computing , 1998 .

[2]  Alexander S. Szalay,et al.  SkyQuery: A Web Service Approach to Federate Databases , 2003, CIDR.

[3]  William H. Bell,et al.  Project Spitfire - Towards Grid Web Service Databases , 2002 .

[4]  Steven Tuecke,et al.  The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration , 2002 .

[5]  P ShethAmit,et al.  Federated database systems for managing distributed, heterogeneous, and autonomous databases , 1990 .

[6]  Norman W. Paton,et al.  The design and implementation of Grid database services in OGSA‐DAI , 2005, Concurr. Pract. Exp..

[7]  Alfons Kemper,et al.  ObjectGlobe: Open Distributed Query Processing Services on the Internet , 2001, IEEE Data Eng. Bull..

[8]  Ian T. Foster,et al.  Grid Services for Distributed System Integration , 2002, Computer.

[9]  Jim Smith,et al.  Polar: an architecture for a parallel ODMG compliant object database , 2000, CIKM '00.

[10]  R. G. G. Cattell,et al.  Recent books , 2000, IEEE Spectrum.

[11]  James Snell,et al.  Introduction to Web services architecture , 2002, IBM Syst. J..

[12]  Laura M. Haas,et al.  Garlic: a new flavor of federated query processing for DB2 , 2002, SIGMOD '02.

[13]  Jim Smith,et al.  Distributed Query Processing on the Grid , 2003, Int. J. High Perform. Comput. Appl..

[14]  Tuomas Sandholm,et al.  Globus Toolkit 3 Core-A Grid Service Container Framework , 2003 .

[15]  Mary Roth,et al.  Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources , 1997, VLDB.