A Service-Oriented System to Support Data Integration on Data Grids

Data Grids provide transparent access to heterogeneous and autonomous data resources. The main contribution of this paper is the presentation of a data sharing system that (i) is tailored to data grids, (ii) supports well established and widely spread relational DBMSs, and (iii) adopts a hybrid architecture by relying on a peer model for query reformulation for retrieving semantically equivalent expressions, and on a wrapper-mediator integration model for accessing and querying distributed data sources. The system builds upon the infrastructure provided by the OGSA-DQP distributed query processor and the XMAP query reformulation algorithm. The paper discusses the implementation methodology, and also presents empirical evaluation results.

[1]  Alon Y. Halevy Data Integration: A Status Report , 2003, BTW.

[2]  Norman W. Paton,et al.  OGSA-DQP: A Service for Distributed Querying on the Grid , 2004, EDBT.

[3]  Jim Smith,et al.  Practical Adaptation to Changing Resources in Grid Query Processing , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[4]  Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2007), 14-17 May 2007, Rio de Janeiro, Brazil , 2007, CCGRID.

[5]  Dan Suciu,et al.  Schema mediation for large-scale semantic data sharing , 2005, The VLDB Journal.

[6]  Hongjun Lu,et al.  Query translation from XPath to SQL in the presence of recursive DTDs , 2009, The VLDB Journal.

[7]  Domenico Talia,et al.  Service Choreography for Data Integration on the Grid , 2005, Knowledge and Data Management in GRIDs.

[8]  Domenico Talia,et al.  XML Data Integration in OGSA Grids , 2005, DMG.

[9]  Peter Brezany,et al.  Novel mediator architectures for Grid information systems , 2005, Future Gener. Comput. Syst..

[10]  Norman W. Paton,et al.  A novel approach to resource scheduling for parallel query processing on computational grids , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[11]  Mario Antonioletti,et al.  Profiling OGSA-DAI Performance for Common Use Patterns , 2006 .

[12]  Hamid Pirahesh,et al.  System RX: one part relational, one part XML , 2005, SIGMOD '05.

[13]  Diego Calvanese,et al.  Hyper: A Framework for Peer-to-Peer Data Integration on Grids , 2004, ICSNW.

[14]  Ian T. Foster,et al.  The anatomy of the grid: enabling scalable virtual organizations , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[15]  Carole A. Goble,et al.  The Semantic Grid: Myth Busting and Bridge Building , 2004, ECAI.

[16]  Jim Smith,et al.  Service-Based Distributed Querying on the Grid , 2003, ICSOC.