The grid relational catalog project

Today many DataGrid applications need to manage and process very large amounts of data distributed across multiple grid nodes and stored in heterogeneous databases. Grids encourage and promote the publication, sharing and integration of scientific data (distributed across several Virtual Organizations) in a more open manner than is currently the case, and many e-Science projects urgently need to interconnect legacy and independently operated databases through a set of data access and integration services. The complexity of data management within a Computational Grid comes from the distribution, scale and heterogeneity of data sources. A set of dynamic and adaptive services could address specific issues related to automatic data management, providing high performance and transparency while fully exploiting the grid infrastructure. These services should include data migration and integration, discovery of data sources and so on, providing a transparent and dynamic layer of data virtualization. In this paper we introduce the Grid-DBMS concept, a framework for dynamic data management in a grid environment, highlighting its requirements, architecture, components and services. We also present an overview of the Grid Relational Catalog Project (GRelC), developed at the CACT/ISUFI of the University of Lecce, which represents a partial implementation of a Grid-DBMS for the Globus Community.
