A General Data Grid: Framework and Implementation

Today, data grids have become an important emerging platform for managing and processing a very large amount of data distributed across multiple grid nodes and stored in relational databases. However, there are still obstacles for potential grid users to be involved into the trend and the data grid application development is far from the data grids. While the traditional established data grid architectures are not particularly suitable for the some grid service, such as data replica services, user authentication services, and the Optimal Path selection services. Our project aims to exploit a novel architecture named General Data Grid, which integrates the metadata services, data replica services, java message services, and the Optimal Path selection on Data Grid environment. At the end of this paper, we describe the key implement on the GDGrid and present a simple example application concerning finding the optimal route. Our experiment of GDGrid shows the algorithm of the route selection (Heart-Beat algorithm) is effectively, improve the performance greatly, and afford fault-tolerance management to the great extent.

[1]  Ian T. Foster,et al.  The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets , 2000, J. Netw. Comput. Appl..

[2]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[3]  Kurt Stockinger,et al.  Simulation of Dynamic Grid Replication Strategies in OptorSim , 2002, GRID.

[4]  Ami Marowka,et al.  The GRID: Blueprint for a New Computing Infrastructure , 2000, Parallel Distributed Comput. Pract..

[5]  B. Segal,et al.  Grid computing: the European Data Grid Project , 2000, 2000 IEEE Nuclear Science Symposium. Conference Record (Cat. No.00CH37149).

[6]  Nong Xiao,et al.  GridDaen: A Data Grid Engine , 2003, GCC.

[7]  Reagan Moore,et al.  Data Grids, Digital Libraries, and Persistent Archives: An Integrated Approach to Sharing, Publishing, and Archiving Data , 2005, Proceedings of the IEEE.

[8]  Carl Kesselman,et al.  GriPhyN and LIGO, building a virtual data Grid for gravitational wave scientists , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[9]  Sang-Min Park,et al.  Chameleon: a resource scheduler in a data grid environment , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[10]  Italo Epicoco,et al.  Early experiences with the GridFTP protocol using the GRB-GSIFTP library , 2002, Future Gener. Comput. Syst..