Automatic and Reliable Distribution of Data in Grids over Globus Toolkit

This paper presents a strategy to develop an infrastructure for reliable distribution of data in Grids. We used the data replication services of Globus Toolkit 4.0, but extended their functionality in order to improve the reliability of the overall data distribution in different failure scenarios. Our solution makes the data distribution process automatic, based on distribution patterns the user provides when he configures the distribution system built on our infrastructure. We implemented the proposed solution, tested its functionality and measured its efficiency.

[1]  Carl Kesselman,et al.  Wide area data replication for scientific collaborations , 2005, Int. J. High Perform. Comput. Netw..

[2]  Ian T. Foster,et al.  The anatomy of the grid: enabling scalable virtual organizations , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[3]  Adrian Colesa,et al.  Providing High Data Availability in MedioGRID , 2006, 2006 Eighth International Symposium on Symbolic and Numeric Algorithms for Scientific Computing.

[4]  Dorian Gorgan,et al.  Satellite Image Processing Applications in MedioGRID , 2006, 2006 Fifth International Symposium on Parallel and Distributed Computing.

[5]  Javier Jaén Martínez,et al.  Data Management in an International Data Grid Project , 2000, GRID.

[6]  Peter Z. Kunszt,et al.  Giggle: A Framework for Constructing Scalable Replica Location Services , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[7]  Carl Kesselman,et al.  Performance and scalability of a replica location service , 2004, Proceedings. 13th IEEE International Symposium on High performance Distributed Computing, 2004..

[8]  Kavitha Ranganathan,et al.  Identifying Dynamic Replication Strategies for a High-Performance Data Grid , 2001, GRID.