Replica Management in the European DataGrid Project

Within the European DataGrid project, Work Package 2 has designed and implemented a set of integrated replica management services for use by data-intensive scientific applications. These services, based on the web services model, enable movement and replication of data at high speed from one geographical site to another, management of distributed replicated data, and optimization of access to data, and they provide a metadata management tool. In this paper we describe the architecture and implementation of these services and evaluate their performance under demanding Grid conditions.
