Models for replica synchronisation and consistency in a data grid

Data grids have been proposed as a solution to large-scale data management problems, including efficient file transfer and replication. The large volume of data and the world-wide distribution of data stores add to the complexity of the data management challenge. Recent architecture proposals and prototypes handle the replication of read-only files but do not address the replica synchronisation problem. We propose a new data grid service, the Grid Consistency Service (GCS), that sits on top of existing data grid services and provides replica update synchronisation and consistency maintenance. We give models for the different levels of consistency offered to the Grid user and discuss how they can be incorporated into a replica consistency service for a data grid.
