Update Propagation Technique for Data Grid

Data replication is a well known technique used to reduce accesses latency, improve availability, and performance in a distributed computing environment. An asynchronous replication is a commonly agreed solution for the consistency of replicas problem. Update propagation using a classical propagation schema called the radial method suffers from high overhead of the master replica while line method suffers from high delay time. This paper presents a new asynchronous replication protocol called Update Propagation Grid (UPG) which especially for a wide area distributed Data Grid. Updates reach other replicas using a propagation technique based on nodes organized into a logical structure network that enables the technique to scale well for thousands of replicas. Restructuring operation is provided to build and reconfigure the UPG dynamically. An analytical model is developed; communication cost, average load balance, and average delay time have been analyzed. The technique achieves load balancing and minimizes the delay for file replication in Data Grid.

[1]  Flavia Donno,et al.  Relaxed data consistency with CONStanza , 2006, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06).

[2]  Ian T. Foster,et al.  Data management and transfer in high-performance computational grid environments , 2002, Parallel Comput..

[3]  Erwin Laure,et al.  Replica Management in Data Grids , 2002 .

[4]  Dennis Shasha,et al.  The dangers of replication and a solution , 1996, SIGMOD '96.

[5]  Heinz Stockinger,et al.  Grid Data Management Pilot (GDMP): A Tool for Wide Area Replication , 2001 .

[6]  Javier Jaén Martínez,et al.  Models for replica synchronisation and consistency in a data grid , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[7]  Boleslaw K. Szymanski,et al.  Decentralized data management framework for Data Grids , 2007, Future Gener. Comput. Syst..

[8]  Zhiwei Xu,et al.  Grid replication coherence protocol , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[9]  Jaechun No,et al.  Data Replication Techniques for Data-Intensive Applications , 2006, International Conference on Computational Science.

[10]  Ian T. Foster,et al.  The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets , 2000, J. Netw. Comput. Appl..

[11]  Flavia Donno,et al.  Replica Management Services in the European DataGrid Project , 2004 .

[12]  Ruay-Shiung Chang,et al.  Adaptable Replica Consistency Service for Data Grids , 2006, Third International Conference on Information Technology: New Generations (ITNG'06).

[13]  Flavia Donno,et al.  Replica Management in the European DataGrid Project , 2004, Journal of Grid Computing.

[14]  Flavia Donno,et al.  Replica Consistency in a Data Grid , 2004 .

[15]  Rajkumar Buyya,et al.  A taxonomy of Data Grids for distributed data sharing, management, and processing , 2005, CSUR.

[16]  Mohan Kumar,et al.  An efficient update propagation algorithm for P2P systems , 2007, Comput. Commun..

[17]  Peter Z. Kunszt,et al.  Giggle: A Framework for Constructing Scalable Replica Location Services , 2002, ACM/IEEE SC 2002 Conference (SC'02).