An efficient replicated data access approach for large-scale distributed systems

In data-intensive distributed systems, replication is the most widely used approach to providing high data availability, low bandwidth consumption, increased fault tolerance and improved scalability of the overall system. Replication-based systems implement replica control protocols that enforce specified semantics for accessing the data. The performance of such systems depends on many factors, chief among which is the protocol used to maintain consistency among object replicas. In this paper, we propose a new low-cost, high-availability protocol for maintaining replicated data on networked distributed computing systems. We show that, compared to standard replica control protocols, the proposed approach provides high data availability, low bandwidth consumption, increased fault tolerance and improved scalability of the overall system.
