A new fuzzy based dynamic data replication algorithm in data grids

Data grid deals with data intensive applications, and provides the ability to access and manage data intensive resources across widely distributed communities. Data replication, through creating many replicas in different sites, reduces the data access time, increases reliability and fault tolerance, and also improves the performance of systems. Here, through improving the modified BHR (MBHR) method, we proposed a novel dynamic algorithm named fuzzy_rep for data replication in data grids. The algorithm uses a fuzzy interfere system for finding suitable site where the file may be required in the future with high probability. Based on file access history, the purposed algorithm predicts future needs of grid sites. The algorithm was tested using a grid simulator, OptorSim developed by European Data Grid Projects. The simulation results show that our proposed algorithm has better performance in comparison with other algorithms in terms of the job execution time and percentage of storage filled.

[1]  Won-Sik Yoon,et al.  Dynamic Data Grid Replication Strategy Based on Internet Hierarchy , 2003, GCC.

[2]  Kavitha Ranganathan,et al.  Simulation Studies of Computation and Data Scheduling Algorithms for Data Grids , 2003, Journal of Grid Computing.

[3]  Bostjan Slivnik,et al.  The complexity of static data replication in data grids , 2005, Parallel Comput..

[4]  Ayaz Isazadeh,et al.  PHFS: A dynamic replication method, to decrease access latency in the multi-tier data grid , 2011, Future Gener. Comput. Syst..

[5]  Boleslaw K. Szymanski,et al.  Simulation of dynamic data replication strategies in Data Grids , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[6]  Ming Tang,et al.  Dynamic replication algorithms for the multi-tier Data Grid , 2005, Future Gener. Comput. Syst..

[7]  Kurt Stockinger,et al.  Dynamic data replication in LCG 2008 , 2008 .

[8]  Boleslaw K. Szymanski,et al.  Decentralized data management framework for data grids , 2007 .

[9]  Jesús Carretero,et al.  Branch replication scheme: A new model for data replication in large scale data grids , 2010, Future Gener. Comput. Syst..

[10]  Ruay-Shiung Chang,et al.  Job scheduling and data replication on data grids , 2007, Future Gener. Comput. Syst..

[11]  Atakan Dogan,et al.  A study on performance of dynamic file replication algorithms for real-time file access in Data Grids , 2009, Future Gener. Comput. Syst..

[12]  Floriano Zini,et al.  Analysis of Scheduling and Replica Optimisation Strategies for Data Grids Using OptorSim , 2004, Journal of Grid Computing.

[13]  Ming Tang,et al.  The impact of data replication on job scheduling performance in the Data Grid , 2006, Future Gener. Comput. Syst..

[14]  Jie Xu,et al.  On Dynamic Replication Strategies in Data Service Grids , 2008, 2008 11th IEEE International Symposium on Object and Component-Oriented Real-Time Distributed Computing (ISORC).

[15]  Ming Tang,et al.  Combining Data Replication Algorithms and Job Scheduling Heuristics in the Data Grid , 2005, Euro-Par.

[16]  Antony Selvadoss Thanamani,et al.  Dynamic replication in a data grid using a Modified BHR Region Based Algorithm , 2011, Future Gener. Comput. Syst..

[17]  Kavitha Ranganathan,et al.  Design and Evaluation of Dynamic Replication Strategies for a High-Performance Data Grid , 2001 .