A locality-based replication manager for data cloud

Efficient data management is a key issue for environments distributed on a large scale such as the data cloud. This can be taken into account by replicating the data. The replication of data reduces the time of service and the delay in availability, increases the availability, and optimizes the distribution of load in the system. It is worth mentioning, however, that with the replication of data, the use of resources and energy increases due to the storing of copies of the data. We suggest a replication manager that decreases the cost of using resources, energy, and the delay in the system, and also increases the availability of the system. To reach this aim, the suggested replication manager, called the locality replication manager (LRM), works by using two important algorithms that use the physical adjacency feature of blocks. In addition, a set of simulations are reported to show that LRM can be a suitable option for distributed systems as it uses less energy and resources, optimizes the distribution of load, and has more availability and less delay.

[1]  Lili Qiu,et al.  On the placement of Web server replicas , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[2]  Boleslaw K. Szymanski,et al.  Simulation of dynamic data replication strategies in Data Grids , 2003, Proceedings International Parallel and Distributed Processing Symposium.

[3]  Jianliang Xu,et al.  QoS-aware replica placement for content distribution , 2005, IEEE Transactions on Parallel and Distributed Systems.

[4]  GhemawatSanjay,et al.  The Google file system , 2003 .

[5]  Bin Tang,et al.  Data Replication in Data Intensive Scientific Applications with Performance Guarantee , 2011, IEEE Transactions on Parallel and Distributed Systems.

[6]  Christopher E. Dabrowski,et al.  Reliability in grid computing systems , 2009, Concurr. Comput. Pract. Exp..

[7]  Reda Alhajj,et al.  Replica placement design with static optimality and dynamic maintainability , 2006, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06).

[8]  Betty H. C. Cheng,et al.  Goal-Oriented Patterns for UML-Based Modeling of Embedded Systems Requirements , 2007 .

[9]  Hee Yong Youn,et al.  Dynamic hybrid replication effectively combining tree and grid topology , 2010, The Journal of Supercomputing.

[10]  Zhonghang Xia,et al.  A Secure and Scalable Update Protocol for P2P Data Grids , 2007 .

[11]  E. Rodney Canfield,et al.  Replication in Overlay Networks: A Multi-objective Optimization Approach , 2008, CollaborateCom.

[12]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[13]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[14]  Jinjun Chen,et al.  A Cost-Effective Mechanism for Cloud Data Reliability Management Based on Proactive Replica Checking , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[15]  Zhonghang Xia,et al.  A Secure and Scalable Update Protocol for P2P Data Grids , 2007, 10th IEEE High Assurance Systems Engineering Symposium (HASE'07).

[16]  Marios D. Dikaiakos,et al.  Cloud Computing: Distributed Internet Computing for IT and Scientific Research , 2009, IEEE Internet Computing.

[17]  Kavitha Ranganathan,et al.  Identifying Dynamic Replication Strategies for a High-Performance Data Grid , 2001, GRID.

[18]  Shahram Ghandeharizadeh,et al.  Near Optimal Number of Replicas for Continuous Media in Ad-hoc Networks of Wireless Devices , 2004, Multimedia Information Systems.

[19]  Ruay-Shiung Chang,et al.  A dynamic data replication strategy using access-weights in data grids , 2008, The Journal of Supercomputing.

[20]  Mache Creeger,et al.  Cloud Computing: An Overview , 2009, ACM Queue.

[21]  Xiaoyan Hong,et al.  An on-line replication strategy to increase availability in Data Grids , 2008, Future Gener. Comput. Syst..

[22]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[23]  Karl Aberer,et al.  Dynamic cost-efficient replication in data clouds , 2009, ACDC '09.

[24]  Yun Yang,et al.  A Novel Cost-Effective Dynamic Data Replication Strategy for Reliability in Cloud Data Centres , 2011, 2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure Computing.

[25]  Bin Tang,et al.  Benefit-Based Data Caching in Ad Hoc Networks , 2008, IEEE Trans. Mob. Comput..

[26]  Dan Feng,et al.  CDRM: A Cost-Effective Dynamic Replication Management Scheme for Cloud Storage Cluster , 2010, 2010 IEEE International Conference on Cluster Computing.

[27]  Atakan Dogan,et al.  A study on performance of dynamic file replication algorithms for real-time file access in Data Grids , 2009, Future Gener. Comput. Syst..

[28]  Deborah Estrin,et al.  Directed diffusion: a scalable and robust communication paradigm for sensor networks , 2000, MobiCom '00.