Dynamic Replication Based on a Data Classification Model in Cloud Computing

Cloud Computing provides on demand resources for customers and enterprises to outsource their online activities efficiently and less expensively. However, the cloud environment is heterogeneous and very dynamic, storage node failures and increasing demands on data can lead to data unavailability situations leading to a decrease in quality of service. Cloud service providers face the challenge of ensuring maximum data availability and reliability. Replication of data to different nodes in the cloud has become the most common solution for achieving good performance in terms of load balancing, response time and availability. In this article, we propose a new dynamic replication strategy based on a data classification model that would adapt the replication process according to user behavior towards data. This strategy dynamically and adaptively creates the replicas necessary in order to obtain the desired performance such as, reduced response time and improved system availability while ensuring the quality of service. The solution also attempts to meet customer requirements by respecting the SLA contract. The CloudSim simulator was used to evaluate the proposed strategy and compare it to other strategies. The results obtained showed an improvement in the criteria studied in a satisfactory manner.

[1]  Najme Mansouri Adaptive data replication strategy in cloud computing for performance improvement , 2016, Frontiers of Computer Science.

[2]  Dan Feng,et al.  CDRM: A Cost-Effective Dynamic Replication Management Scheme for Cloud Storage Cluster , 2010, 2010 IEEE International Conference on Cluster Computing.

[3]  Richard Boateng,et al.  Cloud computing research: A review of research themes, frameworks, methods and future research directions , 2018, Int. J. Inf. Manag..

[4]  Albert Y. Zomaya,et al.  Energy-efficient data replication in cloud computing datacenters , 2013, 2013 IEEE Globecom Workshops (GC Wkshps).

[5]  GhemawatSanjay,et al.  The Google file system , 2003 .

[6]  Najib A. Kofahi,et al.  Identifying the Top Threats in Cloud Computing and Its Suggested Solutions: A Survey , 2018 .

[7]  Jianjing Shen,et al.  Two phase enhancing replica selection in cloud storage system , 2016, CCC 2016.

[8]  Riad Mokadem,et al.  Data replication strategy with satisfaction of availability, performance and tenant budget requirements , 2019, Cluster Computing.

[9]  Sherali Zeadally,et al.  Performance analysis of data intensive cloud systems based on data management and replication: a survey , 2016, Distributed and Parallel Databases.

[10]  Howard Gobioff,et al.  The Google file system , 2003, SOSP '03.

[11]  Mohammad Ubaidullah Bokhari,et al.  A Survey on Cloud Computing , 2018 .

[12]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[13]  Li He,et al.  A novel predicted replication strategy in cloud storage , 2018, The Journal of Supercomputing.

[14]  Abdelkader Hameurlain,et al.  A data replication strategy with tenant performance and provider economic profit guarantees in Cloud data centers , 2020, J. Syst. Softw..

[15]  Yun Yang,et al.  A Novel Cost-Effective Dynamic Data Replication Strategy for Reliability in Cloud Data Centres , 2011, 2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure Computing.

[16]  Ghalem Belalem,et al.  Dynamic Replication Based on Availability and Popularity in the Presence of Failures , 2012, J. Inf. Process. Syst..

[17]  Abdelkader Hameurlain,et al.  Ensuring performance and provider profit through data replication in cloud systems , 2017, Cluster Computing.

[18]  Rajkumar Buyya,et al.  Dynamic replication and migration of data objects with hot-spot and cold-spot statuses across storage data centers , 2019, J. Parallel Distributed Comput..

[19]  Shang Gao,et al.  Modeling a Dynamic Data Replication Strategy to Increase System Availability in Cloud Computing Environments , 2012, Journal of Computer Science and Technology.

[20]  Keqin Li,et al.  Power and performance management for parallel computations in clouds and data centers , 2016, J. Comput. Syst. Sci..