Adaptive replication strategy based on popular content in cloud computing

The cloud is an infrastructure that provides decentralized on-demand services. It allows consumers to pay only for the services they use. The consumer is the important entity in the cloud. The violation of the SLA contract between the consumer and the provider often leads to consequences because the service provider has to pay penalties. Data replication is emerging as an ideal solution to meet the new challenges of the cloud. This paper proposes a new replication strategy based on the popularity of data. This strategy adaptively selects the files to be replicated to improve the overall availability of data in the system, minimize query response time, and achieve the required quality of service. In addition, it dynamically determines the number of replicas to add and the best locations to store them. Experimental results show the effectiveness of the proposed strategy.

[1]  Najme Mansouri,et al.  An Effective Weighted Data Replication Strategy for Data Grid , 2012 .

[2]  Philippe Merle,et al.  Elasticity in Cloud Computing: State of the Art and Research Challenges , 2018, IEEE Transactions on Services Computing.

[3]  Xiangke Liao,et al.  Developing the Cloud-integrated data replication framework in decentralized online social networks , 2016, J. Comput. Syst. Sci..

[4]  Ahmad Noraziah,et al.  Overview of Replication Techniques on Distributed Database in Cloud Environment , 2017 .

[5]  Shang Gao,et al.  Modeling a Dynamic Data Replication Strategy to Increase System Availability in Cloud Computing Environments , 2012, Journal of Computer Science and Technology.

[6]  Rajkumar Buyya,et al.  A Taxonomy of Software-Defined Networking (SDN)-Enabled Cloud Computing , 2018, ACM Comput. Surv..

[7]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[8]  Victor I. Chang,et al.  A model to compare cloud and non-cloud storage of Big Data , 2016, Future Gener. Comput. Syst..

[9]  Manish B. Gudadhe,et al.  Performance Analysis Survey of Data Replication Strategies in Cloud Environment , 2017, ICBDR 2017.

[10]  Riad Mokadem,et al.  Data Replication in Cloud Systems: A Survey , 2017, Int. J. Inf. Syst. Soc. Chang..

[11]  Belabbas Yagoubi,et al.  Dynamic Replication Based on a Data Classification Model in Cloud Computing , 2020, MISC.

[12]  Li He,et al.  A novel predicted replication strategy in cloud storage , 2018, The Journal of Supercomputing.

[13]  GhemawatSanjay,et al.  The Google file system , 2003 .

[14]  Abdelkader Hameurlain,et al.  Ensuring performance and provider profit through data replication in cloud systems , 2017, Cluster Computing.

[15]  Peter Scheuermann,et al.  File Assignment in Parallel I/O Systems with Minimal Variance of Service Time , 2000, IEEE Trans. Computers.

[16]  Ritu Aggarwal,et al.  Resource Provisioning and Resource Allocation in Cloud Computing Environment , 2018 .

[17]  Tarek Hamrouni,et al.  Data popularity measurements in distributed systems: Survey and design directions , 2016, J. Netw. Comput. Appl..

[18]  Keqin Li,et al.  Power and performance management for parallel computations in clouds and data centers , 2016, J. Comput. Syst. Sci..

[19]  Abdelkader Hameurlain,et al.  A data replication strategy with tenant performance and provider economic profit guarantees in Cloud data centers , 2020, J. Syst. Softw..

[20]  Richard Boateng,et al.  Cloud computing research: A review of research themes, frameworks, methods and future research directions , 2018, Int. J. Inf. Manag..

[21]  Julia Myint,et al.  Management of Data Replication for PC Cluster-based Cloud Storage System , 2011, CloudCom 2011.

[22]  Wei Chen,et al.  MORM: A Multi-objective Optimized Replication Management strategy for cloud storage cluster , 2014, J. Syst. Archit..

[23]  Sherali Zeadally,et al.  Performance analysis of data intensive cloud systems based on data management and replication: a survey , 2016, Distributed and Parallel Databases.

[24]  Mohammad Kazem Akbari,et al.  An effective model for store and retrieve big health data in cloud computing , 2016, Comput. Methods Programs Biomed..

[25]  Mohammad Ubaidullah Bokhari,et al.  A Survey on Cloud Computing , 2018 .

[26]  Najme Mansouri Adaptive data replication strategy in cloud computing for performance improvement , 2016, Frontiers of Computer Science.

[27]  Rajkumar Buyya,et al.  A Taxonomy and Future Directions for Sustainable Cloud Computing , 2017, ACM Comput. Surv..

[28]  Rajkumar Buyya,et al.  Dynamic replication and migration of data objects with hot-spot and cold-spot statuses across storage data centers , 2019, J. Parallel Distributed Comput..

[29]  Najib A. Kofahi,et al.  Identifying the Top Threats in Cloud Computing and Its Suggested Solutions: A Survey , 2018 .

[30]  Albert Y. Zomaya,et al.  Energy-efficient data replication in cloud computing datacenters , 2013, 2013 IEEE Globecom Workshops (GC Wkshps).

[31]  Nima Jafari Navimipour,et al.  A comprehensive review of the data replication techniques in the cloud environments: Major trends and future directions , 2016, J. Netw. Comput. Appl..

[32]  Ruay-Shiung Chang,et al.  A dynamic data replication strategy using access-weights in data grids , 2008, The Journal of Supercomputing.

[33]  Tarek Hamrouni,et al.  Adaptive measurement method for data popularity in distributed systems , 2016, Cluster Computing.

[34]  Riad Mokadem,et al.  Data replication strategy with satisfaction of availability, performance and tenant budget requirements , 2019, Cluster Computing.