Data replication strategy with satisfaction of availability, performance and tenant budget requirements

We propose a dynamic replication strategy that satisfies simultaneously availability and performance tenant requirements while taking into account the tenant budget and the provider profit. The proposed strategy is based on a cost model that aims to calculate the minimum number of replicas required to maintain a high data availability. A replica creation is triggered only when this number of replicas is not reached or when the response time objective is not satisfied. Then, the replication must be profitable for the provider when creating a new replica. Furthermore, data replication and query scheduling are coupled in order to place these replicas in a load balancing way while dealing with the tenant budget. The experiment results prove that the proposed strategy can significantly improve availability and performance while the tenant budget is taken into account.

[1]  Shang Gao,et al.  Modeling a Dynamic Data Replication Strategy to Increase System Availability in Cloud Computing Environments , 2012, Journal of Computer Science and Technology.

[2]  Margarida Mamede,et al.  PIXIDA: Optimizing Data Parallel Jobs in Wide-Area Data Analytics , 2015, Proc. VLDB Endow..

[3]  Thomas Ledoux,et al.  Cross-layer SLA Selection for Cloud Services , 2011, 2011 First International Symposium on Network Cloud Computing and Applications.

[4]  Faiez Zalila,et al.  Model-Driven Elasticity Management with OCCI , 2019, IEEE Transactions on Cloud Computing.

[5]  Rajkumar Buyya,et al.  Cost Optimization for Dynamic Replication and Migration of Data in Cloud Data Centers , 2019, IEEE Transactions on Cloud Computing.

[6]  Dan Feng,et al.  CDRM: A Cost-Effective Dynamic Replication Management Scheme for Cloud Storage Cluster , 2010, 2010 IEEE International Conference on Cluster Computing.

[7]  Abdelkader Hameurlain,et al.  A Performance and Profit Oriented Data Replication Strategy for Cloud Systems , 2016, 2016 Intl IEEE Conferences on Ubiquitous Intelligence & Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Cloud and Big Data Computing, Internet of People, and Smart World Congress (UIC/ATC/ScalCom/CBDCom/IoP/SmartWorld).

[8]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[9]  Patrick Valduriez,et al.  Principles of distributed database systems (2nd ed.) , 1999 .

[10]  Mohamed-K Hussein,et al.  A Light-weight Data Replication for Cloud DataCenters Environment , 2014 .

[11]  Hai Jin,et al.  RTRM: A Response Time-Based Replica Management Strategy for Cloud Storage System , 2013, GPC.

[12]  Thomas Ledoux,et al.  CSLA: A Language for Improving Cloud SLA Management , 2012, CLOSER.

[13]  Uras Tos,et al.  Data replication in large-scale data management systems. (Réplication de données dans les systèmes de gestion de données à grande échelle) , 2017 .

[14]  Wenzhong Guo,et al.  Data Replication Placement Strategy Based On Bidding Mode for Cloud Storage Cluster , 2014, 2014 11th Web Information System and Application Conference.

[15]  Ghalem Belalem,et al.  Towards Improving the Functioning of CloudSim Simulator , 2011, ICDIPC.

[16]  Flávio R. C. Sousa,et al.  Towards Elastic Multi-Tenant Database Replication with Quality of Service , 2012, 2012 IEEE Fifth International Conference on Utility and Cloud Computing.

[17]  Albert Y. Zomaya,et al.  Energy-efficient data replication in cloud computing datacenters , 2013, 2013 IEEE Globecom Workshops (GC Wkshps).

[18]  Nima Jafari Navimipour,et al.  A comprehensive review of the data replication techniques in the cloud environments: Major trends and future directions , 2016, J. Netw. Comput. Appl..

[19]  Won-Sik Yoon,et al.  Dynamic Data Grid Replication Strategy Based on Internet Hierarchy , 2003, GCC.

[20]  Patrick Valduriez,et al.  Principles of Distributed Database Systems , 1990 .

[21]  Jignesh M. Patel,et al.  Towards Multi-Tenant Performance SLOs , 2014, IEEE Trans. Knowl. Data Eng..

[22]  E. Deelman,et al.  Data replication strategies in grid environments , 2002, Fifth International Conference on Algorithms and Architectures for Parallel Processing, 2002. Proceedings..

[23]  Riad Mokadem,et al.  Data Replication in Cloud Systems: A Survey , 2017, Int. J. Inf. Syst. Soc. Chang..

[24]  Sarbjeet Singh,et al.  Dynamic Cost-Aware Re-replication and Rebalancing Strategy in Cloud System , 2014, FICTA.

[25]  Pierre Sens,et al.  AREN: A Popularity Aware Replication Scheme for Cloud Storage , 2012, 2012 IEEE 18th International Conference on Parallel and Distributed Systems.

[26]  Floriano Zini,et al.  Evaluation of an economy-based file replication strategy for a data grid , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[27]  Fang Dong,et al.  QoS Preference-Aware Replica Selection Strategy Using MapReduce-Based PGA in Data Grids , 2011, 2011 International Conference on Parallel Processing.

[28]  Abdelkader Hameurlain,et al.  Ensuring performance and provider profit through data replication in cloud systems , 2017, Cluster Computing.

[29]  Sherif Sakr,et al.  SLA-Based and Consumer-centric Dynamic Provisioning for Cloud Databases , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.

[30]  Haiying Shen,et al.  A popularity-aware cost-effective replication scheme for high data durability in cloud storage , 2016, 2016 IEEE International Conference on Big Data (Big Data).