Modelling multi level consistency in erasure code based storage systems

In the internet era, cloud storage services are in high demand. Due to huge volumes of the present day big data, cloud service providers look for alternatives for data replication, which has been traditionally used for providing fault tolerance and availability. If three copies of data are maintained for ensuring availability, then there is 200% storage overhead in replication based storage systems. Erasure code based storage is proving itself the most suitable alternative to replication schemes. Hardware assisted encoding, decoding process and research outcomes on data updates from various research communities, indicate a promising future for erasure code based storage systems for hot data storage. In this paper, a new protocol is proposed which shows how to provide different types of consistency in erasure code based storage systems in concurrent data access scenarios where failures of components are anticipated. We have developed protocols and data structures for implementing the strong, eventual and monotonic types of consistencies. The proposed consistency model has been tested successfully and the results are promising.

[1]  Tanuj Ahuja,et al.  File System and Hadoop Distributed File System-An Analogy , 2015 .

[2]  Itzhak Tamo,et al.  A family of optimal locally recoverable codes , 2014, ISIT.

[3]  Xin Wang,et al.  Efficient Scheduling for Multi-Block Updates in Erasure Coding Based Storage Systems , 2018, IEEE Transactions on Computers.

[4]  Kevin Lee,et al.  Data Consistency Properties and the Trade-offs in Commercial Cloud Storage: the Consumers' Perspective , 2011, CIDR.

[5]  Rajkumar Buyya,et al.  Data Storage Management in Cloud Environments , 2017, ACM Comput. Surv..

[6]  Nihar B. Shah,et al.  Optimal Exact-Regenerating Codes for Distributed Storage at the MSR and MBR Points via a Product-Matrix Construction , 2010, IEEE Transactions on Information Theory.

[7]  Rajiv Gupta,et al.  Efficient sequential consistency via conflict ordering , 2012, ASPLOS XVII.

[8]  Nancy A. Lynch,et al.  A coded shared atomic memory algorithm for message passing architectures , 2014, 2014 IEEE 13th International Symposium on Network Computing and Applications.

[9]  Marcos K. Aguilera,et al.  Using erasure codes efficiently for storage in a distributed system , 2005, 2005 International Conference on Dependable Systems and Networks (DSN'05).

[10]  Alexander Reinefeld,et al.  Consistency and fault tolerance for erasure-coded distributed storage systems , 2012, DIDC '12.

[11]  Dimitris S. Papailiopoulos,et al.  XORing Elephants: Novel Erasure Codes for Big Data , 2013, Proc. VLDB Endow..

[12]  Xiao Qin,et al.  Revisiting Updating Schemes for Erasure-Coded In-Memory Stores , 2017, 2017 International Conference on Networking, Architecture, and Storage (NAS).

[13]  Kannan Ramchandran,et al.  A Solution to the Network Challenges of Data Recovery in Erasure-coded Distributed Storage Systems: A Study on the Facebook Warehouse Cluster , 2013, HotStorage.

[14]  Han Mao Kiah,et al.  Repairing Reed-Solomon Codes With Multiple Erasures , 2016, IEEE Transactions on Information Theory.

[15]  Ojus Thomas Lee,et al.  RAPID: A Fast Data Update Protocol in Erasure Coded Storage Systems for Big Data , 2017, 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID).

[16]  Amin Vahdat,et al.  Design and evaluation of a conit-based continuous consistency model for replicated services , 2002, TOCS.

[17]  Yang Tang,et al.  NCCloud: applying network coding for the storage repair in a cloud-of-clouds , 2012, FAST.

[18]  Nader Gemayel Analyzing Google File System and Hadoop Distributed File System , 2016 .

[19]  Nancy A. Lynch,et al.  A Layered Architecture for Erasure-Coded Consistent Distributed Storage , 2017, PODC.

[20]  Michael K. Reiter,et al.  Efficient Consistency for Erasure-coded Data via Versioning Servers (CMU-CS-03-127) , 2003 .

[21]  Yunnan Wu,et al.  A Survey on Network Codes for Distributed Storage , 2010, Proceedings of the IEEE.

[22]  Ojus Thomas Lee,et al.  Erasure coded storage systems for cloud storage — challenges and opportunities , 2016, 2016 International Conference on Data Science and Engineering (ICDSE).

[23]  Carlos Maltzahn,et al.  Ceph: a scalable, high-performance distributed file system , 2006, OSDI '06.

[24]  Cheng Huang,et al.  Erasure Coding in Windows Azure Storage , 2012, USENIX Annual Technical Conference.

[25]  A. Fleischmann Distributed Systems , 1994, Springer Berlin Heidelberg.

[26]  Guido Salvaneschi,et al.  Consistency Types for Safe and Efficient Distributed Programming , 2017, FTfJP@ECOOP.