Two Efficient Partial-Updating Schemes for Erasure-Coded Storage Clusters

Nowadays, erasure codes have been widely used in data storage to achieve high fault-tolerance. However, compared with replica-based storage, erasure-coded system may suffer significant performance overhead in encoding, decoding and updating. Traditional updating schemes(e.g. DUM and PUM) use an individual manager node to accomplish the updating. In this paper, we propose two partial-updating schemes (i.e. PUM-P and PDN-P) to improve the small update in erasure coded storage clusters, where both schemes only read a portion of data, including the data blocks to be updated and the parity blocks, and utilize the calculation capacity of the storage nodes. We implement four updating algorithms (DUM, PUM, PUM-P and PDN-P) upon an erasure-coded storage cluster platform, and conduct a set of comparative tests under two real-world workloads with different fault-tolerance parameters. The experimental results shows that PUM-P and PDN-P can speed up the small updating by a factor of up to 1.62 and 2.72 compared with DUM under small update, respectively; and by a factor of up to 1.42 and 2.23 relative to PUM, respectively. We also validate that DUM scheme can achieve better updating performance than the other schemes within the large update scenario.

[1]  Marcos K. Aguilera,et al.  On the erasure recoverability of MDS codes under concurrent updates , 2005, Proceedings. International Symposium on Information Theory, 2005. ISIT 2005..

[2]  Marcos K. Aguilera,et al.  Using erasure codes efficiently for storage in a distributed system , 2005, 2005 International Conference on Dependable Systems and Networks (DSN'05).

[3]  Qi Zhang,et al.  Characterization of storage workload traces from production Windows Servers , 2008, 2008 IEEE International Symposium on Workload Characterization.

[4]  Michael K. Reiter,et al.  Efficient Byzantine-tolerant erasure-coded storage , 2004, International Conference on Dependable Systems and Networks, 2004.

[5]  Ju Wang,et al.  Windows Azure Storage: a highly available cloud storage service with strong consistency , 2011, SOSP.

[6]  James S. Plank The RAID-6 Liberation Codes , 2008, FAST.

[7]  Jehoshua Bruck,et al.  X-Code: MDS Array Codes with Optimal Encoding , 1999, IEEE Trans. Inf. Theory.

[8]  Lihao Xu,et al.  An efficient XOR-scheduling algorithm for erasure codes encoding , 2009, 2009 IEEE/IFIP International Conference on Dependable Systems & Networks.

[9]  Arif Merchant,et al.  A decentralized algorithm for erasure-coded virtual disks , 2004, International Conference on Dependable Systems and Networks, 2004.

[10]  Rüdiger L. Urbanke,et al.  Efficient encoding of low-density parity-check codes , 2001, IEEE Trans. Inf. Theory.

[11]  John Kubiatowicz,et al.  Erasure Coding Vs. Replication: A Quantitative Comparison , 2002, IPTPS.

[12]  Ethan L. Miller,et al.  Pergamum: Replacing Tape with Energy Efficient, Reliable, Disk-Based Archival Storage , 2008, FAST.

[13]  Peter F. Corbett,et al.  Row-Diagonal Parity for Double Disk Failure Correction (Awarded Best Paper!) , 2004, USENIX Conference on File and Storage Technologies.

[14]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[15]  Garth A. Gibson,et al.  DiskReduce: RAID for data-intensive scalable computing , 2009, PDSW '09.

[16]  Gregory R. Ganger,et al.  Towards higher disk head utilization: extracting free bandwidth from busy disk drives , 2000, OSDI.

[17]  Ben Y. Zhao,et al.  OceanStore: an architecture for global-scale persistent storage , 2000, SIGP.

[18]  Jehoshua Bruck,et al.  EVENODD: an optimal scheme for tolerating double disk failures in RAID architectures , 1994, ISCA '94.

[19]  James Lee Hafner,et al.  HoVer Erasure Codes For Disk Arrays , 2006, International Conference on Dependable Systems and Networks (DSN'06).

[20]  Xiaosong Ma,et al.  Does erasure coding have a role to play in my data center , 2010 .

[21]  Adam Wierman,et al.  Open Versus Closed: A Cautionary Tale , 2006, NSDI.

[22]  Catherine D. Schuman,et al.  A Performance Evaluation and Examination of Open-Source Erasure Coding Libraries for Storage , 2009, FAST.

[23]  James S. Plank,et al.  The Raid-6 Liber8Tion Code , 2009, Int. J. High Perform. Comput. Appl..

[24]  O. Antoine,et al.  Theory of Error-correcting Codes , 2022 .

[25]  GhemawatSanjay,et al.  The Google file system , 2003 .