Candidate MDS Array Codes for Tolerating Three Disk Failures in RAID-7 Architectures

Current storage systems use RAID-5 and RAID-6 architectures to provide protection against one and two disk failures, respectively. However, as the size of storage system grows rapidly three concurrent disk failures are becoming more frequent. To cope up with three disk failure, we propose a new RAID level, i.e., RAID-7, for which three-column-erasure tolerating MDS array codes are needed. However, it is an open question as to which MDS array codes should be used for RAID-7. In this paper, we compare different array codes, which can be used in RAID-7 systems that require storage efficiency (the ratio of number of information symbols to encoding (or codeword) symbols) <= 0.5. The paper discusses three-column-erasure tolerating MDS array codes proposed in the literature namely, [5,2] 2 x 5 BP-XOR code, [6,3] 4 x 6 lowest-density array code, [6,3] 2 x 6 STAR code, [6,3] 4 x 6 generalized RDP code. The paper introduces a new three-column-erasure tolerating [6,3] 2 x 6 almost BP-XOR codes. We analyze annual failure rate, storage efficiency, worst case normalized encoding/update/repairing/read complexity, repair bandwidth, and number of buffers required for these codes. We also provide experimental results to understand the average case encoding and repairing complexity of BP-XOR, STAR, GRDP, and almost BP-XOR codes by implementing them in software. From our analysis and experimental results, we conclude that [6,3] 2 x 6 almost BP-XOR are best suited for RAID-7 systems with storage efficiency <= 0.5.

[1]  Mario Blaum A Family of MDS Array Codes with Minimal Number of Encoding Operations , 2006, 2006 IEEE International Symposium on Information Theory.

[2]  Peter F. Corbett,et al.  RAID triple parity , 2012, OPSR.

[3]  David A. Patterson,et al.  Designing Disk Arrays for High Data Reliability , 1993, J. Parallel Distributed Comput..

[4]  GhemawatSanjay,et al.  The Google file system , 2003 .

[5]  Peter F. Corbett,et al.  Row-Diagonal Parity for Double Disk Failure Correction (Awarded Best Paper!) , 2004, USENIX Conference on File and Storage Technologies.

[6]  Van-Anh Truong,et al.  Availability in Globally Distributed Storage Systems , 2010, OSDI.

[7]  Yongge Wang,et al.  Array BP-XOR codes for reliable cloud storage systems , 2013, 2013 IEEE International Symposium on Information Theory.

[8]  Alexander Rosa,et al.  One-factorizations of the complete graph - A survey , 1985, J. Graph Theory.

[9]  C. Walter Kryder's law. , 2005, Scientific American.

[10]  Mario Blaum,et al.  On Lowest Density MDS Codes , 1999, IEEE Trans. Inf. Theory.

[11]  Michael Luby,et al.  LT codes , 2002, The 43rd Annual IEEE Symposium on Foundations of Computer Science, 2002. Proceedings..

[12]  Mario Blaum,et al.  New array codes for multiple phased burst correction , 1993, IEEE Trans. Inf. Theory.

[13]  F. Moore,et al.  Polynomial Codes Over Certain Finite Fields , 2017 .

[14]  Alexandros G. Dimakis,et al.  Network Coding for Distributed Storage Systems , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[15]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[16]  Xubin He,et al.  A Comprehensive Analysis of XOR-Based Erasure Codes Tolerating 3 or More Concurrent Failures , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.

[17]  Alexander Vardy,et al.  MDS array codes with independent parity symbols , 1995, Proceedings of 1995 IEEE International Symposium on Information Theory.

[18]  Yongge Wang,et al.  On encoding symbol degrees of array BP-XOR codes , 2015, Cryptography and Communications.

[19]  Cheng Huang,et al.  STAR : An Efficient Coding Scheme for Correcting Triple Storage Node Failures , 2005, IEEE Transactions on Computers.

[20]  Ron M. Roth,et al.  Lowest density MDS codes over extension alphabets , 2003, IEEE Transactions on Information Theory.

[21]  Hong Jiang,et al.  An Improved Decoding Algorithm for Generalized RDP Codes , 2016, IEEE Communications Letters.

[22]  Yongge Wang,et al.  Edge-colored graphs with applications to homogeneous faults , 2011, Inf. Process. Lett..

[23]  Hong Jiang,et al.  A Comprehensive Study on RAID-6 Codes: Horizontal vs. Vertical , 2011, 2011 IEEE Sixth International Conference on Networking, Architecture, and Storage.

[24]  Jehoshua Bruck,et al.  EVENODD: An Efficient Scheme for Tolerating Double Disk Failures in RAID Architectures , 1995, IEEE Trans. Computers.

[25]  Yongge Wang,et al.  Privacy-Preserving Data Storage in Cloud Using Array BP-XOR Codes , 2015, IEEE Transactions on Cloud Computing.