A cascading Latin scheme to tolerate double disk failures in raid architectures

In recent years, a lot of XOR-based coding schemes have been developed to tolerate double disk failures in Redundant Array of Independent Disks (RAID) architectures, such as EVENODD-code, X-code, B-code and BG-HEDP. Despite those researches, the decades-old strategy of Reed-Solomon (RS) code remains the only popular space-optimal Maximum Distance Separable (MDS) code for all but the smallest storage systems. The reason is that all those XOR-based schemes are too difficult to be implemented, it mainly because the coding-circle of those codes vary with the number of disks. By contrast, the coding-circle of RS code is a constant. In order to solve this problem, we develop a new MDS code named Latin code and a cascading scheme based on Latin code. The cascading Latin scheme is a nearly MDS code (with only one or two more parity disks compared with the MDS ones). Nevertheless, it keeps the coding-circle of the basic Latin code (i.e. a constant) and the low encoding/ decoding complexity similar to other parity array codes.

[1]  Gang Wang,et al.  Combinatorial Constructions of Multi-erasure-Correcting Codes with Independent Parity Symbols for Storage Systems , 2007, 13th Pacific Rim International Symposium on Dependable Computing (PRDC 2007).

[2]  Hai Jin,et al.  The EVENODD Code and its Generalization: An Efficient Scheme for Tolerating Multiple Disk Failures in RAID Architectures , 2002 .

[3]  Daniel J. Costello,et al.  Error Control Coding, Second Edition , 2004 .

[4]  Shu Lin,et al.  Error Control Coding , 2004 .

[5]  James Lee Hafner,et al.  WEAVER codes: highly fault tolerant erasure codes for storage systems , 2005, FAST'05.

[6]  Jehoshua Bruck,et al.  X-Code: MDS Array Codes with Optimal Encoding , 1999, IEEE Trans. Inf. Theory.

[7]  Randy H. Katz,et al.  Coding techniques for handling failures in large disk arrays , 2005, Algorithmica.

[8]  Liu Jing,et al.  Generalizing RDP Codes Using the Combinatorial Method , 2008, 2008 Seventh IEEE International Symposium on Network Computing and Applications.

[9]  Lihao Xu,et al.  Optimizing Cauchy Reed-Solomon Codes for Fault-Tolerant Network Storage Applications , 2006, Fifth IEEE International Symposium on Network Computing and Applications (NCA'06).

[10]  C. Colbourn,et al.  Handbook of Combinatorial Designs , 2006 .

[11]  James Lee Hafner,et al.  HoVer Erasure Codes For Disk Arrays , 2006, International Conference on Dependable Systems and Networks (DSN'06).

[12]  Gang Wang,et al.  Combinatorial Constructions of Multi-erasure-Correcting Codes with Independent Parity Symbols for Storage Systems , 2007 .

[13]  Mario Blaum,et al.  New array codes for multiple phased burst correction , 1993, IEEE Trans. Inf. Theory.

[14]  Gang Wang,et al.  Constructing double- and triple-erasure-correcting codes with high availability using mirroring and parity approaches , 2007, 2007 International Conference on Parallel and Distributed Systems.

[15]  Ahmad Patooghy,et al.  A Low-Power and SEU-Tolerant Switch Architecture for Network on Chips , 2007 .

[16]  James S. Plank The RAID-6 Liberation Codes , 2008, FAST.

[17]  Ian M. Wanless Perfect Factorisations of Bipartite Graphs and Latin Squares Without Proper Subrectangles , 1999, Electron. J. Comb..

[18]  James S. Plank A tutorial on Reed-Solomon coding for fault-tolerance in RAID-like systems , 1997 .

[19]  Jehoshua Bruck,et al.  EVENODD: An Efficient Scheme for Tolerating Double Disk Failures in RAID Architectures , 1995, IEEE Trans. Computers.

[20]  Cheng Huang,et al.  STAR : An Efficient Coding Scheme for Correcting Triple Storage Node Failures , 2005, IEEE Transactions on Computers.