Analytic Modeling of Clustered RAID with Mapping Based on Nearly Random Permutation

A Redundant Array of Independent Disks (RAID) of G disks provides protection against single disk failures by adding one parity block for each G-1 data blocks. In a clustered RAID, the G data/parity blocks are distributed over a cluster of C disks (C>G), thus reducing the additional load on each disk due to a single disk failure. However, most methods proposed for implementing such a mapping do not work for general C and G values. In this paper, we describe a fast mapping algorithm based on almost-random permutations. An analytical model is constructed, based on the queue with a permanent customer, to predict recovery time and read/write performance. The accuracy of the results derived from this model is validated by comparing with simulations. Our analysis shows that clustered RAID is significantly more tolerant of disk failure than the basic RAID scheme. Both recovery time and performance degradation during recovery are substantially reduced in clustered RAID; moreover, these gains can be achieved using fairly small C/G ratios.

[1]  Spencer W. Ng,et al.  Uniform Parity Group Distribution in Disk Arrays with Multiple Failures , 1994, IEEE Trans. Computers.

[2]  Philip S. Yu,et al.  Design and modeling of clustered RAID , 1992, [1992] Digest of Papers. FTCS-22: The Twenty-Second International Symposium on Fault-Tolerant Computing.

[3]  John C. S. Lui,et al.  Performance Analysis of Disk Arrays under Failure , 1990, VLDB.

[4]  Garth A. Gibson,et al.  RAID: high-performance, reliable secondary storage , 1994, CSUR.

[5]  Marshall Hall,et al.  Combinatorial Theory: Hall/Combinatorial , 1988 .

[6]  Prithviraj Banerjee,et al.  Gracefully degradable disk arrays , 1991, [1991] Digest of Papers. Fault-Tolerant Computing: The Twenty-First International Symposium.

[7]  David A. Patterson,et al.  Computer Architecture: A Quantitative Approach , 1969 .

[8]  E. Thorp Nonrandom Shuffling with Applications to the Game of Faro , 1973 .

[9]  Leonard Kleinrock,et al.  Theory, Volume 1, Queueing Systems , 1975 .

[10]  Donald Ervin Knuth,et al.  The Art of Computer Programming , 1968 .

[11]  E. Gumbel,et al.  Statistics of extremes , 1960 .

[12]  Garth A. Gibson,et al.  Parity declustering for continuous operation in redundant disk arrays , 1992, ASPLOS V.

[13]  Philip S. Yu,et al.  Analytic Modeling and Comparisons of Striping Strategies for Replicated Disk Arrays , 1995, IEEE Trans. Computers.