Declustered disk array architectures with optimal and near-optimal parallelism

This paper investigates the placement of data and parity on redundant disk arrays. Declustered organizations have been traditionally used to achieve fast reconstruction of a failed disk's contents. In previous work, Holland and Gibson identified six desirable properties for ideal layouts; however no declustered layout satisfying all properties has been published in the literature. We present a complete, constructive characterization of the collection of ideal declustered layouts possessing all six properties. Given that ideal layouts exist only for a limited set of configurations, we also present two novel layout families. PRIME and RELPR can tolerate multiple failures in a wide variety of configurations with slight deviations from the ideal. Our simulation studies show that the new layouts provide excellent parallel access performance and reduced incremental loads during degraded operation, when compared with previously published layouts. For large accesses and under high loads, response times for the new layouts are typically smaller than those of previously published declustered layouts by a factor of 2.5.

[1]  Eric J. Schwabe,et al.  Improved parity-declustered layouts for disk arrays , 1994, SPAA '94.

[2]  Haim Hanani,et al.  Balanced incomplete block designs and related designs , 1975, Discret. Math..

[3]  Eric J. Schwabe,et al.  Improved Parity-Declustered Layouts for Disk Arrays , 1996, J. Comput. Syst. Sci..

[4]  Peter M. Chen,et al.  Striping in a RAID level 5 disk array , 1995, SIGMETRICS '95/PERFORMANCE '95.

[5]  Garth A. Gibson,et al.  Parity declustering for continuous operation in redundant disk arrays , 1992, ASPLOS V.

[6]  Jehoshua Bruck,et al.  EVENODD: An Efficient Scheme for Tolerating Double Disk Failures in RAID Architectures , 1995, IEEE Trans. Computers.

[7]  Randy H. Katz,et al.  Coding techniques for handling failures in large disk arrays , 2005, Algorithmica.

[8]  Jie Li,et al.  Reliability analysis of disk array organizations by considering uncorrectable bit errors , 1997, Proceedings of SRDS'97: 16th IEEE Symposium on Reliable Distributed Systems.

[9]  Flaviu Cristian,et al.  Tolerating Multiple Failures in RAID Architectures with Optimal Storage and Uniform Declustering , 1997, ISCA.

[10]  John C. S. Lui,et al.  Performance Analysis of Disk Arrays under Failure , 1990, VLDB.

[11]  Jim Zelenka,et al.  File server scaling with network-attached secure disks , 1997, SIGMETRICS '97.

[12]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[13]  Andrew Tomkins,et al.  Informed multi-process prefetching and caching , 1997, SIGMETRICS '97.

[14]  Flaviu Cristian,et al.  Declustered disk array architectures with optimal and near-optimal parallelism , 1998, ISCA.

[15]  Walter A. Burkhard,et al.  Disk array storage system reliability , 1993, FTCS-23 The Twenty-Third International Symposium on Fault-Tolerant Computing.

[16]  Jim Zelenka,et al.  A structured approach to redundant disk array implementation , 1996, Proceedings of IEEE International Computer Performance and Dependability Symposium.

[17]  Randy H. Katz,et al.  Performance consequences of parity placement in disk arrays , 1991, ASPLOS IV.