DR-nets: data-reconstruction networks for highly reliable parallel-disk systems

We propose DR-nets, Data-Reconstruction networks, to construct massively parallel disk systems with large capacity, wide bandwidth and high reliability. Each node of a DR-net has disks, and is connected by links to form an interconnection network. To realize the high reliability, nodes in a sub-network of the interconnection network organize a group of parity calculation proposed for RAIDs. Inter-node communication for calculating parity keeps the locality of data transfer in DR-nets, and it inhibits bottlenecks from occurring, even if the size of the network becomes very large. Overlapped two types of parity groups on the network make the system able to handle multiple disk-drive failures. A 5 × 5 torus DR-net recovers data 100% with two damaged disk drives located in any place, 95% with four damaged drives, and can recover with up to nine damaged drives.

[1]  John Wilkes The DataMesh research project , 1991 .

[2]  Dennis Tsichritzis,et al.  Audio/video databases: an object-oriented approach , 1993, Proceedings of IEEE 9th International Conference on Data Engineering.

[3]  横田 治夫,et al.  The performance of a highly reliable parallel disk system , 1994 .

[4]  Philip S. Yu,et al.  Design and modeling of clustered RAID , 1992, [1992] Digest of Papers. FTCS-22: The Twenty-Second International Symposium on Fault-Tolerant Computing.

[5]  Shivakumar Venkataraman,et al.  The TickerTAIP parallel RAID architecture , 1993, ISCA '93.

[6]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[7]  Randy H. Katz,et al.  Raid-ii: a scalable storage architecture for high-bandwidth network file service , 1992 .

[8]  H KatzRandy,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988 .

[9]  Srinivasan Seshan,et al.  RAID-II: a high-bandwidth network file server , 1994, ISCA '94.

[10]  John C. S. Lui,et al.  Performance Analysis of Disk Arrays under Failure , 1990, VLDB.