Swift/RAID: A Distributed RAID

The Swift I/O architecture is designed to provide high data rates in support of multimedia-type applications in general-purpose distributed environments through the use of distributed striping. Striping techniques place sections of a single logical data space onto multiple physical devices. The original Swift prototype was designed to validate the architecture, but did not provide fault tolerance. We have implemented a new prototype of the Swift architecture that provides fault tolerance in the distributed environment in the same manner as RAID levels 4 and 5. RAID (Redundant Arrays of Inexpensive Disks) techniques have recently been widely used to increase both the performance and the fault tolerance of disk storage systems. The new Swift/RAID implementation manages all communication using a distributed transfer plan executor, which isolates all communication code from the rest of Swift. The transfer plan executor is implemented as a distributed finite state machine that decodes and executes a set of reliable data-transfer operations. This approach enables us to easily investigate alternative architectures and communications protocols.
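As an illustration of the striping and parity ideas mentioned above, the following is a minimal sketch in Python. It is not the Swift/RAID implementation: the function names (`xor_blocks`, `stripe_with_parity`, `reconstruct`), the in-memory lists standing in for storage devices, and the zero padding are all assumptions made for the example. The parity block is kept in the last position of every stripe, which corresponds to a RAID level 4 style layout; a RAID level 5 style layout would instead rotate the parity position from stripe to stripe.

```python
from functools import reduce


def xor_blocks(blocks):
    """Return the byte-wise XOR of a list of equally sized blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def stripe_with_parity(data, n_data, block_size):
    """Split data into stripes of n_data blocks plus one XOR parity block.

    Each stripe is a list of n_data + 1 blocks; in a real system every
    position in the list would live on a different device (hypothetical
    layout, parity always last as in RAID level 4).
    """
    stripe_bytes = n_data * block_size
    # Pad with zero bytes so the data fills a whole number of stripes.
    padded = data + bytes(-len(data) % stripe_bytes)
    stripes = []
    for offset in range(0, len(padded), stripe_bytes):
        blocks = [
            padded[offset + i * block_size: offset + (i + 1) * block_size]
            for i in range(n_data)
        ]
        stripes.append(blocks + [xor_blocks(blocks)])
    return stripes


def reconstruct(stripe, lost):
    """Rebuild the block at index `lost` by XORing the surviving blocks."""
    survivors = [block for i, block in enumerate(stripe) if i != lost]
    return xor_blocks(survivors)


if __name__ == "__main__":
    payload = b"Swift/RAID stripes data"
    stripes = stripe_with_parity(payload, n_data=3, block_size=4)
    # Pretend the device holding block 1 of the first stripe has failed.
    rebuilt = reconstruct(stripes[0], lost=1)
    print(rebuilt == stripes[0][1])
```

Because XOR is its own inverse, any single missing block in a stripe, data or parity, can be recovered from the remaining blocks; this is the property that lets a single-parity scheme tolerate the loss of one device.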
