Tertiary Disk: Large Scale Distributed Storage

Over the past five years, disk costs have fallen by a factor of 2 per year, and terabyte-capacity disk storage systems are now feasible. Given rapidly increasing areal density and disk transfer rates, these systems will have significant cost/performance advantages over tape libraries of similar capacity. Built from commodity hardware, large disk systems can avoid both the high cost of custom-designed disk arrays and their limits on scalability. This paper presents Tertiary Disk, a 3TB disk storage system built from commodity hardware. Tertiary Disk uses PCs and switched networks to connect 370 8GB disks. We show that, even though it is built from commodity hardware, the overall system can be more reliable than a single disk. A cost analysis of our prototype shows that the additional infrastructure needed to create a terabyte-scale storage system costs only a fraction of what the underlying disks cost. In comparison, large disk arrays cost many times as much as their underlying disks. We also present performance measurements from our prototype and show that the PC architecture is a good match for hosting a large number of disks. Overall, we show that storage system designs like Tertiary Disk have cost/performance and reliability advantages over most choices available today for terabyte-scale storage.
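As a quick check of the figures quoted in the abstract, the following minimal sketch works through the arithmetic. The function names and the decimal conversion (1 TB = 1000 GB) are assumptions made for illustration only; the result is raw capacity and does not account for any space that redundancy might consume.

```python
# Figures taken from the abstract.
DISK_COUNT = 370          # number of disks in the prototype
DISK_CAPACITY_GB = 8      # capacity of each disk, in GB
ANNUAL_COST_FACTOR = 2    # disk cost falls by this factor each year
YEARS = 5                 # period over which the trend is quoted


def raw_capacity_tb(disk_count, disk_capacity_gb, gb_per_tb=1000):
    """Raw capacity in TB (gb_per_tb=1000 is an assumed decimal convention)."""
    return disk_count * disk_capacity_gb / gb_per_tb


def cumulative_cost_reduction(annual_factor, years):
    """Total factor by which cost falls when it drops by annual_factor each year."""
    return annual_factor ** years


if __name__ == "__main__":
    # 370 disks x 8 GB = 2960 GB, which rounds to the quoted 3TB.
    print(raw_capacity_tb(DISK_COUNT, DISK_CAPACITY_GB))               # 2.96
    # Halving every year for five years compounds to a 32x reduction.
    print(cumulative_cost_reduction(ANNUAL_COST_FACTOR, YEARS))        # 32
```

Under these assumptions, the 370 disks provide about 2.96 TB of raw capacity, consistent with the stated 3TB, and a factor-of-2 annual decline compounds to a 32-fold cost reduction over five years.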
