FTRAID: A Fat-tree Based Parallel Storage Architecture for Very Large Disk Array

Traditional disk arrays have a centralized architecture, with a single controller through which all requests flow. Such a controller is a single point of failure, and its performance limits the maximum number of disks to which the array can scale. Fat-trees are widely adopted as interconnection network topologies because of their many desirable properties. We propose FTRAID, a novel fat-tree based parallel RAID architecture. The unit of deployment in FTRAID is a storage node, a small rack-mounted storage appliance built from commodity components including CPU, RAM, NIC and disks. We also establish a stochastic Petri net (SPN) model of the storage node to analyze its utilization. To demonstrate the feasibility of the architecture, we have designed and implemented a seven-node FTRAID prototype. Numerical results obtained with a popular benchmark show a good performance gain.
