Fast Reconstruction for Large Disk Enclosures Based on RAID2.0

In the era of explosive data growth, RAID2.0 architecture with dozens or even hundreds of disks is commonly used to provide large capacity data storage. Due to limited resources, such as memory and CPU, the reconstruction for disk failures in RAID2.0 is executed in batches. Traditional random data placement and recovery scheme make the I/O access highly skewed within a batch, which slows down the reconstruction speed. We propose DR-RAID, an efficient reconstruction scheme that balances local rebuilding workloads across all surviving disks within a batch. Instead of executing tasks sequentially from all pending recovery tasks, we dynamically select a batch of tasks with an almost balanced read load. Furthermore, we transform the problem of distributing the reconstructed data to surviving disks into a bipartite graph model, and achieve uniform write load distribution by finding a maximum matching in the graph. DR-RAID can be applied in a large disk pool with homogeneous or heterogeneous rebuilding bandwidth. We implemented DR-RAID into a pool of 50 disks and conducted extensive experiments. DR-RAID increases the rebuilding throughput by up to 61.90% compared with the random data placement scheme in offline rebuilding, and up to 59.28% with varied rebuilding bandwidth. DR-RAID accelerates rebuilding by effectively eliminating the local load imbalance within a batch, and greatly shortens the period of interference to user requests.

[1]  Qing Yang,et al.  ${\rm S}^{2}$-RAID: Parallel RAID Architecture for Fast Data Recovery , 2013, IEEE Transactions on Parallel and Distributed Systems.

[2]  Albert G. Greenberg,et al.  Scarlett: coping with skewed content popularity in mapreduce clusters , 2011, EuroSys '11.

[3]  Hong Jiang,et al.  WorkOut: I/O Workload Outsourcing for Boosting RAID Reconstruction Performance , 2009, FAST.

[4]  Weimin Zheng,et al.  Determining Data Distribution for Large Disk Enclosures with 3-D Data Templates , 2019, ACM Trans. Storage.

[5]  ZhangGuangyan,et al.  Determining Data Distribution for Large Disk Enclosures with 3-D Data Templates , 2020 .

[6]  3 PAR Fast RAID : High Performance Without Compromise , 2010 .

[7]  Kannan Ramchandran,et al.  EC-Cache: Load-Balanced, Low-Latency Cluster Caching with Online Erasure Coding , 2016, OSDI.

[8]  Gregory R. Ganger,et al.  Towards higher disk head utilization: extracting free bandwidth from busy disk drives , 2000, OSDI.

[9]  Hong Jiang,et al.  PRO: A Popularity-based Multi-threaded Reconstruction Optimization for RAID-Structured Storage Systems , 2007, FAST.

[10]  Norman Biggs,et al.  Combinatorics and Graph Theory , 2007 .

[11]  Antony I. T. Rowstron,et al.  Write off-loading: Practical power management for enterprise storage , 2008, TOS.

[12]  Gregory R. Ganger,et al.  Association Proceedings of the Third USENIX Conference on File and Storage Technologies San Francisco , CA , USA March 31 – April 2 , 2004 , 2004 .

[13]  Si Wu,et al.  OI-RAID: A Two-Layer RAID Architecture towards Fast Recovery and High Reliability , 2016, 2016 46th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN).

[14]  Carlos Maltzahn,et al.  Ceph: a scalable, high-performance distributed file system , 2006, OSDI '06.

[15]  Garth A. Gibson,et al.  Parity declustering for continuous operation in redundant disk arrays , 1992, ASPLOS V.

[16]  John C. S. Lui,et al.  Performance Analysis of Disk Arrays under Failure , 1990, VLDB.

[17]  Xiaosong Ma,et al.  RAID+: Deterministic and Balanced Data Distribution for Large Disk Enclosures , 2018, FAST.

[18]  Gregory R. Ganger,et al.  Argon: Performance Insulation for Shared Storage Servers , 2007, FAST.

[19]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[20]  Scott Shenker,et al.  Tachyon: Reliable, Memory Speed Storage for Cluster Computing Frameworks , 2014, SoCC.

[21]  Michael Grosskopf,et al.  Extreme Protection Against Data Loss with Single-Overlap Declustered Parity , 2020, 2020 50th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN).