A Stripe-Oriented Write Performance Optimization for RAID-Structured Storage Systems

In modern RAID-structured storage systems, reliability is guaranteed by parity blocks. However, the parity-update overhead incurred on each write request has become a performance bottleneck. Some existing methods attach a log disk and improve write performance by delaying parity-block updates. These methods are data-block-oriented, though, and need more time to rebuild or synchronize the RAID system when a data disk or the log disk fails. In this paper, we propose a novel stripe-oriented optimization method, called SWO, which improves both RAID write performance and reconstruction performance. When handling a write request, SWO also uses the log information to choose between reconstruction-write and read-modify-write, further minimizing the number of pre-read data blocks. We have implemented an SWO prototype and measured its performance using IOmeter and RAIDmeter. For comparison, we have implemented the main idea of RAID6L in RAID5 and call it RAID5L. We have also evaluated the reconstruction time and the synchronization time of SWO. Our experiments demonstrate that SWO significantly improves write performance and takes less time than Data Logging and RAID5L to rebuild and synchronize.
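The choice between reconstruction-write and read-modify-write can be illustrated by counting the pre-reads each mode needs for a single RAID5 stripe. The sketch below is not the SWO implementation; it is a minimal illustration under stated assumptions. The function names and the `logged` parameter are hypothetical, and `logged` is assumed to be the set of data-block indices whose required contents can be served from the log without a disk pre-read.

```python
def preread_counts(n_data, written, logged=frozenset()):
    """Return (rmw, rcw) pre-read counts for one RAID5 stripe.

    n_data  -- number of data blocks in the stripe
    written -- indices of the data blocks being overwritten
    logged  -- ASSUMPTION: indices whose needed contents are
               available from the log, so no disk pre-read is needed
    """
    written = set(written)
    logged = set(logged)
    # Read-modify-write: old contents of each written block, plus old parity.
    rmw = len(written - logged) + 1
    # Reconstruction-write: every data block that is not being overwritten.
    untouched = set(range(n_data)) - written
    rcw = len(untouched - logged)
    return rmw, rcw


def choose_write_mode(n_data, written, logged=frozenset()):
    """Pick the mode with fewer pre-reads (ties go to reconstruction-write)."""
    rmw, rcw = preread_counts(n_data, written, logged)
    if rcw <= rmw:
        return "reconstruction-write", rcw
    return "read-modify-write", rmw


if __name__ == "__main__":
    # Small write on a 4-data-block stripe, nothing in the log.
    print(choose_write_mode(4, {0}))
    # Same write, but the other three blocks are assumed available from the log.
    print(choose_write_mode(4, {0}, logged={1, 2, 3}))
```

Under these assumptions, a small write that would normally favor read-modify-write can switch to reconstruction-write once enough of the untouched blocks are available from the log.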

[1]  Hong Jiang,et al.  RAID6L: A log-assisted RAID6 storage architecture with improved write performance , 2011, 2011 IEEE 27th Symposium on Mass Storage Systems and Technologies (MSST).

[2]  Eran Gabber,et al.  Data logging: a method for efficient data updates in constantly active RAIDs , 1998, Proceedings 14th International Conference on Data Engineering.

[3]  Hong Jiang,et al.  PRO: A Popularity-based Multi-threaded Reconstruction Optimization for RAID-Structured Storage Systems , 2007, FAST.

[4]  Arkady Kanevsky,et al.  Are disks the dominant contributor for storage failures?: A comprehensive study of storage subsystem failure characteristics , 2008, TOS.

[5]  Hong Jiang,et al.  TRIP: Temporal Redundancy Integrated Performance Booster for Parity-Based RAID Storage Systems , 2010, 2010 IEEE 16th International Conference on Parallel and Distributed Systems.

[6]  Jaishankar Moothedath Menon,et al.  A performance comparison of RAID-5 and log-structured arrays , 1995, Proceedings of the Fourth IEEE International Symposium on High Performance Distributed Computing.

[7]  Masaru Kitsuregawa,et al.  Hot mirroring: a method of hiding parity update penalty and degradation during rebuilds for RAID5 , 1996, SIGMOD '96.

[8]  Dan Feng,et al.  HerpRap: A Hybrid Array Architecture Providing Any Point-in-Time Data Tracking for Datacenter , 2012, 2012 IEEE International Conference on Cluster Computing.

[9]  Carl Staelin,et al.  The HP AutoRAID hierarchical storage system , 1995, SOSP.

[10]  Stefan Savage,et al.  AFRAID - A Frequently Redundant Array of Independent Disks , 1996, USENIX Annual Technical Conference.

[11]  Hong Jiang,et al.  HPDA: A hybrid parity-based disk array for enhanced performance and reliability , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[12]  Dan Feng,et al.  HRAID6ML: A hybrid RAID6 storage architecture with mirrored logging , 2012, 2012 IEEE 28th Symposium on Mass Storage Systems and Technologies (MSST).

[13]  Garth A. Gibson,et al.  Parity declustering for continuous operation in redundant disk arrays , 1992, ASPLOS V.

[14]  Hai Jin,et al.  Parity Logging Overcoming the Small Write Problem in Redundant Disk Arrays , 2002.

[15]  Jiguang Wan,et al.  A new high-performance, energy-efficient replication storage system with reliability guarantee , 2012, 2012 IEEE 28th Symposium on Mass Storage Systems and Technologies (MSST).

[16]  Masaru Kitsuregawa,et al.  Dynamic parity stripe reorganizations for RAID5 disk arrays , 1994, Proceedings of 3rd International Conference on Parallel and Distributed Information Systems.

[17]  Ying Chen,et al.  Logging RAID - An Approach to Fast, Reliable, and Low-Cost Disk Arrays , 2000, Euro-Par.

[18]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[19]  Yiming Hu,et al.  DCD—disk caching disk: a new approach for boosting I/O performance , 1996, ISCA '96.