Efficient Parity Update for Scaling RAID-like Storage Systems

It is inevitable to scale RAID systems with the increasing demand of storage capacity and I/O throughput. When scaling RAID systems, we will always need to update parity to maintain the reliability of the storage systems. There are two schemes, read-modify-write (RMW) and read-reconstruct-write (RCW), to update parity. However most existing scaling approaches simply use RMW to update parity. While in many scenarios for existing scaling approaches, RCW performs better in terms of the number of scaling I/Os. In this paper, we propose an algorithm, called EPU, to analyze which of RCW and RMW is better for a scaling scenario and select the more efficient one to save the scaling I/Os. We apply EPU to online scaling scenarios and further use two optimizations, I/O overlap and access aggregation, to enhance the online scaling performance. Using Scale-CRS, one of the existing scaling approaches, as an example, we show via numerical studies that Scale-CRS+EPU reduces the amount of scaling I/Os over the traditional Scale-CRS in many scaling cases. To justify the online efficiency of EPU, we implement both Scale-CRS+EPU and Scale-CRS in a simulator with Disksim as a working module. Through extensive experiments, we show that Scale-CRS+EPU reduces the scaling time and the online response time of user requests over Scale-CRS.

[1]  Peter F. Corbett,et al.  Row-Diagonal Parity for Double Disk Failure Correction (Awarded Best Paper!) , 2004, USENIX Conference on File and Storage Technologies.

[2]  Toni Cortes,et al.  Increasing the capacity of RAID5 by online gradual assimilation , 2004, SNAPI '04.

[3]  Si Wu,et al.  I/O-Efficient Scaling Schemes for Distributed Storage Systems with CRS Codes , 2016, IEEE Transactions on Parallel and Distributed Systems.

[4]  Chentao Wu,et al.  SDM: A Stripe-Based Data Migration Scheme to Improve the Scalability of RAID-6 , 2012, 2012 IEEE International Conference on Cluster Computing.

[5]  Jehoshua Bruck,et al.  EVENODD: An Efficient Scheme for Tolerating Double Disk Failures in RAID Architectures , 1995, IEEE Trans. Computers.

[6]  Rodrigo Schmidt,et al.  The Search for a Highly-Available Hadoop Distributed Filesystem , 2010 .

[7]  Ben Y. Zhao,et al.  Maintenance-Free Global Data Storage , 2001, IEEE Internet Comput..

[8]  Hong Jiang,et al.  P-Code: a new RAID-6 code with optimal properties , 2009, ICS '09.

[9]  Van-Anh Truong,et al.  Availability in Globally Distributed Storage Systems , 2010, OSDI.

[10]  Keqin Li,et al.  Rethinking RAID-5 Data Layout for Better Scalability , 2014, IEEE Transactions on Computers.

[11]  Weimin Zheng,et al.  FastScale: Accelerate RAID Scaling by Minimizing Data Migration , 2011, FAST.

[12]  Jehoshua Bruck,et al.  X-Code: MDS Array Codes with Optimal Encoding , 1999, IEEE Trans. Inf. Theory.

[13]  Gregory R. Ganger,et al.  The DiskSim Simulation Environment Version 4.0 Reference Manual (CMU-PDL-08-101) , 1998 .

[14]  Randy H. Katz,et al.  A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[15]  Xiao Qin,et al.  Scale-RS: An Efficient Scaling Scheme for RS-Coded Storage Clusters , 2015, IEEE Transactions on Parallel and Distributed Systems.

[16]  Chentao Wu,et al.  GSR: A Global Stripe-Based Redistribution Approach to Accelerate RAID-5 Scaling , 2012, 2012 41st International Conference on Parallel Processing.

[17]  Catherine D. Schuman,et al.  A Performance Evaluation and Examination of Open-Source Erasure Coding Libraries for Storage , 2009, FAST.

[18]  David A. Patterson,et al.  A Simple Way to Estimate the Cost of Downtime , 2002, LISA.

[19]  F. Moore,et al.  Polynomial Codes Over Certain Finite Fields , 2017 .

[20]  Marek Karpinski,et al.  An XOR-based erasure-resilient coding scheme , 1995 .

[21]  Ashish Goel,et al.  SCADDAR: an efficient randomized technique to reorganize continuous media blocks , 2002, Proceedings 18th International Conference on Data Engineering.

[22]  Jiwu Shu,et al.  ALV: A New Data Redistribution Approach to RAID-5 Scaling , 2010, IEEE Transactions on Computers.

[23]  Keqin Li,et al.  Accelerate RDP RAID-6 Scaling by Reducing Disk I/Os and XOR Operations , 2015, IEEE Transactions on Computers.

[24]  Shahram Ghandeharizadeh,et al.  On-line Reorganization of Data in Scalable Continuous Media Servers , 1996, DEXA.