Exploiting Redundancies and Deferred Writes to Conserve Energy in Erasure-Coded Storage Clusters

We present a power-efficient scheme for erasure-coded storage clusters---ECS<sup>2</sup>---which aims to offer high energy efficiency with marginal reliability degradation. ECS<sup>2</sup> utilizes data redundancies and deferred writes to conserve energy. In ECS<sup>2</sup> parity blocks are buffered exclusively in active data nodes whereas parity nodes are placed into low-power mode. (<i>k</i> + <i>r</i>, <i>k</i>) RS-coded ECS<sup>2</sup> can achieve ⌈(<i>r</i> + 1)/2⌉-fault tolerance for <i>k</i> active data nodes and <i>r</i>-fault tolerance for all <i>k</i> + <i>r</i> nodes. ECS<sup>2</sup> employs the following three optimizing approaches to improve the energy efficiency of storage clusters. (1) An adaptive threshold policy takes system configurations and I/O workloads into account to maximize standby time periods; (2) a selective activation policy minimizes the number of power-transitions in storage nodes; and (3) a region-based buffer policy speeds up the synchronization process by migrating parity blocks in a batch method. After implementing an ECS<sup>2</sup>-based prototype in a Linux cluster, we evaluated its energy efficiency and performance using four different types of I/O workloads. The experimental results indicate that compared to energy-oblivious erasure-coded storage, ECS<sup>2</sup> can save the energy used by storage clusters up to 29.8% and 28.0% in read-intensive and write-dominated workloads when k = 6 and r = 3, respectively. The results also show that ECS<sup>2</sup> accomplishes high power efficiency in both normal and failed cases without noticeably affecting the I/O performance of storage clusters.

[1]  Song Jiang,et al.  SmartSaver: Turning Flash Drive into a Disk Energy Saver for Mobile Computers , 2006, ISLPED'06 Proceedings of the 2006 International Symposium on Low Power Electronics and Design.

[2]  Garth A. Gibson,et al.  DiskReduce: RAID for data-intensive scalable computing , 2009, PDSW '09.

[3]  Gregory R. Ganger,et al.  Towards higher disk head utilization: extracting free bandwidth from busy disk drives , 2000, OSDI.

[4]  Antony I. T. Rowstron,et al.  Write off-loading: Practical power management for enterprise storage , 2008, TOS.

[5]  Christoforos E. Kozyrakis,et al.  On the energy (in)efficiency of Hadoop clusters , 2010, OPSR.

[6]  Xiaosong Ma,et al.  Does erasure coding have a role to play in my data center , 2010 .

[7]  Jeffrey F. Naughton,et al.  On energy management, load balancing and replication , 2010, SGMD.

[8]  Dirk Grunwald,et al.  Massive Arrays of Idle Disks For Storage Archives , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[9]  Thomas F. Wenisch,et al.  PowerNap: eliminating server idle power , 2009, ASPLOS.

[10]  Daniel P. Siewiorek,et al.  Architectures and algorithms for on-line failure recovery in redundant disk arrays , 1994, Distributed and Parallel Databases.

[11]  Peter F. Corbett,et al.  Row-Diagonal Parity for Double Disk Failure Correction (Awarded Best Paper!) , 2004, USENIX Conference on File and Storage Technologies.

[12]  Catherine D. Schuman,et al.  A Performance Evaluation and Examination of Open-Source Erasure Coding Libraries for Storage , 2009, FAST.

[13]  Darrell D. E. Long,et al.  A Spin-Up Saved Is Energy Earned: Achieving Power-Efficient, Erasure-Coded Storage , 2008, HotDep.

[14]  Jehoshua Bruck,et al.  EVENODD: An Efficient Scheme for Tolerating Double Disk Failures in RAID Architectures , 1995, IEEE Trans. Computers.

[15]  John Kubiatowicz,et al.  Erasure Coding Vs. Replication: A Quantitative Comparison , 2002, IPTPS.

[16]  Mendel Rosenblum,et al.  Fast crash recovery in RAMCloud , 2011, SOSP.

[17]  Yuanyuan Zhou,et al.  Reducing Energy Consumption of Disk Storage Using Power-Aware Cache Management , 2004, 10th International Symposium on High Performance Computer Architecture (HPCA'04).

[18]  Srinivasan Seshan,et al.  Measurement and Analysis of TCP Throughput Collapse in Cluster-based Storage Systems , 2008, FAST.

[19]  Marcos K. Aguilera,et al.  Using erasure codes efficiently for storage in a distributed system , 2005, 2005 International Conference on Dependable Systems and Networks (DSN'05).

[20]  Qi Zhang,et al.  Characterization of storage workload traces from production Windows Servers , 2008, 2008 IEEE International Symposium on Workload Characterization.

[21]  James Lee Hafner,et al.  WEAVER codes: highly fault tolerant erasure codes for storage systems , 2005, FAST'05.

[22]  Lakshmi Ganesh,et al.  Optimizing Power Consumption in Large Scale Storage Systems , 2007, HotOS.

[23]  J. Plank Optimizing Cauchy Reed-Solomon Codes for Fault-Tolerant Storage Applications , 2005 .

[24]  Arkady Kanevsky,et al.  Are disks the dominant contributor for storage failures?: A comprehensive study of storage subsystem failure characteristics , 2008, TOS.

[25]  Jignesh M. Patel,et al.  Energy management for MapReduce clusters , 2010, Proc. VLDB Endow..

[26]  Scott A. Brandt,et al.  Reliability mechanisms for very large storage systems , 2003, 20th IEEE/11th NASA Goddard Conference on Mass Storage Systems and Technologies, 2003. (MSST 2003). Proceedings..

[27]  Shankar Pasupathy,et al.  An analysis of latent sector errors in disk drives , 2007, SIGMETRICS '07.

[28]  Jin Qian,et al.  PARAID: A gear-shifting power-aware RAID , 2007, TOS.

[29]  Dong Li,et al.  EERAID: energy efficient redundant and inexpensive disk array , 2004, EW 11.

[30]  Mehul A. Shah,et al.  Analyzing the energy efficiency of a database server , 2010, SIGMOD Conference.

[31]  Robert H. McDonald,et al.  Disk and Tape Storage Cost Models , 2007, Archiving Conference.

[32]  Karsten Schwan,et al.  Robust and flexible power-proportional storage , 2010, SoCC '10.

[33]  Ricardo Bianchini,et al.  Exploiting redundancy to conserve energy in storage systems , 2006, SIGMETRICS '06/Performance '06.

[34]  James S. Plank A tutorial on Reed-Solomon coding for fault-tolerance in RAID-like systems , 1997 .

[35]  Jun Wang,et al.  RIMAC: a novel redundancy-based hierarchical cache architecture for energy efficient, high performance storage systems , 2006, EuroSys.

[36]  James Lee Hafner,et al.  Reliability for Networked Storage Nodes , 2011, IEEE Transactions on Dependable and Secure Computing.

[37]  Yuanyuan Zhou,et al.  Hibernator: helping disk arrays sleep through the winter , 2005, SOSP '05.

[38]  GhemawatSanjay,et al.  The Google file system , 2003 .

[39]  Zhang Xiao Research on Energy Consumption of General Storage Network System , 2011 .

[40]  Ethan L. Miller,et al.  Pergamum: Replacing Tape with Energy Efficient, Reliable, Disk-Based Archival Storage , 2008, FAST.

[41]  Dong Li,et al.  eRAID: Conserving Energy in Conventional Disk-Based RAID System , 2008, IEEE Transactions on Computers.