RoLo: A Rotated Logging Storage Architecture for Enterprise Data Centers

We propose RoLo (Rotated Logging), a new logging architecture for RAID10 systems for enhanced energy efficiency, performance and reliability. By spreading destaging I/O activities among short idle time slots and proactively reclaiming the stale logging space, RoLo rotates loggers among a logical logging space pool formed collectively from the free storage space available among mirrored disks. Therefore, without the extra dedicated log disks and the corresponding centralized logging, RoLo eliminates the additional hardware and energy costs, potential single point of failure and performance bottleneck. Furthermore, RoLo prolongs the lifecycle of the disks and improves the system’s energy efficiency by reducing the disk spin up/down frequency. We develop three flavors of RoLo, that is, RoLo-E/R/P, to emphasize energy efficiency, reliability, and performance respectively. Extensive trace-driven evaluations demonstrate the advantages of the three RoLo schemes over both a RAID10 system with centralized logging architecture and a typical RAID10 system.

[1]  Qing Yang,et al.  DCD --- Disk Caching Disk: A New Approach for Boosting I/O Performance , 1996, 23rd Annual International Symposium on Computer Architecture (ISCA'96).

[2]  Antony I. T. Rowstron,et al.  Write off-loading: Practical power management for enterprise storage , 2008, TOS.

[3]  Yuanyuan Zhou,et al.  Hibernator: helping disk arrays sleep through the winter , 2005, SOSP '05.

[4]  Sriram Sankar,et al.  Intra-disk Parallelism: An Idea Whose Time Has Come , 2008, 2008 International Symposium on Computer Architecture.

[5]  Lakshmi Ganesh,et al.  Optimizing Power Consumption in Large Scale Storage Systems , 2007, HotOS.

[6]  Kang G. Shin,et al.  FS2: dynamic data replication in free disk space for improving disk performance and energy consumption , 2005, SOSP '05.

[7]  Jai Menon,et al.  The architecture of a fault-tolerant cached RAID controller , 1993, ISCA '93.

[8]  Jaishankar Moothedath Menon,et al.  A performance comparison of RAID-5 and log-structured arrays , 1995, Proceedings of the Fourth IEEE International Symposium on High Performance Distributed Computing.

[9]  Chanik Park,et al.  An adaptive high-low water mark destage algorithm for cached RAID5 , 2002, 2002 Pacific Rim International Symposium on Dependable Computing, 2002. Proceedings..

[10]  GhemawatSanjay,et al.  The Google file system , 2003 .

[11]  Jeffrey C. Mogul,et al.  A Better Update Policy , 1994, USENIX Summer.

[12]  Yuanyuan Zhou,et al.  Power-aware storage cache management , 2005, IEEE Transactions on Computers.

[13]  Qi Zhang,et al.  Efficient management of idleness in storage systems , 2009, TOS.

[14]  Shankar Pasupathy,et al.  An analysis of latent sector errors in disk drives , 2007, SIGMETRICS '07.

[15]  Jin Qian,et al.  PARAID: A gear-shifting power-aware RAID , 2007, TOS.

[16]  Carlos Maltzahn,et al.  Ceph: a scalable, high-performance distributed file system , 2006, OSDI '06.

[17]  Dharmendra S. Modha,et al.  WOW: wise ordering for writes - combining spatial and temporal locality in non-volatile caches , 2005, FAST'05.

[18]  Marina Alonso,et al.  A new destage algorithm for disk cache: DOME , 1999, Proceedings 25th EUROMICRO Conference. Informatics: Theory and Practice for the New Millennium.

[19]  Scott A. Brandt,et al.  Reliability mechanisms for very large storage systems , 2003, 20th IEEE/11th NASA Goddard Conference on Mass Storage Systems and Technologies, 2003. (MSST 2003). Proceedings..

[20]  Arvind Krishnamurthy,et al.  Modeling Hard-Disk Power Consumption , 2003, FAST.

[21]  Hong Jiang,et al.  GRAID: A Green RAID Storage Architecture with Improved Energy Efficiency and Reliability , 2008, 2008 IEEE International Symposium on Modeling, Analysis and Simulation of Computers and Telecommunication Systems.

[22]  Yiming Hu,et al.  DCD—disk caching disk: a new approach for boosting I/O performance , 1996, ISCA '96.

[23]  N BairavasundaramLakshmi,et al.  An analysis of latent sector errors in disk drives , 2007 .

[24]  Ying Chen,et al.  Logging RAID - An Approach to Fast, Reliable, and Low-Cost Disk Arrays , 2000, Euro-Par.

[25]  Gregory R. Ganger,et al.  The DiskSim Simulation Environment Version 4.0 Reference Manual (CMU-PDL-08-101) , 1998 .

[26]  Arkady Kanevsky,et al.  Are disks the dominant contributor for storage failures?: A comprehensive study of storage subsystem failure characteristics , 2008, TOS.

[27]  Mahmut T. Kandemir,et al.  DRPM: dynamic speed control for power management in server class disks , 2003, 30th Annual International Symposium on Computer Architecture, 2003. Proceedings..

[28]  Quinn Jacobson,et al.  Destage Algorithms for Disk Arrays with Nonvolatile Caches , 1998, IEEE Trans. Computers.

[29]  Mahmut T. Kandemir,et al.  Interplay of energy and performance for disk arrays running transaction processing workloads , 2003, 2003 IEEE International Symposium on Performance Analysis of Systems and Software. ISPASS 2003..

[30]  Daniel Stodolsky,et al.  Parity logging overcoming the small write problem in redundant disk arrays , 1993, ISCA '93.

[31]  Chen Wang,et al.  A database scale-out solution for emerging write-intensive commercial workloads , 2008, OPSR.

[32]  Randal C. Burns,et al.  AWOL: An Adaptive Write Optimizations Layer , 2008, FAST.

[33]  Mendel Rosenblum,et al.  The design and implementation of a log-structured file system , 1991, SOSP '91.

[34]  E. N. Elnozahy,et al.  Checkpointing for peta-scale systems: a look into the future of practical rollback-recovery , 2004, IEEE Transactions on Dependable and Secure Computing.