Improving reliability and energy efficiency of disk systems via utilization control

As disk drives become increasingly sophisticated and processing power increases, one of the most critical issues of designing modern disk systems is data reliability. Although numerous energy saving techniques are available for disk systems, most of energy conservation techniques are not effective in reliability critical environments due to their limitation of ignoring the reliability issue. A wide range of factors affect the reliability of disk systems; the most important factors - disk utilization and ages - are the focus of this study. We build a model to quantify the relationship among the disk age, utilization, and failure probabilities. Observing that the reliability of a disk heavily relies on both disk utilization and age, we propose a novel concept of safe utilization zone, where energy of the disk can be conserved without degrading reliability. We investigate an approach to improving both reliability and energy efficiency of disk systems via utilization control, where disk drives are operated in safe utilization zones to minimize the probability of disk failure. In this study, we integrate an existing energy consumption technique that operates the disks at different power modes with our proposed reliability approach. Experimental results show that our approach can significantly improve reliable while achieving high energy efficiency for disk systems.

[1]  Rami Melhem,et al.  The effects of energy management on reliability in real-time embedded systems , 2004, ICCAD 2004.

[2]  Yifeng Zhu,et al.  Evaluating memory energy efficiency in parallel I/O workloads , 2007, 2007 IEEE International Conference on Cluster Computing.

[3]  Peter J. Varman,et al.  DiskGroup: Energy Efficient Disk Layout for RAID1 Systems , 2007, 2007 International Conference on Networking, Architecture, and Storage (NAS 2007).

[4]  Arvind Krishnamurthy,et al.  Modeling Hard-Disk Power Consumption , 2003, FAST.

[5]  Ricardo Bianchini,et al.  Conserving disk energy in network servers , 2003, ICS '03.

[6]  Darrell D. E. Long,et al.  A dynamic disk spin-down technique for mobile computing , 1996, MobiCom '96.

[7]  J. Sikora Disk failures in the real world : What does an MTTF of 1 , 000 , 000 hours mean to you ? , 2007 .

[8]  S. Shah,et al.  Server class disk drives: how reliable are they? , 2004, Annual Symposium Reliability and Maintainability, 2004 - RAMS.

[9]  L. Benini,et al.  Dynamic power management of laptop hard disk , 2000, Proceedings Design, Automation and Test in Europe Conference and Exhibition 2000 (Cat. No. PR00537).

[10]  Anand Sivasubramaniam,et al.  Dynamic Speed Control for Server Class Disks , 2003 .

[11]  Mark S. Squillante,et al.  Models of Parallel Applications with Large Computation and I/O Requirements , 2002, IEEE Trans. Software Eng..

[12]  Mahmut T. Kandemir,et al.  Exposing disk layout to compiler for reducing energy consumption of parallel disk based systems , 2005, PPOPP.

[13]  Mahmut T. Kandemir,et al.  Compiler-Directed Energy Optimization for Parallel Disk Based Systems , 2007, IEEE Transactions on Parallel and Distributed Systems.

[14]  Eduardo Pinheiro,et al.  Failure Trends in a Large Disk Drive Population , 2007, FAST.

[15]  Christos G. Cassandras,et al.  Optimal Dynamic Voltage Scaling in Energy-Limited Nonpreemptive Systems with Real-Time Constraints , 2007, IEEE Transactions on Mobile Computing.

[16]  Yehua Du,et al.  Dynamic voltage scaling of flash memory storage systems for low-power real-time embedded systems , 2005, Second International Conference on Embedded Software and Systems (ICESS'05).

[17]  Dong Li,et al.  eRAID: Conserving Energy in Conventional Disk-Based RAID System , 2008, IEEE Transactions on Computers.