SEA: A Striping-Based Energy-Aware Strategy for Data Placement in RAID-Structured Storage Systems

Many real-world applications need to frequently access data stored on large-scale parallel disk storage systems. On one hand, prompt responses to access requests are essential for these applications. On the other hand, with the explosive growth of data volume and the emergence of faster disks with higher power requirements, the energy consumption of disk-based storage systems has become a salient issue. To achieve energy conservation and prompt responses simultaneously, in this paper we propose a novel energy-aware strategy, called striping-based energy-aware (SEA), which can be integrated into data placement in RAID-structured storage systems to noticeably save energy while providing quick responses. To illustrate the effectiveness of SEA, we implement two SEA-powered striping-based data placement algorithms, SEA0 and SEA5, by incorporating the SEA strategy into RAID-0 and RAID-5, respectively. Extensive experimental results demonstrate that, compared with traditional non-striping data placement algorithms, our algorithms significantly improve performance and save energy. Further, compared with an existing striping-based data placement scheme, the two SEA-powered algorithms noticeably reduce energy consumption with only a slight performance degradation.
