Data layout for power efficient archival storage systems

Legacy archival workloads have a typical write-once-read-never pattern, which fits well for tape based archival systems. With the emergence of newer applications like Facebook, Yahoo! Flickr, Apple iTunes, demand for a new class of archives has risen, where archived data continues to get accessed, albeit at lesser frequency and relaxed latency requirements. We call these types of archival storage systems as active archives. However, keeping archived data on always spinning storage media to fulfill occasional read requests is not practical due to significant power costs. Using spin-down disks, having better latency characteristics as compared to tapes, for active archives can save significant power. In this paper, we present a two-tier architecture for active archives comprising of online and offline disks, and provide an access-aware intelligent data layout mechanism to bring power efficiency. We validate the proposed mechanism with real-world archival traces. Our results indicate that the proposed clustering and optimized data layout algorithms save upto 78% power over random placement.

[1]  Atish Kathpal,et al.  Nakshatra: Towards Running Batch Analytics on an Archive , 2014, 2014 IEEE 22nd International Symposium on Modelling, Analysis & Simulation of Computer and Telecommunication Systems.

[2]  Sanjeev Kumar,et al.  Finding a Needle in Haystack: Facebook's Photo Storage , 2010, OSDI.

[3]  Darrell D. E. Long,et al.  A Spin-Up Saved Is Energy Earned: Achieving Power-Efficient, Erasure-Coded Storage , 2008, HotDep.

[4]  Xiao Qin,et al.  Exploiting Redundancies and Deferred Writes to Conserve Energy in Erasure-Coded Storage Clusters , 2013, TOS.

[5]  Ricardo Bianchini,et al.  Conserving disk energy in network servers , 2003, ICS '03.

[6]  Yao Sun,et al.  Sacrificing Reliability for Energy Saving: Is it worthwhile for disk arrays? , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.

[7]  Yuanyuan Zhou,et al.  Hibernator: helping disk arrays sleep through the winter , 2005, SOSP '05.

[8]  Tobias Achterberg,et al.  SCIP: solving constraint integer programs , 2009, Math. Program. Comput..

[9]  Antony I. T. Rowstron,et al.  Pelican: A Building Block for Exascale Cold Data Storage , 2014, OSDI.

[10]  Ethan L. Miller,et al.  An Economic Perspective of Disk vs. Flash Media in Archival Storage , 2014, 2014 IEEE 22nd International Symposium on Modelling, Analysis & Simulation of Computer and Telecommunication Systems.

[11]  André Brinkmann,et al.  Analysis of the ECMWF Storage Landscape , 2015, FAST.