iCostale: Adaptive Cost Optimization for Storage Clouds

The unprecedented volume of data generated by contemporary business users and consumers has created enormous data storage and management challenges. In order to control data storage cost, many users are moving their data to online storage clouds, and applying capacity usage reducing data transformation techniques like de-duplication, compression, and transcoding. These give rise to several challenges, such as which cloud to choose, and what data transformation techniques to apply for optimizing cost. This paper presents an integrated storage service called 'iCostale' that reduces the overall cost of data storage through automatic selection and placement of users' data into one of many storage clouds. Further, it intelligently transforms data based on its type, access frequency, transformation overhead, and the cost model of the storage cloud providers. We demonstrate the efficacy of iCostale through a series of micro- and application-level benchmarks. Our experimental results show that, through intelligent data placement and transformation, 'iCostale' can reduce overall cost of data storage by more than 50%.

[1]  Joachim Schaper,et al.  Cloud Services , 2010, 4th IEEE International Conference on Digital Ecosystems and Technologies.

[2]  Matei Ripeanu,et al.  Amazon S3 for science grids: a viable solution? , 2008, DADC '08.

[3]  Calton Pu,et al.  Fine-Grain Adaptive Compression in Dynamically Variable Networks , 2005, 25th IEEE International Conference on Distributed Computing Systems (ICDCS'05).

[4]  Chandra Krintz,et al.  ACE: a resource-aware adaptive compression environment , 2003, Proceedings ITCC 2003. International Conference on Information Technology: Coding and Computing.

[5]  冯海超 Windows Azure:微软押上未来 , 2012 .

[6]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[7]  Bingsheng He,et al.  Distributed Systems Meet Economics: Pricing in the Cloud , 2010, HotCloud.

[8]  Chris Rose,et al.  A Break in the Clouds: Towards a Cloud Definition , 2011 .

[9]  David Mosberger,et al.  httperf—a tool for measuring web server performance , 1998, PERV.

[10]  Michael Vrable,et al.  Cumulus: Filesystem backup to the cloud , 2009, TOS.

[11]  Shankar Pasupathy,et al.  Maximizing Efficiency by Trading Storage for Computation , 2009, HotCloud.

[12]  Simson L. Garfinkel,et al.  An Evaluation of Amazon's Grid Computing Services: EC2, S3, and SQS , 2007 .

[13]  Jon B. Weissman,et al.  Using Proxies to Accelerate Cloud Applications , 2009, HotCloud.

[14]  Xiaowei Yang,et al.  CloudCmp: Shopping for a Cloud Made Easy , 2010, HotCloud.

[15]  Frank Leymann,et al.  Web Services , 2004, Informatik-Spektrum.

[16]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).