ASSER: An Efficient, Reliable, and Cost-Effective Storage Scheme for Object-Based Cloud Storage Systems

High reliability, efficient I/O performance and flexible consistency provided with low storage cost are all desirable properties of cloud storage systems. Due to the inherent conflicts, however, simultaneously achieving optimum on all these properties is impractical. N-way Replication and Erasure Coding, two extensively-applied storage schemes with high reliability, adopt opposite and unbalanced strategies on the tradeoff among these properties, thus considerably restraining their effectiveness on wide range of workloads. To address the aforementioned obstacle, we propose a novel storage scheme called ASSER, an ASSembling chain of Erasure coding and Replication. ASSER stores each object in two parts: a full copy and a certain amount of erasure-coded segments. We establish dedicated read/write protocols for ASSER leveraging the unique structural advantages. On the basis of elementary protocols, we implement sequential and PRAM (Pipeline-RAM) consistency to make ASSER feasible for various services with different performance/consistency requirements. Evaluation results demonstrate that under the same fault tolerance and consistency level, ASSER outperforms N-way replication and pure erasure coding in I/O throughput under diverse system and workload configurations with superior performance stability. More importantly, ASSER delivers stably efficient I/O performance at much lower storage cost than the other comparatives.

[1]  Jian Lin,et al.  Boosting Degraded Reads in Heterogeneous Erasure-Coded Storage Systems , 2015, IEEE Transactions on Computers.

[2]  Cheng Huang,et al.  Erasure Coding in Windows Azure Storage , 2012, USENIX Annual Technical Conference.

[3]  Leslie Lamport,et al.  How to Make a Multiprocessor Computer That Correctly Executes Multiprocess Programs , 2016, IEEE Transactions on Computers.

[4]  Cheng Huang,et al.  Rethinking erasure codes for cloud file systems: minimizing I/O for recovery and degraded reads , 2012, FAST.

[5]  Asser N. Tantawi,et al.  An analytical model for multi-tier internet services and its applications , 2005, SIGMETRICS '05.

[6]  Heng Zhang,et al.  Efficient and Available In-Memory KV-Store with Hybrid Erasure Coding and Replication , 2016, FAST.

[7]  Werner Vogels,et al.  Eventually consistent , 2008, CACM.

[8]  Ying Chen,et al.  Partial Selection: An Efficient Approach for QoS-Aware Web Service Composition , 2014, 2014 IEEE International Conference on Web Services.

[9]  F. Moore,et al.  Polynomial Codes Over Certain Finite Fields , 2017 .

[10]  Jianwei Yin,et al.  Workload Classification Model for Specializing Virtual Machine Operating System , 2013, 2013 IEEE Sixth International Conference on Cloud Computing.

[11]  William H. Sanders,et al.  Blackbox prediction of the impact of DVFS on end-to-end performance of multitier systems , 2010, PERV.

[12]  Suman Banerjee,et al.  An ensemble of replication and erasure codes for cloud file systems , 2013, 2013 Proceedings IEEE INFOCOM.

[13]  Joseph Pasquale,et al.  Analysis of Long-Running Replicated Systems , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[14]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[15]  Robbert van Renesse,et al.  Chain Replication for Supporting High Throughput and Availability , 2004, OSDI.

[16]  Jianzhong Huang,et al.  An Efficient I/O-Redirection-Based Reconstruction Scheme for Erasure-Coded Storage Clusters , 2015, IEEE Transactions on Computers.

[17]  Thomas Risse,et al.  Selecting skyline services for QoS-based web service composition , 2010, WWW '10.

[18]  M. Blaum,et al.  EVENODD: an optimal scheme for tolerating double disk failures in RAID architectures , 1994, Proceedings of 21 International Symposium on Computer Architecture.

[19]  Dimitris S. Papailiopoulos,et al.  XORing Elephants: Novel Erasure Codes for Big Data , 2013, Proc. VLDB Endow..

[20]  Robert Mateescu,et al.  Opening the Chrysalis: On the Real Repair Performance of MSR Codes , 2016, FAST.

[21]  GhemawatSanjay,et al.  The Google file system , 2003 .

[22]  Hong Jiang,et al.  RAID6L: A log-assisted RAID6 storage architecture with improved write performance , 2011, 2011 IEEE 27th Symposium on Mass Storage Systems and Technologies (MSST).

[23]  Christos Faloutsos,et al.  RainMon: an integrated approach to mining bursty timeseries monitoring data , 2012, KDD.

[24]  Ben Y. Zhao,et al.  OceanStore: an architecture for global-scale persistent storage , 2000, SIGP.

[25]  Jian Lin,et al.  Enabling Concurrent Failure Recovery for Regenerating-Coding-Based Storage Systems: From Theory to Practice , 2015, IEEE Transactions on Computers.

[26]  Ethan L. Miller,et al.  Pergamum: Replacing Tape with Energy Efficient, Reliable, Disk-Based Archival Storage , 2008, FAST.

[27]  Yang Tang,et al.  NCCloud: A Network-Coding-Based Storage System in a Cloud-of-Clouds , 2014, IEEE Transactions on Computers.

[28]  Ying Chen,et al.  Energy Efficient Dynamic Service Selection for Large-Scale Web Service Systems , 2014, 2014 IEEE International Conference on Web Services.

[29]  Van-Anh Truong,et al.  Availability in Globally Distributed Storage Systems , 2010, OSDI.

[30]  Stefan Savage,et al.  Total Recall: System Support for Automated Availability Management , 2004, NSDI.

[31]  S. Resnick Adventures in stochastic processes , 1992 .

[32]  Margo Seltzer,et al.  Trace-based analyses and optimizations for network storage servers , 2004 .

[33]  Masoud Ardakani,et al.  A Class of Binary Locally Repairable Codes , 2016, IEEE Transactions on Communications.

[34]  Kanchi Gopinath,et al.  Discovery of Application Workloads from Network File Traces , 2010, FAST.

[35]  Michael J. Freedman,et al.  Object Storage on CRAQ: High-Throughput Chain Replication for Read-Mostly Workloads , 2009, USENIX Annual Technical Conference.

[36]  I-Ling Yen,et al.  QoS-Driven Service Composition with Reconfigurable Services , 2013, IEEE Transactions on Services Computing.

[37]  Andrew S. Tanenbaum,et al.  Distributed systems: Principles and Paradigms , 2001 .

[38]  Antony I. T. Rowstron,et al.  Write off-loading: Practical power management for enterprise storage , 2008, TOS.

[39]  Prashant J. Shenoy,et al.  Analytical modeling for what-if analysis in complex cloud computing applications , 2013, PERV.

[40]  Gregory R. Ganger,et al.  Ursa minor: versatile cluster-based storage , 2005, FAST'05.

[41]  Patrick P. C. Lee,et al.  Parity logging with reserved space: towards efficient updates and recovery in erasure-coded clustered storage , 2014, FAST.