A popularity-aware cost-effective replication scheme for high data durability in cloud storage

Cloud storage system usually experiences data loss, hindering data durability. Three-way random replication is commonly used to prevent data loss in cloud storage systems. However, it cannot effectively handle correlated machine failures. Although Copyset Replication and Tiered Replication can reduce data loss in correlated and independent failures and enhance data durability, they fail to leverage different data popularities to substantially reduce the storage cost and bandwidth cost caused by replication. To address these issues, we present a popularity-aware multi-failure resilient and cost-effective replication (PM-CR) scheme for high data durability in cloud storage. PMCR splits the cloud storage system into primary tier and backup tier, and classifies data into hot data, warm data and cold data based on data popularities. To handle both correlated and independent failures, PMCR stores the three replicas of the same data into one Copyset formed by two servers in the primary tier and one server in the backup tier. For the third replicas of warm data and cold data in the backup tier, PMCR uses the Similar Compression method for read-intensive data and uses the Delta Compression method for write-intensive data to reduce storage cost and bandwidth cost. As a result, these costs are reduced and data durability and availability are enhanced without compromising data request delay greatly. Extensive experiment results based on trace parameters show that PMCR achieves high data durability, low probability of data loss, and low storage cost and bandwidth cost compared to previous replication schemes.

[1]  Ben Y. Zhao,et al.  OceanStore: an architecture for global-scale persistent storage , 2000, SIGP.

[2]  David R. Karger,et al.  Wide-area cooperative storage with CFS , 2001, SOSP.

[3]  GhemawatSanjay,et al.  The Google file system , 2003 .

[4]  Muriel Medard,et al.  How good is random linear coding based distributed networked storage , 2005 .

[5]  Michael Dahlin,et al.  TAPER: tiered approach for eliminating redundancy in replica synchronization , 2005, FAST'05.

[6]  Srinivasan Seshan,et al.  Subtleties in Tolerating Correlated Failures in Wide-area Storage Systems , 2006, NSDI.

[7]  Andreas Haeberlen,et al.  Efficient Replica Maintenance for Distributed Storage Systems , 2006, NSDI.

[8]  Ming Zhong,et al.  Replication degree customization for high availability , 2008, Eurosys '08.

[9]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[10]  Ben Y. Zhao,et al.  Exploiting locality of interest in online social networks , 2010, CoNEXT.

[11]  Karl Aberer,et al.  A self-organized, fault-tolerant and scalable replication scheme for cloud storage , 2010, SoCC '10.

[12]  Ju Wang,et al.  Windows Azure Storage: a highly available cloud storage service with strong consistency , 2011, SOSP.

[13]  Albert G. Greenberg,et al.  Scarlett: coping with skewed content popularity in mapreduce clusters , 2011, EuroSys '11.

[14]  Murali S. Kodialam,et al.  Frugal storage for cloud file systems , 2012, EuroSys '12.

[15]  冯海超 Windows Azure:微软押上未来 , 2012 .

[16]  Philip Shilane,et al.  WAN-optimized replication of backup datasets using stream-informed delta compression , 2012, TOS.

[17]  Irene Zhang Reducing the Frequency of Data Loss in Cloud Storage , 2013 .

[18]  Sachin Katti,et al.  Copysets: Reducing the Frequency of Data Loss in Cloud Storage , 2013, USENIX Annual Technical Conference.

[19]  Antony I. T. Rowstron,et al.  Pelican: A Building Block for Exascale Cold Data Storage , 2014, OSDI.

[20]  Anne-Marie Kermarrec,et al.  Archiving cold data in warehouses with clustered network coding , 2014, EuroSys '14.

[21]  Fred Douglis,et al.  Migratory compression: coarse-grained data reordering to improve compressibility , 2014, FAST.

[22]  Kang Chen,et al.  MobileCopy: Resisting correlated node failures to enhance data availability in DTNs , 2015, 2015 12th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON).

[23]  Kang Chen,et al.  Exploiting Active Sub-Areas for Multi-Copy Routing in VDTNs , 2015, 2015 24th International Conference on Computer Communication and Networks (ICCCN).

[24]  Komal Shringare,et al.  Apache Hadoop Goes Realtime at Facebook , 2015 .

[25]  Emin Gün Sirer,et al.  Tiered Replication: A Cost-effective Alternative to Full Cluster Geo-replication , 2015, USENIX Annual Technical Conference.

[26]  Jian Yang,et al.  Mojim: A Reliable and Highly-Available Non-Volatile Memory System , 2015, ASPLOS.

[27]  Lei Yu,et al.  Characterizing data deliverability of greedy routing in wireless sensor networks , 2015, SECON.

[28]  Ethan L. Miller,et al.  Purity: Building Fast, Highly-Available Enterprise Flash Storage from Commodity Components , 2015, SIGMOD Conference.

[29]  Jianwei Liu,et al.  SCPS: A Social-Aware Distributed Cyber-Physical Human-Centric Search Engine , 2015, IEEE Transactions on Computers.

[30]  Husnu S. Narman,et al.  CCRP: Customized cooperative resource provisioning for high resource utilization in clouds , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[31]  Haiying Shen,et al.  A Low-Cost Multi-failure Resilient Replication Scheme for High Data Availability in Cloud Storage , 2016, 2016 IEEE 23rd International Conference on High Performance Computing (HiPC).

[32]  Haiying Shen,et al.  Dependency-Aware and Resource-Efficient Scheduling for Heterogeneous Jobs in Clouds , 2016, 2016 IEEE International Conference on Cloud Computing Technology and Science (CloudCom).

[33]  Haiying Shen,et al.  CORP: Cooperative Opportunistic Resource Provisioning for Short-Lived Jobs in Cloud Systems , 2016, 2016 IEEE International Conference on Cluster Computing (CLUSTER).

[34]  Kang Chen,et al.  DIAL: A Distributed Adaptive-Learning Routing Method in VDTNs , 2016, 2016 IEEE First International Conference on Internet-of-Things Design and Implementation (IoTDI).

[35]  Mashrur Chowdhury,et al.  An Efficient Wireless Power Transfer System to Balance the State of Charge of Electric Vehicles , 2016, 2016 45th International Conference on Parallel Processing (ICPP).

[36]  Haiying Shen,et al.  A Survey of Mobile Crowdsensing Techniques: A Critical Component for the Internet of Things , 2016, ICCCN.

[37]  Hongxin Hu,et al.  Load-aware and congestion-free state management in network function virtualization , 2017, 2017 International Conference on Computing, Networking and Communications (ICNC).