De-duplication Approaches in Cloud Computing Environment: A Survey

Growing demand for cloud services has made increasing data storage capacity an important challenge. Several approaches have been proposed to identify and remove duplicated data in virtual machines before the data is sent to a shared storage resource. Storage methods should therefore be efficient, and methods for locating data should be as intelligent as possible. However, no single storage approach can be expected to deliver the best performance in bandwidth use in every case. De-duplication is one useful strategy for fast and efficient data storage. In this paper, we survey various de-duplication approaches and consider their advantages and disadvantages.
