Data Storage Layout for Object-Based De-Duplication System

The volume of unstructured data continues to grow rapidly. Object-based data de-duplication is currently the most advanced de-duplication method and an effective way to detect duplicate data. We developed an energy-saving policy for conventional disk-based RAID systems. Based on the characteristics of object-based data de-duplication, we introduce object layout strategies for unstructured data applications. Under these strategies, disk accesses stay concentrated on a subset of the disks for long periods, which makes it possible to schedule the remaining disks into standby or shutdown mode. The proposed methods reduce the energy consumption of a de-duplication storage system.
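The idea of concentrating accesses on a few disks can be illustrated with a minimal sketch. This is not the layout strategy of the paper, whose details the abstract does not give. It assumes a hypothetical `DedupStore` class that fingerprints whole objects with SHA-256 and fills one disk completely before writing to the next, so disks that have not been reached yet stay idle.

```python
import hashlib


class DedupStore:
    """Toy object store (hypothetical): de-duplicates whole objects by
    content hash and fills one disk before moving on to the next."""

    def __init__(self, num_disks, disk_capacity):
        self.disk_capacity = disk_capacity   # capacity of each disk in bytes
        self.used = [0] * num_disks          # bytes stored on each disk
        self.index = {}                      # fingerprint -> disk number
        self.active = 0                      # disk currently receiving writes

    def put(self, data: bytes) -> int:
        """Store an object and return the disk number that holds it."""
        fingerprint = hashlib.sha256(data).hexdigest()
        if fingerprint in self.index:
            # Duplicate object: nothing is written, no extra disk is touched.
            return self.index[fingerprint]
        # Advance to the next disk only when the current one cannot hold the object.
        while self.used[self.active] + len(data) > self.disk_capacity:
            if self.active + 1 >= len(self.used):
                raise RuntimeError("store is full")
            self.active += 1
        self.used[self.active] += len(data)
        self.index[fingerprint] = self.active
        return self.active

    def idle_disks(self):
        """Disks that hold no data yet and could stay in standby."""
        return [disk for disk, used in enumerate(self.used) if used == 0]


store = DedupStore(num_disks=4, disk_capacity=10)
first = store.put(b"aaaa")       # new object, written to disk 0
second = store.put(b"aaaa")      # duplicate, resolved through the index
third = store.put(b"bbbbbbb")    # does not fit on disk 0, goes to disk 1
```

In this sketch, disks 2 and 3 never receive a write, so a power manager could keep them in standby. A real system would also have to consider read locality, RAID redundancy, and the cost of spinning disks up again.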
