An Effective RAID Data Layout for Object-Based De-duplication Backup System

We propose an alternative Redundant array of independent disks (RAID) data layout, Asymmetrical grouping data organization (AGDO), for object-based data de-duplication backup system. Object-based data deduplication is an effective solution for detecting duplicate data for compound files. We designed an asymmetrical grouping strategy, the disk in the array are partitioned into different groups, and in each group parallelism data access scheme is adopted, different types of objects are stored in different groups with maybe different group size, it can set and manage dynamic group size by using dynamic disk group adjustment algorithms. The performance of AGDO is evaluated and proved to be sufficient for the continuous storage application. The result is that disk accesses are concentrated in a part of the disks over a long time period and reduces the power consumption to 25% in a 10 disk configuration. Moreover, object-based de-duplication combined with AGDO has great potential in increasing data restoration speed for compound files. We have shown that this combination makes average restoration speed improved 11%.