Efficiently Coding Replicas to Erasure Coded Blocks in Distributed Storage Systems

Modern distributed storage systems usually store new data in replicas, and later code these data into erasure coded blocks when they get cold. This letter studies optimal bandwidth consumption problem for this replica to erasure coded blocks (R2E) coding process, and proposes schemes of R2E_singleTree and R2E_multiTree based on problem observations. Theoretical analysis and evaluation are conducted for these two schemes.