Lightweight Dynamic Redundancy Control with Adaptive Encoding for Server-based Storage

With the recent performance improvements in commodity hardware, low-cost commodity server-based storage has become a practical alternative to dedicated-storage appliances. Because of the high failure rate of commodity servers, data redundancy across multiple servers is required in a server-based storage system. However, the extra storage capacity for this redundancy significantly increases the system cost. Although erasure coding (EC) is a promising method to reduce the amount of redundant data, it requires distributing and encoding data among servers. There remains a need to reduce the performance impact of these processes involving much network traffic and processing overhead. Especially, the performance impact becomes significant for random-intensive applications. In this article, we propose a new lightweight redundancy control for server-based storage. Our proposed method uses a new local filesystem-based approach that avoids distributing data by adding data redundancy to locally stored user data. Our method switches the redundancy method of user data between replication and EC according to workloads to improve capacity efficiency while achieving higher performance. Our experiments show up to 230% better online-transaction-processing performance for our method compared with CephFS, a widely used alternative system. We also confirmed that our proposed method prevents unexpected performance degradation while achieving better capacity efficiency.

[1]  Takahiro Nakano,et al.  A Method to Adapt Storage Protocol Stack Using Custom File Metadata to Commodity Linux Servers , 2018 .

[2]  Antony I. T. Rowstron,et al.  Write off-loading: Practical power management for enterprise storage , 2008, TOS.

[3]  James S. Plank,et al.  Erasure Codes for Storage Systems: A Brief Primer , 2013, login Usenix Mag..

[4]  Won Kim,et al.  Cloud Computing: Today and Tomorrow , 2009, J. Object Technol..

[5]  Abutalib Aghayev,et al.  File systems unfit as distributed storage backends: lessons from 10 years of Ceph evolution , 2019, SOSP.

[6]  Masaaki Iwasaki,et al.  Directory-Aware File System Backup to Object Storage for Fast On-Demand Restore , 2017 .

[7]  Yanpei Chen,et al.  Design implications for enterprise storage systems via multi-dimensional trace analysis , 2011, SOSP '11.

[8]  Howard Gobioff,et al.  The Google file system , 2003, SOSP '03.

[9]  Xiao Liu,et al.  Basic Performance Measurements of the Intel Optane DC Persistent Memory Module , 2019, ArXiv.

[10]  Erez Zadok,et al.  Filebench: A Flexible Framework for File System Benchmarking , 2016, login Usenix Mag..

[11]  Dutch T. Meyer,et al.  A study of practical deduplication , 2011, TOS.

[12]  Garth A. Gibson,et al.  DiskReduce : Replication as a Prelude to Erasure Coding in Data-Intensive Scalable Computing , 2011 .

[13]  Mariko Sugawara,et al.  Understanding storage traffic characteristics on enterprise virtual desktop infrastructure , 2017, SYSTOR.

[14]  ShinagawaTakahiro,et al.  Practical Quick File Server Migration , 2020 .

[15]  Mrinmoy Ghosh,et al.  Performance analysis of NVMe SSDs and their implication on real world databases , 2015, SYSTOR.