论文信息 - Victim Disk First: An Asymmetric Cache to Boost the Performance of Disk Arrays under Faulty Conditions

Victim Disk First: An Asymmetric Cache to Boost the Performance of Disk Arrays under Faulty Conditions

The buffer cache plays an essential role in smoothing the gap between the upper-level computational components and the lower-level storage devices. A good buffer cache management scheme should be beneficial to not only the computational components, but also to the storage components by reducing disk I/Os. Existing cache replacement algorithms are well optimized for disks in normal mode, but inefficient under faulty scenarios, such as a parity-based disk array with faulty disk(s). To address this issue, we propose a novel asymmetric buffer cache replacement strategy, named Victim (or faulty) Disk(s) First (VDF) cache, to improve the reliability and performance of a storage system consisting of a buffer cache and disk arrays. The basic idea is to give higher priority to cache the blocks on the faulty disks when the disk array fails, thus reducing the I/Os directed to the faulty disks. To verify the effectiveness of the VDF cache, we have integrated VDF into two popular cache algorithms LFU and LRU, named VDF-LFU and VDF-LRU, respectively. We have conducted extensive simulations as well as a prototype implementation. The simulation results show that VDF-LFU can reduce disk I/Os to surviving disks by up to 42.3% and VDF-LRU can reduce those by up to 36.2%. Our measurement results also show that VDF-LFU can speed up the online recovery by up to 46.3% under a spare-rebuilding mode with online reconstruction, or improve the maximum system service rate by up to 47.7% under a degraded mode without a reconstruction workload. Similarly, VDF-LRU can speed up the online recovery by up to 34.6%, or improve the system service rate by up to 28.4%.

[1] John C. S. Lui,et al. Automatic Recovery from Disk Failure in Continuous-Media Servers , 2002, IEEE Trans. Parallel Distributed Syst..

[2] Qi Zhang,et al. Characterization of storage workload traces from production Windows Servers , 2008, 2008 IEEE International Symposium on Workload Characterization.

[3] Randy H. Katz,et al. A case for redundant arrays of inexpensive disks (RAID) , 1988, SIGMOD '88.

[4] Sang Lyul Min,et al. LRFU: A Spectrum of Policies that Subsumes the Least Recently Used and Least Frequently Used Policies , 2001, IEEE Trans. Computers.

[5] Tao Xie,et al. MICRO: A Multilevel Caching-Based Reconstruction Optimization for Mobile Storage Systems , 2008, IEEE Transactions on Computers.

[6] Gerhard Weikum,et al. An optimality proof of the LRU-K page replacement algorithm , 1999, JACM.

[7] Robert Y. Hou,et al. Balancing I/O response time and disk rebuild time in a RAID5 disk array , 1993, [1993] Proceedings of the Twenty-sixth Hawaii International Conference on System Sciences.

[8] Daniel P. Siewiorek,et al. Fast, on-line failure recovery in redundant disk arrays , 1993, FTCS-23 The Twenty-Third International Symposium on Fault-Tolerant Computing.

[9] Gerhard Weikum,et al. The LRU-K page replacement algorithm for database disk buffering , 1993, SIGMOD Conference.

[10] J. T. Robinson,et al. Data cache management using frequency-based replacement , 1990, SIGMETRICS '90.

[11] Asit Dan,et al. An approximate analysis of the LRU and FIFO buffer replacement schemes , 1990, SIGMETRICS '90.

[12] J. Menon,et al. Distributed sparing in disk arrays , 1992, Digest of Papers COMPCON Spring 1992.

[13] Ethan L. Miller,et al. Evaluation of distributed recovery in large-scale storage systems , 2004, Proceedings. 13th IEEE International Symposium on High performance Distributed Computing, 2004..

[14] Garth A. Gibson,et al. Parity declustering for continuous operation in redundant disk arrays , 1992, ASPLOS V.

[15] Nimrod Megiddo,et al. ARC: A Self-Tuning, Low Overhead Replacement Cache , 2003, FAST.

[16] Xubin He,et al. Code-M: A non-MDS erasure code scheme to support fast recovery from up to two-disk failures in storage systems , 2010, 2010 IEEE/IFIP International Conference on Dependable Systems & Networks (DSN).

[17] Xiaozhou Li,et al. Flat XOR-based erasure codes in storage systems: Constructions, efficient recovery, and tradeoffs , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[18] Yuanyuan Zhou,et al. Second-level buffer cache management , 2004, IEEE Transactions on Parallel and Distributed Systems.

[19] Xiaoning Ding,et al. DiskSeen: Exploiting Disk Layout and Access History to Enhance I/O Prefetch , 2007, USENIX Annual Technical Conference.

[20] Song Jiang,et al. Making LRU friendly to weak locality workloads: a novel replacement algorithm to improve buffer cache performance , 2005, IEEE Transactions on Computers.

[21] Yuanyuan Zhou,et al. The Multi-Queue Replacement Algorithm for Second Level Buffer Caches , 2001, USENIX Annual Technical Conference, General Track.

[22] John C. S. Lui,et al. Performance Analysis of Disk Arrays under Failure , 1990, VLDB.

[23] Dennis Shasha,et al. 2Q: A Low Overhead High Performance Buffer Management Replacement Algorithm , 1994, VLDB.

[24] Hong Jiang,et al. PRO: A Popularity-based Multi-threaded Reconstruction Optimization for RAID-Structured Storage Systems , 2007, FAST.

[25] Song Jiang,et al. LIRS: an efficient low inter-reference recency set replacement policy to improve buffer cache performance , 2002, SIGMETRICS '02.

[26] Hong Jiang,et al. Implementation and Evaluation of a Popularity-Based Reconstruction Optimization Algorithm in Availability-Oriented Disk Arrays , 2007, 24th IEEE Conference on Mass Storage Systems and Technologies (MSST 2007).

[27] Xiaoning Ding,et al. DULO: an effective buffer cache management scheme to exploit both temporal and spatial locality , 2005, FAST'05.

[28] Sang Lyul Min,et al. On the existence of a spectrum of policies that subsumes the least recently used (LRU) and least frequently used (LFU) policies , 1999, SIGMETRICS '99.

[29] Hong Jiang,et al. WorkOut: I/O Workload Outsourcing for Boosting RAID Reconstruction Performance , 2009, FAST.