A Correlation-Aware Prefetching Strategy for Object-Based File System

The prefetching strategies used in modern distributed storage systems are generally based on the temporal and/or spatial locality of requests. Because of the special properties of object-based storage systems, however, these traditional tactics are largely ineffective. This paper presents a new prefetching approach that takes the correlation among objects into account. Two orthogonal replica distribution algorithms are proposed to aggregate prefetching operations. A moving window mechanism is also developed to control prefetching. We implement these approaches in our object-based file system, NBJLOFS (short for Nankai-Baidu Joint Lab Object-based File System). The experimental results show that these approaches improve throughput by up to 80%.
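To make the idea of correlation-aware prefetching concrete, the following is a minimal sketch, not the NBJLOFS implementation. It assumes that correlation is estimated simply by counting how often one object is accessed shortly after another. The class name, the `window` and `min_count` parameters, and the counting rule are all hypothetical, and the `window` here is only a look-back length for counting; it is not meant to reproduce the paper's moving window mechanism.

```python
from collections import Counter, defaultdict, deque


class CorrelationPrefetcher:
    """Toy successor-count model; every name here is an illustrative assumption."""

    def __init__(self, window=4, min_count=2):
        self.window = window          # how many recent accesses count as "related"
        self.min_count = min_count    # co-occurrences required before prefetching
        self.recent = deque(maxlen=window)
        self.follows = defaultdict(Counter)  # follows[a][b]: times b came soon after a

    def record(self, obj_id):
        """Register an access and update the co-occurrence counts."""
        for prev in self.recent:
            if prev != obj_id:
                self.follows[prev][obj_id] += 1
        self.recent.append(obj_id)

    def candidates(self, obj_id, limit=2):
        """Return up to `limit` objects most often seen shortly after obj_id."""
        ranked = self.follows[obj_id].most_common()
        return [o for o, n in ranked if n >= self.min_count][:limit]


prefetcher = CorrelationPrefetcher(window=2, min_count=2)
for obj in ["a", "b", "c", "a", "b", "d", "a", "b"]:
    prefetcher.record(obj)

# Objects worth prefetching when "a" is requested, under the assumptions above.
print(prefetcher.candidates("a"))
```

In this trace, "b" repeatedly follows "a", so it is the only object that crosses the threshold. A real system would also need to bound the size of the count table and age out stale correlations.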
