PC-OPT: Optimal Offline Prefetching and Caching for Parallel I/O Systems

We address the problem of prefetching and caching in a parallel I/O system and present a new algorithm for parallel disk scheduling. Traditional buffer management algorithms that minimize the number of block misses are substantially suboptimal in a parallel I/O system where multiple I/Os can proceed simultaneously. We show that in the off line case, where a priori knowledge of all the requests is available, PC-OPT performs the minimum number of I/Os to service the given I/O requests. This is the first parallel I/O scheduling algorithm that is provably offline optimal in the parallel disk model. In the online case, we study the context of global L-block lookahead, which gives the buffer management algorithm a lookahead consisting of L distinct requests. We show that the competitive ratio of PC-OPT, with global L-block lookahead, is /spl Theta/(M - L + D), when L /spl les/ M, and /spl Theta/(MD/L), when L > M, where the number of disks is D and buffer size is M.

[1]  Jim Zelenka,et al.  Informed prefetching and caching , 1995, SOSP.

[2]  Rakesh M. Verma,et al.  Tight Bounds for Prefetching and Buffer Management Algorithms for Parallel I/O Systems , 1996, FSTTCS.

[3]  Anna R. Karlin,et al.  Competitive snoopy caching , 1986, 27th Annual Symposium on Foundations of Computer Science (sfcs 1986).

[4]  Alok Aggarwal,et al.  The input/output complexity of sorting and related problems , 1988, CACM.

[5]  Amos Fiat,et al.  Competitive Paging Algorithms , 1991, J. Algorithms.

[6]  Anna R. Karlin,et al.  Near-Optimal Parallel Prefetching and Caching , 2000, SIAM J. Comput..

[7]  Peter Sanders,et al.  Fast Concurrent Access to Parallel Disks , 2000, SODA '00.

[8]  Peter J. Varman,et al.  Optimal Read-Once Parallel Disk Scheduling , 1999, IOPADS '99.

[9]  Jeffrey Scott Vitter,et al.  Competitive parallel disk prefetching and buffer management , 1997, IOPADS '97.

[10]  Thomas H. Cormen,et al.  Multidimensional, multiprocessor, out-of-core FFTs with distributed memory and parallel disks (extended abstract) , 1999, SPAA '99.

[11]  Vinay S. Pai,et al.  Markov Analysis of Multiple-Disk Prefetching Strategies for External Merging , 1994, Theor. Comput. Sci..

[12]  Garth A. Gibson,et al.  Automatic I/O hint generation through speculative execution , 1999, OSDI '99.

[13]  Anna R. Karlin,et al.  A study of integrated prefetching and caching strategies , 1995, SIGMETRICS '95/PERFORMANCE '95.

[14]  Susanne Albers,et al.  Minimizing stall time in single and parallel disk systems , 1998, STOC '98.

[15]  Edward F. Grove,et al.  Simple randomized mergesort on parallel disks , 1996, SPAA '96.

[16]  Susanne Albers,et al.  On the Influence of Lookahead in Competitive Paging Algorithms , 1997, Algorithmica.

[17]  Thomas H. Cormen,et al.  Early Experiences in Evaluating the Parallel Disk Model with the ViC* Implementation , 1996, Parallel Comput..

[18]  Michael J. Franklin,et al.  R × W: a scheduling approach for large-scale on-demand data broadcast , 1999, TNET.

[19]  Carla Schlatter Ellis,et al.  Prefetching in File Systems for MIMD Multiprocessors , 1990, IEEE Trans. Parallel Distributed Syst..

[20]  Robert E. Tarjan,et al.  Amortized efficiency of list update and paging rules , 1985, CACM.

[21]  Dany Breslauer,et al.  On Competitive On-Line Paging with Lookahead , 1995, Theor. Comput. Sci..

[22]  Laszlo A. Belady,et al.  A Study of Replacement Algorithms for Virtual-Storage Computer , 1966, IBM Syst. J..