Predicting file system actions from prior events

We have adapted the multi-order context modeling technique used in the data compression method Prediction by Partial Match (PPM) to track sequences of file access events. From this model, we determine which file system accesses have a high probability of occurring as the next event. By prefetching the data for these events, we transform an LRU cache into a predictive cache that, in our simulations, averages 15% more cache hits than LRU. In fact, on average our 4-megabyte predictive cache has a higher cache hit rate than a 90-megabyte LRU cache.
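The idea can be illustrated with a minimal sketch, which is not the implementation evaluated in the simulations. It assumes a flat count table keyed by context tuples, a hypothetical maximum order of 2, and a hypothetical probability threshold of 0.5; the names `ContextModel` and `PredictiveCache` are invented for this illustration, and file sizes are ignored (capacity is counted in files).

```python
from collections import OrderedDict, defaultdict, deque


class ContextModel:
    """Counts which file follows each recent context of 0..max_order files."""

    def __init__(self, max_order=2):  # max_order=2 is an assumed value
        self.counts = defaultdict(lambda: defaultdict(int))
        self.history = deque(maxlen=max_order)

    def record(self, name):
        """Update the follower counts of every context order, then advance."""
        recent = tuple(self.history)
        for order in range(len(recent) + 1):
            context = recent[len(recent) - order:]  # last `order` events
            self.counts[context][name] += 1
        self.history.append(name)

    def predict(self, threshold=0.5):  # threshold is an assumed value
        """Return files whose estimated next-event probability >= threshold."""
        recent = tuple(self.history)
        likely = set()
        for order in range(1, len(recent) + 1):
            followers = self.counts.get(recent[len(recent) - order:])
            if not followers:
                continue
            total = sum(followers.values())
            likely.update(f for f, n in followers.items() if n / total >= threshold)
        return sorted(likely)


class PredictiveCache:
    """LRU cache that also prefetches the files the model predicts next."""

    def __init__(self, capacity, model):
        self.capacity = capacity
        self.model = model
        self.entries = OrderedDict()  # least recently used entry first
        self.hits = 0

    def _insert(self, name):
        self.entries[name] = True
        self.entries.move_to_end(name)
        while len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict least recently used

    def access(self, name):
        hit = name in self.entries
        self.hits += hit
        self._insert(name)
        self.model.record(name)
        for candidate in self.model.predict():
            if candidate not in self.entries:
                self._insert(candidate)  # stands in for a real prefetch
        return hit


cache = PredictiveCache(capacity=2, model=ContextModel())
for name in ["a", "b", "c"] * 3:
    cache.access(name)
print(cache.hits)
```

On this cyclic trace a plain two-entry LRU cache never hits, because each file is evicted just before it is needed again; the sketch starts hitting once the model has seen the cycle once. The trace is a toy chosen to favor prediction and says nothing about the simulation results reported above.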
