File system usage in Windows NT 4.0

We have performed a study of the usage of the Windows NT File System through long-term kernel tracing. Our goal was to provide a new data point with respect to the 1985 and 1991 trace-based File System studies, to investigate the usage details of the Windows NT file system architecture, and to study the overall statistical behavior of the usage data.In this paper we report on these issues through a detailed comparison with the older traces, through details on the operational characteristics and through a usage analysis of the file system and cache manager. Next to architectural insights we provide evidence for the pervasive presence of heavy-tail distribution characteristics in all aspect of file system usage. Extreme variances are found in session inter-arrival time, session holding times, read/write frequencies, read/write buffer sizes, etc., which is of importance to system engineering, tuning and benchmarking.

[1]  Mahadev Satyanarayanan,et al.  Long Term Distributed File Reference Tracing: Implementation and Experience" Technical Report CMU-CS , 1994 .

[2]  V. Paxson,et al.  WHERE MATHEMATICS MEETS THE INTERNET , 1998 .

[3]  Eric A. Brewer,et al.  Self-similarity in file systems , 1998, SIGMETRICS '98/PERFORMANCE '98.

[4]  Sidney I. Resnick,et al.  Heavy Tail Modelling and Teletraffic Data , 1995 .

[5]  Mor Harchol-Balter,et al.  Exploiting process lifetime distributions for dynamic load balancing , 1995, SIGMETRICS.

[6]  William J. Bolosky,et al.  A large-scale study of file-system contents , 1999, SIGMETRICS '99.

[7]  Alan Jay Smith,et al.  A File System Tracing Package for Berkeley UNIX , 1985 .

[8]  Geoffrey H. Kuenning,et al.  An Analysis of Trace Data for Predictive File Caching in Mobile Computing , 1994, USENIX Summer.

[9]  Mary Baker,et al.  Measurements of a distributed file system , 1991, SOSP '91.

[10]  David A. Solomon,et al.  Inside windows nt second edition , 1998 .

[11]  Mahadev Satyanarayanan,et al.  Disconnected operation in the Coda File System , 1992, TOCS.

[12]  K OusterhoutJohn,et al.  Caching in the Sprite network file system , 1988 .

[13]  Helen Custer,et al.  Inside Windows NT , 1992 .

[14]  M. Crovella,et al.  Heavy-tailed probability distributions in the World Wide Web , 1998 .

[15]  Gennady Samorodnitsky,et al.  Patterns of buffer overflow in a class of queues with long memory in the input stream , 1997 .

[16]  John Kunze,et al.  A trace-driven analysis of the unix 4 , 1985, SOSP 1985.

[17]  John A. Kunze,et al.  A trace-driven analysis of the UNIX 4.2 BSD file system , 1985, SOSP '85.

[18]  Rajeev Nagar,et al.  Windows NT file system internals - a developer's guide: building NT file system drivers , 1997 .

[19]  Michael Dahlin,et al.  A quantitative analysis of cache policies for scalable network file systems , 1994, SIGMETRICS.

[20]  Mahadev Satyanarayanan,et al.  A study of file sizes and functional lifetimes , 1981, SOSP.

[21]  Mendel Rosenblum,et al.  The design and implementation of a log-structured file system , 1991, SOSP '91.

[22]  Margo I. Seltzer,et al.  The case for application-specific benchmarking , 1999, Proceedings of the Seventh Workshop on Hot Topics in Operating Systems.

[23]  Mark A. McComb A Practical Guide to Heavy Tails , 2000, Technometrics.

[24]  Shikharesh Majumdar,et al.  Measurement and analysis of locality phases in file referencing behaviour , 1986, SIGMETRICS '86/PERFORMANCE '86.

[25]  Alan Jay Smith Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms , 1981, IEEE Transactions on Software Engineering.

[26]  H. Liu,et al.  Conference on Measurement and modeling of computer systems , 2001 .

[27]  Mahadev Satyanarayanan,et al.  Long Term Distributed File Reference Tracing: Implementation and Experience , 1996, Softw. Pract. Exp..

[28]  K. K. Ramakrishnan,et al.  Analysis of file I/O traces in commercial computing environments , 1992, SIGMETRICS '92/PERFORMANCE '92.

[29]  Rajeev Nagar,et al.  Windows NT File System Internals , 1997 .