Directory Reference Patterns in Hierarchical File Systems

The authors present a brief description of data on directory reference patterns collected from a 4.2BSD UNIX system. These data are used to examine the importance of the name lookup overhead involved in opening and using files. The analysis shows that paths in the environment are relatively long and that, in the absence of caching, name resolution overhead accounts for over 70% of the disk blocks referenced to open and use files. These results confirm recent conjectures on the high level of directory activity in UNIX file systems. Directory references exhibit strong locality, though, making caches an effective way to decrease directory overhead. Simulations of a least recently used (IRU) whole directory cache show that a cache holding just ten nodes achieves an 85% hit ratio. The implications of these results on the design of both local and distributed file systems are discussed. >

[1]  Mahadev Satyanarayanan,et al.  A study of file sizes and functional lifetimes , 1981, SOSP.

[2]  Carla Schlatter Ellis,et al.  The Roe File System , 1983, Symposium on Reliability in Distributed Software and Database Systems.

[3]  Alan Jay Smith Analysis of Long Term File Reference Patterns for Application to File Migration Algorithms , 1981, IEEE Transactions on Software Engineering.

[4]  K. Thompson,et al.  The UNIX time-sharing system , 1978 .

[5]  John Kunze,et al.  A trace-driven analysis of the unix 4 , 1985, SOSP 1985.

[6]  Edward Preble Stritter,et al.  File migration , 1976 .

[7]  Rick Floyd,et al.  Directory Reference Patterns in a UNIX Environment. , 1986 .

[8]  Bruce Walker,et al.  The LOCUS distributed operating system , 1983, SOSP '83.

[9]  Jeffrey C Mogull Representing information about files , 1986, ICDCS 1986.

[10]  Andrew S. Tanenbaum,et al.  Immediate files , 1984, Softw. Pract. Exp..

[11]  Rick Floyd Short-Term File Reference Patterns in a UNIX Environment, , 1986 .

[12]  Alan Jay Smith,et al.  A File System Tracing Package for Berkeley UNIX , 1985 .

[13]  R. A. Floyd Transparency in distributed file systems , 1989 .

[14]  Robert S. Fabry,et al.  A fast file system for UNIX , 1984, TOCS.

[15]  Walter F. Tichy,et al.  Towards a Distributed File System , 1984 .

[16]  Alan Jay Smith,et al.  Disk cache—miss ratio analysis and design considerations , 1983, TOCS.

[17]  Mahadev Satyanarayanan,et al.  The ITC distributed file system: principles and design , 1985, SOSP 1985.

[18]  Samuel J. Leffler,et al.  Measuring and Improving the Performance of 4.2BSD , 1984 .

[19]  Mahadev Satyanarayanan A methodology for modelling storage systems and its application to a network file system , 1983 .

[20]  Gerald J. Popek,et al.  Name Service Locality and Cache Design in a Distributed Operating System , 1986, ICDCS.

[21]  Juan M. Porcar File migration in distributed computer systems , 1982, Perform. Evaluation.