Efficient Metadata Management in Large Distributed File Systems

Metadata management in large distributed systems is a critical aspect of overall system performance. Directory subtree partitioning and pure hashing are two common techniques used for metadata management. Neither of these techniques can support large systems with high concurrent accesses due to the conflicting need for both load balancing and directory update efficiency. We present a flexible approach that addresses the problems of the existing techniques. The approach, called Lazy Hybrid Hashed Hierarchical directory management, takes advantage of the load balancing and other performance benefits of hashing, while maintaining a hierarchical directory structure and metadata logging to support lazy metadata relocation and updates. A dual-entry Access Control List provides direct permission calculation to the target file, avoiding the overhead of traversing the pathname for permission check.

[1]  R. S. Fabry,et al.  A fast file system for UNIX , 1984, TOCS.

[2]  John A. Kunze,et al.  A trace-driven analysis of the UNIX 4.2 BSD file system , 1985, SOSP '85.

[3]  Bruce J. Walker,et al.  The LOCUS Distributed System Architecture , 1986 .

[4]  Mahadev Satyanarayanan,et al.  Andrew: a distributed personal computing environment , 1986, CACM.

[5]  Andrew R. Cherenson,et al.  The Sprite network operating system , 1988, Computer.

[6]  Mahadev Satyanarayanan,et al.  Coda: a highly available file system for a distributed workstation environment , 1989, Proceedings of the Second Workshop on Workstation Operating Systems.

[7]  Mahadev Satyanarayanan,et al.  Coda: A Highly Available File System for a Distributed Workstation Environment , 1990, IEEE Trans. Computers.

[8]  Abraham Silberschatz,et al.  Distributed file systems: concepts and examples , 1990, CSUR.

[9]  Randy H. Katz,et al.  RAMA: a file system for massively-parallel computers , 1993, [1993] Proceedings Twelfth IEEE Symposium on Mass Storage systems.

[10]  Carl Smith,et al.  NFS Version 3: Design and Implementation , 1994, USENIX Summer.

[11]  Dror G. Feitelson,et al.  The Vesta parallel file system , 1996, TOCS.

[12]  Mahadev Satyanarayanan,et al.  Long Term Distributed File Reference Tracing: Implementation and Experience , 1996, Softw. Pract. Exp..

[13]  Michael J. Callahan,et al.  The InterMezzo File System , 1999 .

[14]  David Robinson,et al.  NFS version 4 Protocol , 2000, RFC.

[15]  Rodney Van Meter,et al.  Network attached storage architecture , 2000, CACM.

[16]  Peter Braam,et al.  The Lustre Storage Architecture , 2019, ArXiv.