Improving the Performance of Read Operations in Distributed File System

Distributed file systems are used in distributed systems to provide scalable storage and to enable parallel and distributed processing of data. Sharing the stored data among authorized users is a major requirement of a distributed file system. Popular distributed file systems use session semantics to share data among users. In this paper, we propose a read algorithm based on speculative semantics, which permits a read process to read the updates made by a concurrent write process to the distributed file system, provided that such a read does not affect data consistency.
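
The contrast between session semantics and a speculative read can be illustrated with a small sketch. This is not the algorithm proposed in the paper, whose details are not given in the abstract. It is a toy in-memory model under stated assumptions: `ToyFile`, `session_read`, `speculative_read`, and `resolve` are hypothetical names, a single concurrent writer is assumed, and "consistency" is reduced to keeping only the result that matches the writer's final outcome.

```python
from typing import Callable, Dict, Optional


class ToyFile:
    """Hypothetical in-memory file with one committed version and at most
    one uncommitted (in-flight) writer version."""

    def __init__(self, data: bytes) -> None:
        self.committed: bytes = data
        self.pending: Optional[bytes] = None  # writer's uncommitted update

    def write(self, data: bytes) -> None:
        # The writer's update stays invisible to session readers until close.
        self.pending = data

    def close_write(self) -> None:
        # Closing the write session publishes the update.
        if self.pending is not None:
            self.committed = self.pending
            self.pending = None

    def abort_write(self) -> None:
        # Aborting discards the update.
        self.pending = None


def session_read(f: ToyFile) -> bytes:
    """Session semantics: the reader sees only the last closed version."""
    return f.committed


def speculative_read(f: ToyFile, process: Callable[[bytes], bytes]) -> Dict[str, bytes]:
    """Assumed speculative read: process the committed version and, if a
    writer is active, also its uncommitted version."""
    results = {"before": process(f.committed)}
    if f.pending is not None:
        results["after"] = process(f.pending)
    return results


def resolve(results: Dict[str, bytes], writer_committed: bool) -> bytes:
    """Keep the speculative result only if the writer actually committed."""
    if writer_committed and "after" in results:
        return results["after"]
    return results["before"]


f = ToyFile(b"v1")
f.write(b"v2")                      # concurrent writer, not yet closed
stale = session_read(f)             # session reader still sees b"v1"
results = speculative_read(f, bytes.upper)
f.close_write()                     # the writer commits
fresh = resolve(results, writer_committed=True)
```

In this sketch the reader does its processing before the writer closes, so it does not have to wait for or re-read the update; if the writer had aborted, `resolve` would fall back to the result computed from the committed version.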
