A fault tolerant MPI-IO implementation using the Expand parallel file system

Parallel file systems obtain parallelism by using several independent server nodes, each supporting one or more secondary storage devices. This approach increases the performance and scalability of the system, but a fault in a single node can stop the whole system. To avoid this problem, data must be stored with some form of redundancy, so that any data held on a faulty element can be recovered. Fault tolerance can be provided in I/O systems using replication or RAID-based schemes. However, most current systems apply the same technique to all files in the system. This paper describes the fault tolerance support provided by Expand, a parallel file system based on standard servers. Expand allows different fault-tolerant mechanisms to be defined at the file level. The evaluation uses the FLASH-I/O benchmark to compare the performance of Expand, under different configurations, with that of PVFS.
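As a rough illustration of the RAID-based option mentioned above, the following Python sketch stripes a file over several data servers, keeps an XOR parity block on one extra server, and rebuilds the contents of a single failed server from the survivors. It is a minimal sketch under stated assumptions, not Expand's actual layout or API: the function names `xor_blocks`, `stripe_with_parity`, and `recover`, the fixed parity server, and the block size are all hypothetical.

```python
from functools import reduce


def xor_blocks(blocks):
    """Return the byte-wise XOR of equally sized byte strings."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def stripe_with_parity(data, n_data, block):
    """Stripe data over n_data servers plus one parity server.

    Hypothetical layout: the last server always holds the parity
    (no parity rotation). Data is zero-padded to a whole stripe.
    """
    stripe = n_data * block
    padded = data + b"\0" * (-len(data) % stripe)
    servers = [bytearray() for _ in range(n_data + 1)]
    for off in range(0, len(padded), stripe):
        blocks = [padded[off + i * block: off + (i + 1) * block]
                  for i in range(n_data)]
        for i, blk in enumerate(blocks):
            servers[i] += blk
        servers[n_data] += xor_blocks(blocks)
    return [bytes(s) for s in servers]


def recover(servers, failed):
    """Rebuild the contents of one failed server from all the others."""
    survivors = [s for i, s in enumerate(servers) if i != failed]
    return xor_blocks(survivors)


servers = stripe_with_parity(b"hello world!", n_data=3, block=2)
rebuilt = recover(servers, failed=1)
```

A per-file policy, as the abstract describes, would amount to choosing between a scheme like this one and plain replication (a full copy on each selected server) when the file is created. Parity uses less storage than replication but requires reading every surviving server to rebuild lost data.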
