The Design of the Expand Parallel File System

This article describes an implementation of MPI-IO on a new parallel file system, Expand (Expandable Parallel File System), which is built on NFS servers. Expand combines multiple NFS servers into a single distributed partition across which files are striped. It requires no changes to the NFS servers and uses RPC operations to provide parallel access to the same file. Expand is also independent of the client platform, because all operations are implemented with the RPC and NFS protocols. As a result, heterogeneous servers (Linux, Solaris, Windows 2000, etc.) can be joined to provide one parallel and distributed partition. The article covers the design, implementation and evaluation of Expand with MPI-IO. The evaluation was carried out on Linux clusters and compares Expand with PVFS.
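
To make the idea of striping a file over several servers concrete, the following is a minimal sketch of how a logical file offset could be mapped to a server and to an offset inside that server's part of the file. It assumes a fixed stripe size and a round-robin layout starting at server 0; neither detail is stated in this abstract, and the names `locate` and `split_request` are hypothetical, not part of Expand.

```python
from typing import List, Tuple


def locate(offset: int, stripe_size: int, num_servers: int) -> Tuple[int, int]:
    """Map a logical offset to (server index, offset in that server's subfile).

    Assumes round-robin striping with a fixed stripe size (an illustrative
    assumption, not a description of Expand's actual layout).
    """
    if offset < 0 or stripe_size <= 0 or num_servers <= 0:
        raise ValueError("offset must be >= 0; stripe_size and num_servers > 0")
    block = offset // stripe_size          # index of the logical stripe block
    server = block % num_servers           # blocks are dealt out round-robin
    local_block = block // num_servers     # blocks already stored on that server
    return server, local_block * stripe_size + offset % stripe_size


def split_request(offset: int, length: int, stripe_size: int,
                  num_servers: int) -> List[Tuple[int, int, int]]:
    """Split a contiguous logical range into (server, local_offset, length) pieces."""
    pieces = []
    end = offset + length
    while offset < end:
        server, local = locate(offset, stripe_size, num_servers)
        # Stop at the end of the current stripe block or the end of the request.
        chunk = min(stripe_size - offset % stripe_size, end - offset)
        pieces.append((server, local, chunk))
        offset += chunk
    return pieces
```

Under these assumptions, a single read or write that spans several stripe blocks breaks into independent per-server pieces, which a client could then issue to the servers in parallel.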
