Efficient Distributed File I/O for Visualization in Grid Environments

Large-scale simulations running in metacomputing environments face the problem of efficient file I/O. For efficiency it is desirable to write data locally, distributed across the computing environment, and then to minimize data transfer, that is, reduce remote file access. Both aspects require I/O approaches that differ from existing paradigms. For the data output of distributed simulations, one wants to use fast local parallel I/O for all participating nodes, producing a single distributed logical file, while keeping changes to the simulation code as small as possible. For reading the data file, as in postprocessing and file-based visualization, one wants to have efficient partial access to remote and distributed files, using a global naming scheme and efficient data caching, and again keeping the changes to the postprocessing code small. However, all available software solutions require all data to be staged locally (involving possible data recombination and conversion), or suffer from the performance problems of remote or distributed file systems. In this paper we show how to interface the HDF5 I/O library via its flexible Virtual File Driver layer to the Globus Data Grid. We show that combining these two toolkits in a suitable way provides us with a new I/O framework, which allows efficient, secure, distributed and parallel file I/O in a metacomputing environment.

[1]  John Shalf,et al.  Numerical Relativity in a Distributed Environment , 1999, PPSC.

[2]  Ian T. Foster,et al.  GASS: a data movement and access service for wide area computing systems , 1999, IOPADS '99.

[3]  Vaidy S. Sunderam,et al.  PVM: A Framework for Parallel Distributed Computing , 1990, Concurr. Pract. Exp..

[4]  Peter P. Edwards,et al.  The changing phase of liquid metals , 1996 .

[5]  Ian T. Foster,et al.  The Globus project: a status report , 1998, Proceedings Seventh Heterogeneous Computing Workshop (HCW'98).

[6]  Tom Goodale,et al.  The Cactus computational collaboratory: enabling technologies for relativistic astrophysics, and a toolkit for solving PDE's by communities in science and engineering , 1999, Proceedings. Frontiers '99. Seventh Symposium on the Frontiers of Massively Parallel Computation.

[7]  Larry Smarr,et al.  When Black Holes Collide , 1997 .

[8]  John Shalf,et al.  Solving Einstein's Equations on Supercomputers , 1999, Computer.

[9]  Edward Seidel Black Hole Coalescence and Mergers : Review, Status, and "Where are We Heading?" , 1999 .

[10]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[11]  E. Seidel,et al.  Technologies for collaborative, large scale simulation in astrophysics and a general toolkit for solving PDE's in science and engineering. , 1999 .

[12]  Ian T. Foster,et al.  The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets , 2000, J. Netw. Comput. Appl..

[13]  Mathilde Romberg The UNICORE architecture: seamless access to distributed resources , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[14]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[15]  James C. French,et al.  A Synopsis of the Legion Project , 1994 .

[16]  Philip E. Mosely,et al.  Where are we Heading , 1947 .

[17]  Edward Seidel,et al.  Numerical relativity and black holes , 1996 .

[18]  Jesús Carretero,et al.  ParFiSys: a parallel file system for MPP , 1996, OPSR.

[19]  Jason Lee,et al.  A network-aware distributed storage cache for data intensive environments , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[20]  Joshua R. Smith,et al.  LIGO: the Laser Interferometer Gravitational-Wave Observatory , 1992, Science.