File transfer in Grid applications at deployment, execution and retrieval

In this article a three staged file transfer approach for the Grid is proposed. File transfer in the Grid can take place at three stages: deployment, user application execution, and retrieval (post-execution). Each stage has it's own environmental requirements, and therefore different techniques must be applied. The contributions presented in this article can be grouped in two. First, the integration of heterogeneous resource acquisition and file transfer protocols. This allows on-the-fly deployment and retrieval of files. Secondly, an asynchronous file transfer mechanism based on active objects, wait-by-necessity, and automatic continuation. The proposed file transfer model has been implemented using the ProActive middleware. ProActive provides, among others, a Grid infrastructure abstraction using deployment descriptors, and a programming model based on active objects with transparent futures. Finally, the proposed file transfer model is benchmarked and validated with a real case study: BLAST.

[1]  Ncbi National Center for Biotechnology Information , 2008 .

[2]  Denis Caromel,et al.  A Theory of Distributed Objects , 2005 .

[3]  Andy Oram,et al.  Peer-to-Peer: Harnessing the Power of Disruptive Technologies , 2001 .

[4]  M. Humphrey,et al.  LegionFS: A Secure and Scalable File System Supporting Cross-Domain High-Performance Applications , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[5]  Jarek Nabrzyski,et al.  GridLab--a grid application toolkit and testbed , 2002, Future Gener. Comput. Syst..

[6]  Kavitha Ranganathan,et al.  Decoupling computation and data scheduling in distributed data-intensive applications , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[7]  Yves Robert,et al.  Scheduling tasks sharing files on heterogeneous master-slave platforms , 2004, 12th Euromicro Conference on Parallel, Distributed and Network-Based Processing, 2004. Proceedings..

[8]  Denis Caromel,et al.  Interactive and descriptor-based deployment of object-oriented grid applications , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[9]  William E. Allcock,et al.  The globus extensible input/output system (XIO): a protocol independent IO system for the grid , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[10]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[11]  Denis Caromel,et al.  Overlapping Communication with Computation in Distributed Object Systems , 1999, HPCN Europe.

[12]  Ian T. Foster,et al.  Data management and transfer in high-performance computational grid environments , 2002, Parallel Comput..

[13]  Gregor von Laszewski,et al.  A File Transfer Component for Grids , 2003, PDPTA.

[14]  Denis Caromel,et al.  Toward a method of object-oriented concurrent programming , 1993, CACM.

[15]  María S. Pérez-Hernández,et al.  MAPFS-Grid: A Flexible Architecture for Data-Intensive Grid Applications , 2003, European Across Grids Conference.