Enhancement of Anticipative Recursively-Adjusting Mechanism for Redundant Parallel File Transfer in Data Grids

In data grid, co-allocation architecture can be used to enable parallel transferring of data file from multiple replicas which stored in the different grid sites. Some schemes base on co-allocation model were proposed and used to exploit the different transfer rates among various client-server network links and to adapt dynamic rate fluctuations by dividing data into fragment. These schemes showed the more fragments used the more performance conducted when data transfer in parallel with evidence. In our previous work, we propose a scheme named anticipative recursively-adjusting mechanism (ARAM) in previous research work. The best thing is performance tuning through the alpha value, it¿s rely on special feature to adapt different network situations in a data grid environment. In this paper, the TCP bandwidth estimation model (TCPBEM) is used to evaluate dynamic link state by detect TCP throughput and packet lost rate between grid nodes. We integrate the model into ARAM, called anticipative recursively-adjusting mechanism plus (ARAM+), that can be more reliable and reasonable then previous one. In the meanwhile, we also design a burst mode which could increase transfer rate of ARAM+. This approach not only adapts worst network link but also speedup the overall performance.

[1]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[2]  Ching-Hsien Hsu,et al.  Redundant Parallel File Transfer with Anticipative Recursively-Adjusting Scheme in Data Grids , 2007, ICA3PP.

[3]  Yoshiaki Katayama,et al.  Dynamic Co-allocation Scheme for Parallel Data Transfer in Grid Environment , 2005, 2005 First International Conference on Semantics, Knowledge and Grid.

[4]  Vassilis Tsaoussidis,et al.  Approaches to Congestion Control in packet networks , 2007 .

[5]  Chao-Tung Yang,et al.  Implementation of a Cyber Transformer for Parallel Download in Co-Allocation Data Grid Environments , 2008, 2008 Seventh International Conference on Grid and Cooperative Computing.

[6]  Ching-Hsien Hsu,et al.  A Recursively-Adjusting Co-allocation scheme with a Cyber-Transformer in Data Grids , 2009, Future generations computer systems.

[7]  Ian T. Foster,et al.  Replica selection in the Globus Data Grid , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[8]  Donald F. Towsley,et al.  Modeling TCP throughput: a simple model and its empirical validation , 1998, SIGCOMM '98.

[9]  Matthew Mathis,et al.  The macroscopic behavior of the TCP congestion avoidance algorithm , 1997, CCRV.

[10]  Thomas R. Gross,et al.  Bandwidth modelling for network-aware applications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[11]  Ian T. Foster,et al.  Data management and transfer in high-performance computational grid environments , 2002, Parallel Comput..

[12]  Ching-Hsien Hsu,et al.  Performance Analysis of Applying Replica Selection Technology for Data Grid Environments , 2005, PaCT.

[13]  Yoshiaki Katayama,et al.  A Framework for an Integrated Co-allocator for Data Grid in Multi-Sender Environment , 2007, IEICE Trans. Commun..

[14]  Chao-Tung Yang,et al.  Implementation of a dynamic adjustment mechanism with efficient replica selection in data grid environments , 2006, SAC '06.

[15]  Jennifer M. Schopf,et al.  Predicting sporadic grid data transfers , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[16]  Brian Tierney,et al.  File and Object Replication in Data Grids , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[17]  Ian T. Foster,et al.  Grid information services for distributed resource sharing , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[18]  Chao-Tung Yang,et al.  A Dynamic Adjustment Strategy for File Transformation in Data Grids , 2007, NPC.

[19]  Peter Z. Kunszt,et al.  Giggle: A Framework for Constructing Scalable Replica Location Services , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[20]  Ian T. Foster,et al.  The anatomy of the grid: enabling scalable virtual organizations , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[21]  Frank Kelly,et al.  Fairness and Stability of End-to-End Congestion Control , 2003, Eur. J. Control.

[22]  Brian D. Noble,et al.  The end-to-end performance effects of parallel TCP sockets on a lossy wide-area network , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[23]  Ian T. Foster,et al.  Resource co-allocation in computational grids , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[24]  Ian T. Foster,et al.  Predicting the performance of wide area data transfers , 2002, Proceedings 16th International Parallel and Distributed Processing Symposium.

[25]  Jennifer M. Schopf,et al.  Using Regression Techniques to Predict Large Data Transfers , 2003, Int. J. High Perform. Comput. Appl..

[26]  Javier Jaén Martínez,et al.  Data Management in an International Data Grid Project , 2000, GRID.

[27]  Chien-Min Wang,et al.  Efficient multi-source data transfer in data grids , 2006, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06).

[28]  Ian T. Foster,et al.  The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets , 2000, J. Netw. Comput. Appl..

[29]  Sudharshan S. Vazhkudai Enabling the co-allocation of grid data transfers , 2003, Proceedings. First Latin American Web Congress.

[30]  Rajkumar Buyya,et al.  A taxonomy of Data Grids for distributed data sharing, management, and processing , 2005, CSUR.

[31]  Ian T. Foster,et al.  Improving parallel data transfer times using predicted variances in shared networks , 2005, CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, 2005..

[32]  Eitan Altman,et al.  Parallel TCP Sockets: Simple Model, Throughput and Validation , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[33]  Chao-Tung Yang,et al.  Improvements on dynamic adjustment mechanism in co-allocation data grid environments , 2007, The Journal of Supercomputing.

[34]  Jennifer M. Schopf,et al.  A performance study of monitoring and information services for distributed systems , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.