Efficient data consolidation in grid networks and performance analysis

We examine a task scheduling and data migration problem for grid networks, which we refer to as the Data Consolidation (DC) problem. DC arises when a task concurrently requests multiple pieces of data, possibly scattered throughout the grid network, that have to be present at a selected site before the task's execution starts. In such a case, the scheduler and the data manager must select (i) the data replicas to be used, (ii) the site where these data will be gathered for the task to be executed, and (iii) the routing paths to be followed; this is assuming that the selected datasets are transferred concurrently to the execution site. The algorithms or policies for selecting the data replicas, the data consolidating site and the corresponding paths comprise a Data Consolidation scheme. We propose and experimentally evaluate several DC schemes of polynomial number of operations that attempt to estimate the cost of the concurrent data transfers, to avoid congestion that may appear due to these transfers and to provide fault tolerance. Our simulation results strengthen our belief that DC is an important problem that needs to be addressed in the design of data grids, and can lead, if performed efficiently, to significant benefits in terms of task delay, network load and other performance parameters.

[1]  A. D. Meglio,et al.  Programming the Grid with gLite , 2006 .

[2]  P. Sadayappan,et al.  Distributed job scheduling on computational Grids using multiple simultaneous requests , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[3]  Reda Alhajj,et al.  Replica Placement Strategies in Data Grid , 2008, Journal of Grid Computing.

[4]  Richard Wolski,et al.  The network weather service: a distributed resource performance forecasting service for metacomputing , 1999, Future Gener. Comput. Syst..

[5]  Reda Alhajj,et al.  A Predictive Technique for Replica Selection in Grid Environment , 2007, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07).

[6]  Pablo Rodriguez,et al.  Dynamic parallel access to replicated content in the internet , 2002, TNET.

[8]  Rajkumar Buyya,et al.  A taxonomy and survey of grid resource management systems for distributed computing , 2002, Softw. Pract. Exp..

[9]  Kurt Stockinger,et al.  OptorSim-A Grid Simulator for Studying Dynamic Data Replication Strategies , 2003 .

[10]  David Abramson,et al.  Scheduling parameter sweep applications on global Grids: a deadline and budget constrained cost–time optimization algorithm , 2005, Softw. Pract. Exp..

[11]  Michael Mitzenmacher,et al.  Accessing multiple mirror sites in parallel: using Tornado codes to speed up downloads , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[12]  Emmanouel A. Varvarigos,et al.  Fair Scheduling Algorithms in Grids , 2007, IEEE Transactions on Parallel and Distributed Systems.

[13]  Rajkumar Buyya,et al.  Fair resource sharing in hierarchical virtual organizations for global grids , 2007, 2007 8th IEEE/ACM International Conference on Grid Computing.

[14]  Daniel S. Katz,et al.  ASTRONOMICAL IMAGE MOSAICKING ON A GRID: INITIAL EXPERIENCES∗ , 2004 .

[15]  L. R. Esau,et al.  On Teleprocessing System Design Part II: A Method for Approximating the Optimal Network , 1966, IBM Syst. J..

[16]  Albert Y. Zomaya,et al.  Intelligent Scheduling and Replication in Datagrids: a Synergistic Approach , 2007, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07).

[17]  Laurence T. Yang,et al.  Engineering the grid - status and perspective , 2006 .

[18]  Javier Jaén Martínez,et al.  Data Management in an International Data Grid Project , 2000, GRID.

[19]  Leonid Oliker,et al.  Scheduling in Heterogeneous Grid Environments: The Effects of DataMigration , 2004 .

[20]  Reda Alhajj,et al.  Study of Different Replica Placement and Maintenance Strategies in Data Grid , 2007, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07).

[21]  Klara Nahrstedt,et al.  A distributed resource management architecture that supports advance reservations and co-allocation , 1999, 1999 Seventh International Workshop on Quality of Service. IWQoS'99. (Cat. No.98EX354).

[22]  Emmanouel A. Varvarigos,et al.  A framework for providing hard delay guarantees and user fairness in Grid computing , 2009, Future Gener. Comput. Syst..

[23]  Ladislau Bölöni,et al.  A Comparison of Eleven Static Heuristics for Mapping a Class of Independent Tasks onto Heterogeneous Distributed Computing Systems , 2001, J. Parallel Distributed Comput..

[24]  Shubhashis Sengupta,et al.  Integration of Scheduling and Replication in Data Grids , 2004, HiPC.

[25]  Emmanouel A. Varvarigos,et al.  Adjusted fair scheduling and non-linear workload prediction for QoS guarantees in grid computing , 2007, Comput. Commun..

[26]  R. EsauL.,et al.  On teleprocessing system design , 1966 .

[27]  Atakan Dogan,et al.  A study on performance of dynamic file replication algorithms for real-time file access in Data Grids , 2009, Future Gener. Comput. Syst..

[28]  William E. Johnston,et al.  QoS as middleware: bandwidth reservation system design , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[29]  Kurt Stockinger,et al.  Simulation of Dynamic Grid Replication Strategies in OptorSim , 2002, GRID.

[30]  Peter A. Dinda,et al.  Online Prediction of the Running Time of Tasks , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[31]  Emmanouel A. Varvarigos,et al.  Developing Scheduling Policies in gLite Middleware , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[32]  Emmanouel A. Varvarigos,et al.  Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks , 2008, 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID).

[33]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[34]  Kavitha Ranganathan,et al.  Decoupling computation and data scheduling in distributed data-intensive applications , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[35]  Chris Develder,et al.  Multi-cost job routing and scheduling in Grid networks , 2009, Future Gener. Comput. Syst..

[36]  Ruay-Shiung Chang,et al.  A multiple parallel download scheme with server throughput and client bandwidth considerations for data grids , 2008, Future Gener. Comput. Syst..

[37]  Henri Casanova,et al.  An Evaluation of Job Scheduling Strategies for Divisible Loads on Grid Platforms , 2006 .

[38]  Paul Millar,et al.  OptorSim : a Simulation Tool for Scheduling and Replica Optimisation in Data Grids , 2005 .

[39]  Ian T. Foster,et al.  Replica selection in the Globus Data Grid , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.