Grid'5000: A Large Scale And Highly Reconfigurable Experimental Grid Testbed

Large scale distributed systems such as Grids are difficult to study from theoretical models and simulators only. Most Grids deployed at large scale are production platforms that are inappropriate research tools because of their limited reconfiguration, control and monitoring capabilities. In this paper, we present Grid'5000, a 5000 CPU nation-wide infrastructure for research in Grid computing. Grid'5000 is designed to provide a scientific tool for computer scientists similar to the large-scale instruments used by physicists, astronomers, and biologists. We describe the motivations, design considerations, architecture, control, and monitoring infrastructure of this experimental platform. We present configuration examples and performance results for the reconfiguration subsystem.

[1]  Xin Liu,et al.  Validating and Scaling the MicroGrid: A Scientific Instrument for Grid Dynamics , 2004, Journal of Grid Computing.

[2]  Jean-Yves L'Excellent,et al.  An Overview of the GRID-TLSE Project , 2004 .

[3]  Thomas Stützle,et al.  A simple and effective iterated greedy algorithm for the permutation flowshop scheduling problem , 2007, Eur. J. Oper. Res..

[4]  Georges Da Costa,et al.  2005 IEEE International Symposium on Cluster Computing and the Grid , 2005, CCGRID.

[5]  David E. Culler,et al.  PlanetLab: an overlay testbed for broad-coverage services , 2003, CCRV.

[6]  Eddy Caron,et al.  A Monitoring and Visualization Tool and Its Application for a Network Enabled Server Platform , 2006, ICCSA.

[7]  Philippe Augerat,et al.  Scalable monitoring and configuration tools for grids and clusters , 2002, Proceedings 10th Euromicro Workshop on Parallel, Distributed and Network-based Processing.

[8]  Eddy Caron,et al.  Diet: A Scalable Toolbox to Build Network Enabled Servers on the Grid , 2006, Int. J. High Perform. Comput. Appl..

[9]  Satoshi Matsuoka,et al.  Overview of a performance evaluation system for global computing scheduling algorithms , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[10]  Mike Hibler,et al.  An integrated experimental environment for distributed systems and networks , 2002, OSDI '02.

[11]  Ian T. Foster,et al.  GangSim: a simulator for grid scheduling studies , 2005, CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, 2005..

[12]  Henri Casanova,et al.  Scheduling distributed applications: the SimGrid simulation framework , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[13]  Ponsich Antonin,et al.  About the relevance of mathematical programming and stochastic optimisation methods: Application to optimal batch plant design problems , 2005 .

[14]  Olivier Richard,et al.  A tool for environment deployment in clusters and light grids , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.