HGRID: Fault Tolerant, Log2N Resource Management for Grids

Grid Resource Discovery Service is currently a very important focus of research. We propose a scheme that presents essential characteristics for efficient, self-configuring and fault-tolerant resource discovery and is able to handle dynamic attributes, such as memory capacity. Our approach consists of an overlay network with a hypercube topology connecting the grid nodes and a scalable, fault-tolerant, self-configuring search algorithm. By design, the algorithm improves the probability of reaching all working nodes in the system even in the presence of failures (inaccessible, crashed or heavy loaded nodes). We analyze the static resilience of the presented approach, that is to say, how well the algorithm is able to find resources without having to update the routing tables. The results show that the presented approach has a high static resilience.

[1]  Luis Díaz de Cerio,et al.  HGRID: An Adaptive Grid Resource Discovery , 2008, 2008 International Conference on Complex, Intelligent and Software Intensive Systems.

[2]  Jörg Liebeherr,et al.  HyperCast: A Protocol for Maintaining Multicast Group Members in a Logical Hypercube Topology , 1999, Networked Group Communication.

[3]  Steven Tuecke,et al.  The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration , 2002 .

[4]  Jörg Liebeherr,et al.  A scalable control topology for multicast communications , 1998, Proceedings. IEEE INFOCOM '98, the Conference on Computer Communications. Seventeenth Annual Joint Conference of the IEEE Computer and Communications Societies. Gateway to the 21st Century (Cat. No.98.

[5]  Elias P. Duarte,et al.  HyperBone: Uma Rede Overlay Baseada em Hipercubo Virtual sobre a Internet , 2005 .

[6]  Luisa Gargano,et al.  Fault - tolerant hypercube broadcasting via information dispersal , 1993, Networks.

[7]  Ian Foster,et al.  On Fully Decentralized Resource Discovery in Grid Environments , 2001, GRID.

[8]  Krishna P. Gummadi,et al.  The impact of DHT routing geometry on resilience and proximity , 2003, SIGCOMM '03.

[9]  Zhong Liu,et al.  A Hyper-cube based P2P Information Service for Data Grid , 2006, 2006 Fifth International Conference on Grid and Cooperative Computing (GCC'06).

[10]  Y. Chang Fault tolerant broadcasting in SIMD hypercubes , 1993, Proceedings of 1993 5th IEEE Symposium on Parallel and Distributed Processing.

[11]  Ian T. Foster,et al.  Grid information services for distributed resource sharing , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.