Replica Based Distributed Metadata Management in Grid Environment

Metadata management is one of the key techniques in data grid. It is required to achieve two goals: high efficiency and availability. This paper presents a Replication Based Metadata Management System (RBMMS) as metadata server implemented in Global Distributed Storage System (GDSS). To address the above two goals RBMMS maintains a spares strongly connected graph to describe replica structure and relations among the replicas. The graph is used to propagate updating information and replica discovery in the process of replica addition and removal. Cache module is also implemented in RBMMS to further improve the performance of metadata access. The evaluation demonstrates that RBMMS attains high availability and efficiency of metadata management system.

[1]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[2]  Craig Lee,et al.  Grid Computing — GRID 2001: Second International Workshop Denver, CO, USA, November 12, 2001 Proceedings , 2001, Lecture Notes in Computer Science.

[3]  Jin Hai,et al.  Architecture Design of Global Distributed Storage System for Data Grid , 2003 .

[4]  Kavitha Ranganathan,et al.  Identifying Dynamic Replication Strategies for a High-Performance Data Grid , 2001, GRID.

[5]  Erwin Laure,et al.  Replica Management in Data Grids , 2002 .

[6]  Magnus Karlsson,et al.  Choosing replica placement heuristics for wide-area systems , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[7]  E. Deelman,et al.  Data replication strategies in grid environments , 2002, Fifth International Conference on Algorithms and Architectures for Parallel Processing, 2002. Proceedings..

[8]  Jon B. Weissman,et al.  An Adaptive Service Grid Architecture Using Dynamic Replica Management , 2001, GRID.

[9]  M. Humphrey,et al.  LegionFS: A Secure and Scalable File System Supporting Cross-Domain High-Performance Applications , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[10]  M. Makpangou,et al.  A scalable replica selection strategy based on flexible contracts , 2003, Proceedings the Third IEEE Workshop on Internet Applications. WIAPP 2003.

[11]  Antony I. T. Rowstron,et al.  Squirrel: a decentralized peer-to-peer web cache , 2002, PODC '02.

[12]  Ian T. Foster,et al.  Data management and transfer in high-performance computational grid environments , 2002, Parallel Comput..

[13]  Ian Foster,et al.  The Grid: A New Infrastructure for 21st Century Science , 2002 .