Towards Autonomic Grid Data Management with Virtualized Distributed File Systems

Grid data management is a challenging task because of the heterogeneous, dynamic and largescale nature of Grid environments. This paper proposes an autonomic Grid data management architecture based on virtualized distributed file systems and WSRF-compliant management services. Autonomic functions are integrated into the services to provide self-managing control over the different entities of Grid-wide file system sessions in accordance with high-level objectives and operate together to automatically achieve the desired data provisioning behaviors. Important autonomic features are designed and implemented in this system in the aspects of cache configuration, data replication and session redirection. Experiments with a prototype demonstrate that this architecture can automatically and substantially improve the performance and reliability of Grid data access.

[1]  Chris Peterson,et al.  Implementing a Performance Forecasting System for Metacomputing The Network Weather Service , 1997, ACM/IEEE SC 1997 Conference (SC'97).

[2]  David A. Patterson,et al.  Computer Architecture - A Quantitative Approach, 5th Edition , 1996 .

[3]  Brent Callaghan,et al.  NFS Illustrated , 1999 .

[4]  Xiaomin Zhu,et al.  From virtualized resources to virtual computing grids: the In-VIGO system , 2005, Future Gener. Comput. Syst..

[5]  Renato J. O. Figueiredo,et al.  Supporting application-tailored grid file system sessions with WSRF-based services , 2005, HPDC-14. Proceedings. 14th IEEE International Symposium on High Performance Distributed Computing, 2005..

[6]  Amin Vahdat,et al.  Consistent and automatic replica regeneration , 2004, TOS.

[7]  Jing Xu,et al.  Towards Autonomic Virtual Applications in the In-VIGO System , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[8]  Andrew S. Grimshaw,et al.  Grid-based file access: the Legion I/O model , 2000, Proceedings the Ninth International Symposium on High-Performance Distributed Computing.

[9]  Stephen P. Morgan,et al.  Clockwork: A new movement in autonomic systems , 2003, IBM Syst. J..

[10]  Gustavo Alonso,et al.  GlobeDB: autonomic data replication for web applications , 2005, WWW '05.

[11]  Keith Marzullo,et al.  Supplying high availability with a standard network file system , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[12]  Richard Wolski,et al.  Dynamically forecasting network performance using the Network Weather Service , 1998, Cluster Computing.

[13]  Peter A. Dinda,et al.  Modeling and taming parallel TCP on the wide area network , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[14]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[15]  Miron Livny,et al.  A framework for self-optimizing, fault-tolerant, high performance bulk data transfers in a heterogeneous grid environment , 2003, Second International Symposium on Parallel and Distributed Computing, 2003. Proceedings..

[16]  Aamer Sachedina,et al.  Policy-Based Autonomic Storage Allocation , 2003, DSOM.

[17]  Renato J. O. Figueiredo,et al.  Application-Tailored Cache Consistency for Wide-Area File Systems , 2006, 26th IEEE International Conference on Distributed Computing Systems (ICDCS'06).

[18]  Carl Kesselman,et al.  Wide area data replication for scientific collaborations , 2005, Int. J. High Perform. Comput. Netw..

[19]  Andrea C. Arpaci-Dusseau,et al.  Explicit Control in the Batch-Aware Distributed File System , 2004, NSDI.

[20]  Jeffrey O. Kephart,et al.  The Vision of Autonomic Computing , 2003, Computer.

[21]  Ian T. Foster,et al.  Data management and transfer in high-performance computational grid environments , 2002, Parallel Comput..

[22]  Renato J. O. Figueiredo,et al.  Distributed File System Virtualization Techniques Supporting On-Demand Virtual Machine Environments for Grid Computing , 2006, Cluster Computing.

[23]  Ian T. Foster,et al.  GASS: a data movement and access service for wide area computing systems , 1999, IOPADS '99.

[24]  Amin Vahdat,et al.  Interposed request routing for scalable network storage , 2000, TOCS.

[25]  Lili Qiu,et al.  On the placement of Web server replicas , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[26]  Steve R. White,et al.  An architectural approach to autonomic computing , 2004, International Conference on Autonomic Computing, 2004. Proceedings..

[27]  ZhangJian,et al.  From virtualized resources to virtual computing grids , 2005 .

[28]  David A. Patterson,et al.  Computer Architecture: A Quantitative Approach , 1969 .

[29]  Chen Ding,et al.  Miss rate prediction across all program inputs , 2003, 2003 12th International Conference on Parallel Architectures and Compilation Techniques.

[30]  Magnus Karlsson,et al.  Taming aggressive replication in the Pangaea wide-area file system , 2002, OPSR.

[31]  Renato J. O. Figueiredo,et al.  Seamless Access to Decentralized Storage Services in Computational Grids via a Virtual File System , 2004, Cluster Computing.