A Multiagent Infrastructure for Data-Intensive Grid Applications

Grid constitutes a new computing paradigm, which inherits a great number of its features from distributed systems. This new paradigm enables resource-sharing across networks, being data one of the most important ones. Data-intensive grid systems are grid applications, whose major goal is to provide efficient access to data. Existing data-intensive applications have been used in several domains, such as physics, climate modeling, biology or visualization. The I/O problem is not completely solved in this kind of applications. This chapter presents MAPFS as a flexible and high-performance platform for data-intensive applications and, more specifically, for data grid applications.

[1]  Koen Holtman Object level physics data replication in the Grid , 2001 .

[2]  Ian T. Foster,et al.  Replica selection in the Globus Data Grid , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[3]  R. Rood,et al.  Parallel Computing at the NASA Data Assimilation Office (DAO) , 1997, ACM/IEEE SC 1997 Conference (SC'97).

[4]  Ian T. Foster,et al.  A Grid-Enabled MPI: Message Passing in Heterogeneous Distributed Computing Systems , 1998, Proceedings of the IEEE/ACM SC98 Conference.

[5]  William I. Nowicki,et al.  NFS: Network File System Protocol specification , 1989, RFC.

[6]  Ron Oldfield,et al.  Armada: a parallel I/O framework for computational grids , 2002, Future Gener. Comput. Syst..

[7]  Donald P. Greenberg,et al.  Implementing a Collaboratory for Microscopic Digital Anatomy , 1996, Int. J. High Perform. Comput. Appl..

[8]  William E. Johnston,et al.  A Network-Aware Distributed Storage Cache for Data Intensive Environments , 1999 .

[9]  L.A. Freitag,et al.  Adaptive, Multiresolution Visualization of Large Data Sets using a Distributed Memory Octree , 1999, ACM/IEEE SC 1999 Conference (SC'99).

[10]  Andrew S. Grimshaw,et al.  Capacity and Capability Computing Using Legion , 2001, International Conference on Computational Science.

[11]  P. Metzger,et al.  Network Working Group , 2000 .

[12]  Ian T. Foster,et al.  The data grid: Towards an architecture for the distributed management and analysis of large scientific datasets , 2000, J. Netw. Comput. Appl..

[13]  Jason Lee,et al.  A network-aware distributed storage cache for data intensive environments , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[14]  Ian T. Foster,et al.  GASS: a data movement and access service for wide area computing systems , 1999, IOPADS '99.

[15]  Ian T. Foster,et al.  Remote I/O: fast access to distant storage , 1997, IOPADS '97.

[16]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[17]  Nicholas R. Jennings,et al.  Intelligent agents: theory and practice , 1995, The Knowledge Engineering Review.

[18]  Michael Wooldridge,et al.  Intelligent agents: theory and practice The Knowledge Engineering Review , 1995 .

[19]  M. Humphrey,et al.  LegionFS: A Secure and Scalable File System Supporting Cross-Domain High-Performance Applications , 2001, ACM/IEEE SC 2001 Conference (SC'01).

[20]  G. F. Hughes Wise drives [hard disk drive] , 2002 .

[21]  Howard Rheingold,et al.  Smart Mobs: The Next Social Revolution , 2002 .

[22]  Ian T. Foster,et al.  Grid Services for Distributed System Integration , 2002, Computer.

[23]  Message P Forum,et al.  MPI: A Message-Passing Interface Standard , 1994 .

[24]  Ian Foster,et al.  The Grid 2 - Blueprint for a New Computing Infrastructure, Second Edition , 1998, The Grid 2, 2nd Edition.

[25]  Douglas Thain,et al.  The Kangaroo approach to data movement on the Grid , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[26]  Nick Roussopoulos,et al.  MOCHA: a self-extensible database middleware system for distributed data sources , 2000, SIGMOD 2000.

[27]  María S. Pérez-Hernández,et al.  A new multiagent based architecture for high performance I/O in clusters , 2001, Proceedings International Conference on Parallel Processing Workshops.

[28]  Ian T. Foster,et al.  Data management and transfer in high-performance computational grid environments , 2002, Parallel Comput..

[29]  Steven Tuecke,et al.  The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration , 2002 .