Data Intensive Distributed Computing; A Medical Application Example

Modern scientific computing involves organizing, moving, visualizing, and analyzing massive amounts of data from around the world, as well as employing large-scale computation. The distributed systems that solve large-scale problems will always involve aggregating and scheduling many resources. Data must be located and staged, cache and network capacity must be available at the same time as computing capacity, etc. Every aspect of such a system is dynamic: locating and scheduling resources, adapting running application systems to availability and congestion in the middleware and infrastructure, responding to human interaction, etc. The technologies, the middleware services, and the architectures that are used to build useful high-speed, wide area distributed systems, constitute the field of data intensive computing. This paper explores some of the history and future directions of that field, and describes a specific medical application example.

[1]  Andrew S. Grimshaw,et al.  Metasystems , 1998, CACM.

[2]  Ami Marowka,et al.  The GRID: Blueprint for a New Computing Infrastructure , 2000, Parallel Distributed Comput. Pract..

[3]  I. Richer,et al.  The MAGlC project: from vision to reality , 1996 .

[4]  Parvati Dev,et al.  Digital Libraries In Medicine , 1998, Proceedings. 1998 IEEE International Conference on Information Technology Applications in Biomedicine, ITAB '98 (Cat. No.98EX188).

[5]  William E. Johnston,et al.  The NetLogger methodology for high performance distributed systems performance analysis , 1998, Proceedings. The Seventh International Symposium on High Performance Distributed Computing (Cat. No.98TB100244).

[6]  William E. Johnston,et al.  Distributed health care imaging information systems , 1997, Medical Imaging.

[7]  William E. Johnston,et al.  Performance Analysis in High-Speed Wide Area IP over ATM Networks: Top-to-Bottom End-to-End Monitoring , 1996 .

[8]  William E. Johnston,et al.  The NetLogger Methodology for High Performance Distributed Systems Performance Analysis , 1999 .