Architecture of LA-MPI, a network-fault-tolerant MPI
Mark A. Taylor | Richard L. Graham | Timothy S. Woodall | David J. Daniel | Mitchel W. Sukalski | Rob T. Aulwes | Nehal N. Desai | L. Dean Risinger
[1] Message Passing Interface Forum. MPI: A message-passing interface standard, 1994.
[2] Jon Postel, et al. User Datagram Protocol, 1980, RFC.
[3] Roy Friedman, et al. Starfish: Fault-Tolerant Dynamic MPI Programs on Clusters of Workstations, 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).
[4] Jack J. Dongarra, et al. FT-MPI: Fault Tolerant MPI, Supporting Dynamic Applications in a Dynamic World, 2000, PVM/MPI.
[5] Jack J. Dongarra, et al. Scalable Networked Information Processing Environment (SNIPE), 1997, ACM/IEEE SC 1997 Conference (SC'97).
[6] Craig Partridge, et al. Performance of checksums and CRCs over real data, 1995, SIGCOMM '95.
[7] Miron Livny, et al. Condor-a hunter of idle workstations, 1988, [1988] Proceedings. The 8th International Conference on Distributed.
[8] Ronald Minnich, et al. A Network-Failure-Tolerant Message-Passing System for Terascale Clusters, 2002, ICS '02.
[9] Georg Stellner, et al. CoCheck: checkpointing and process migration for MPI, 1996, Proceedings of International Conference on Parallel Processing.
[10] Wu-chun Feng, et al. The Quadrics Network: High-Performance Clustering Technology, 2002, IEEE Micro.
[11] Ronald Minnich, et al. A network-failure-tolerant message-passing system for terascale clusters, 2002, ICS '02.
[12] Mitchel W. Sukalski, et al. LA-MPI: The Design and Implementation of a Network-Fault-Tolerant MPI for Terascale Clusters.
[13] Thomas Hérault, et al. MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile Nodes, 2002, ACM/IEEE SC 2002 Conference (SC'02).
[14] Message Passing Interface Forum. MPI: A Message-Passing Interface Standard, 1994.