A faster checkpointing and recovery algorithm with a hierarchical storage approach
[1] Georg Stellner, et al. CoCheck: checkpointing and process migration for MPI, 1996, Proceedings of International Conference on Parallel Processing.
[2] Y. Ishikawa. RWC PC Cluster II and SCore Cluster System Software-High Performance Linux Cluster, 1999.
[3] Takashi Nanya, et al. Evaluation of Checkpointing Mechanism on SCore Cluster System, 2003.
[4] Kai Li, et al. Faster checkpointing with N+1 parity, 1994, Proceedings of IEEE 24th International Symposium on Fault-Tolerant Computing.
[5] Christian Engelmann, et al. A diskless checkpointing algorithm for super-scale architectures applied to the fast Fourier transform, 2003, Proceedings of the International Workshop on Challenges of Large Applications in Distributed Environments, 2003.
[6] Thomas Hérault, et al. MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile Nodes, 2002, ACM/IEEE SC 2002 Conference (SC'02).