Improving Fault Tolerance in Desktop Grids Based On Incremental Checkpointing
[1] Fabio Kon, et al. Strategies for storage of checkpointing data using non-dedicated repositories on Grid systems, 2005, MGC '05.
[2] Andrew S. Grimshaw, et al. Using Reflection for Incorporating Fault-Tolerance Techniques into Distributed Applications, 1998, Parallel Process. Lett.
[3] David P. Anderson, et al. BOINC: a system for public-resource computing and storage, 2004, Fifth IEEE/ACM International Workshop on Grid Computing.
[4] Miron Livny, et al. Checkpoint and Migration of UNIX Processes in the Condor Distributed Processing System, 1997.
[5] Kai Li, et al. Libckpt: Transparent Checkpointing under UNIX, 1995, USENIX.
[6] Lei Gao, et al. PRACTI Replication for Large-Scale Systems, 2004.
[7] L. Alvisi, et al. A Survey of Rollback-Recovery Protocols, 2002.
[8] Song Jiang, et al. Transparent, Incremental Checkpointing at Kernel Level: a Foundation for Fault Tolerance for Parallel Computers, 2005, ACM/IEEE SC 2005 Conference (SC'05).
[9] Robert Hood, et al. Use-Cases for Grid Checkpoint and Recovery, 2007.
[10] Gilles Fedak, et al. XtremWeb: a generic global computing system, 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.
[11] Hua Zhong, et al. CRAK: Linux Checkpoint/Restart As a Kernel Module, 1996.