Design, implementation, and performance of checkpointing in NetSolve
暂无分享,去创建一个
[1] Micah Beck,et al. Libckpt: Transparent Checkpointing under Unix Libckpt: Transparent Checkpointing under Unix , 1995 .
[2] Roy Friedman,et al. Starfish: Fault-Tolerant Dynamic MPI Programs on Clusters of Workstations , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).
[3] Jack Dongarra,et al. The use of Java in the NetSolve project , 1997 .
[4] James S. Plank,et al. Experimental assessment of workstation failures and their impact on checkpointing systems , 1998, Digest of Papers. Twenty-Eighth Annual International Symposium on Fault-Tolerant Computing (Cat. No.98CB36224).
[5] L. Alvisi,et al. A Survey of Rollback-Recovery Protocols , 2002 .
[6] Willy Zwaenepoel,et al. On the use and implementation of message logging , 1994, Proceedings of IEEE 24th International Symposium on Fault- Tolerant Computing.
[7] Georg Stellner,et al. CoCheck: checkpointing and process migration for MPI , 1996, Proceedings of International Conference on Parallel Processing.
[8] Miron Livny,et al. Managing Checkpoints for Parallel Programs , 1996, JSSPP.
[9] Richard Wolski,et al. The network weather service: a distributed resource performance forecasting service for metacomputing , 1999, Future Gener. Comput. Syst..
[10] Yi-Min Wang,et al. Checkpointing and its applications , 1995, Twenty-Fifth International Symposium on Fault-Tolerant Computing. Digest of Papers.
[11] Kai Li,et al. CLIP: A Checkpointing Tool for Message Passing Parallel Programs , 1997, ACM/IEEE SC 1997 Conference (SC'97).
[12] D. Manivannan,et al. Quasi-Synchronous Checkpointing: Models, Characterization, and Classification , 1999, IEEE Trans. Parallel Distributed Syst..
[13] Kai Li,et al. Libckpt: Transparent Checkpointing under UNIX , 1995, USENIX.
[14] Henri Casanova,et al. Netsolve: a Network-Enabled Server for Solving Computational Science Problems , 1997, Int. J. High Perform. Comput. Appl..
[15] Micah Beck,et al. The Internet Backplane Protocol: Storage in the Network , 1999 .
[16] Jonathan Walpole,et al. MIST: PVM with Transparent Migration and Checkpointing , 1995 .
[17] Willy Zwaenepoel,et al. The performance of consistent checkpointing , 1992, [1992] Proceedings 11th Symposium on Reliable Distributed Systems.
[18] Nitin H. Vaidya,et al. Impact of Checkpoint Latency on Overhead Ratio of a Checkpointing Scheme , 1997, IEEE Trans. Computers.
[19] Volker Strumpen,et al. Portable Checkpointing and Recovery in Heterogeneous Environments , 1996 .
[20] Willy Zwaenepoel,et al. Manetho: Transparent Rollback-Recovery with Low Overhead, Limited Rollback, and Fast Output Commit , 1992, IEEE Trans. Computers.