The performance of consistent checkpointing
暂无分享,去创建一个
Willy Zwaenepoel | E. N. Elnozahy | David B. Johnson | David B. Johnson | E. Elnozahy | W. Zwaenepoel
[1] Brian Randell,et al. System structure for software fault tolerance , 1975, IEEE Transactions on Software Engineering.
[2] David L. Russell,et al. State Restoration in Systems of Communicating Processes , 1980, IEEE Transactions on Software Engineering.
[3] Richard D. Schlichting,et al. Fail-stop processors: an approach to designing fault-tolerant computing systems , 1983, TOCS.
[4] Yuval Tamir,et al. ERROR RECOVERY IN MULTICOMPUTERS USING GLOBAL CHECKPOINTS , 1984 .
[5] Augusto Ciuffoletti,et al. A Distributed Domino-Effect free recovery Algorithm , 1984, Symposium on Reliability in Distributed Software and Database Systems.
[6] Robert E. Strom,et al. Optimistic recovery in distributed systems , 1985, TOCS.
[7] Marvin Theimer,et al. Preemptable remote execution facilities for the V-system , 1985, SOSP '85.
[8] Leslie Lamport,et al. Distributed snapshots: determining global states of distributed systems , 1985, TOCS.
[9] Madalene Spezialetti,et al. Efficient Distributed Snapshots , 1986, ICDCS.
[10] Robert P. Fitzgerald,et al. The integration of virtual memory management and interprocess communication in Accent , 1986, TOCS.
[11] RICHARD KOO,et al. Checkpointing and Rollback-Recovery for Distributed Systems , 1986, IEEE Transactions on Software Engineering.
[12] Hon Fung Li,et al. Optimal Checkpointing and Local Recording for Domino-Free Rollback Recovery , 1987, Inf. Process. Lett..
[13] Ten-Hwang Lai,et al. On Distributed Snapshots , 1987, Inf. Process. Lett..
[14] David R. Cheriton,et al. The V distributed system , 1988, CACM.
[15] Bharat K. Bhargava,et al. Concurrent robust checkpointing and recovery in distributed systems , 1988, Proceedings. Fourth International Conference on Data Engineering.
[16] Bharat K. Bhargava,et al. Independent checkpointing and concurrent rollback for recovery in distributed systems-an optimistic approach , 1988, Proceedings [1988] Seventh Symposium on Reliable Distributed Systems.
[17] Wolfgang Graetsch,et al. Fault tolerance under UNIX , 1989, TOCS.
[18] Wei-Tek Tsai,et al. A low overhead checkpointing and rollback recovery scheme for distributed systems , 1989, Proceedings of the Eighth Symposium on Reliable Distributed Systems.
[19] Luke Lin,et al. Using checkpoints to localize the effects of faults in distributed systems , 1989, Proceedings of the Eighth Symposium on Reliable Distributed Systems.
[20] D. Morris,et al. A non-intrusive checkpointing protocol , 1989, Eighth Annual International Phoenix Conference on Computers and Communications. 1989 Conference Proceedings.
[21] Jeffrey F. Naughton,et al. Real-time, concurrent checkpoint for parallel programs , 1990, PPOPP '90.
[22] Kun-Lung Wu,et al. Recoverable Distributed Shared Virtual Memory , 1990, IEEE Trans. Computers.
[23] Bharat K. Bhargava,et al. Experimental evaluation of concurrent checkpointing and rollback-recovery algorithms , 1990, [1990] Proceedings. Sixth International Conference on Data Engineering.
[24] David B. Johnson,et al. Distributed system fault tolerance using message logging and checkpointing , 1990 .
[25] David B. Johnson,et al. Recovery in Distributed Systems Using Optimistic Message Logging and Checkpointing , 1988, J. Algorithms.
[26] Flaviu Cristian,et al. A timestamp-based checkpointing protocol for long-lived distributed computations , 1991, [1991] Proceedings Tenth Symposium on Reliable Distributed Systems.
[27] Jeffrey F. Naughton,et al. Checkpointing multicomputer applications , 1991, [1991] Proceedings Tenth Symposium on Reliable Distributed Systems.
[28] Henri E. Bal,et al. Transparent fault-tolerance in parallel Orca programs , 1992 .
[29] Mendel Rosenblum,et al. The design and implementation of a log-structured file system , 1991, SOSP '91.