On page-based optimistic process checkpointing

Persistent object systems must provide some form of checkpointing to ensure that changes to persistent data are secured on non-volatile storage. When processes share or exchange modified data, mechanisms must be provided to ensure that they may be consistently checkpointed. This may be performed eagerly by synchronously checkpointing all dependent data. Alternatively, optimistic techniques may be used where processes are individually checkpointed and globally consistent states are found asynchronously. This paper examines two eager checkpointing techniques and describes a new optimistic technique. The technique is applicable in systems such as SASOS, where the notion of process and address space are decoupled.<<ETX>>

[1]  Raymond A. Lorie,et al.  Physical integrity in a large segmented database , 1977, TODS.

[2]  Kun-Lung Wu,et al.  Recoverable Distributed Shared Virtual Memory , 1990, IEEE Trans. Computers.

[3]  Rasool Jalili,et al.  Using directed graphs to describe entity dependency in stable distributed persistent stores , 1995, Proceedings of the Twenty-Eighth Annual Hawaii International Conference on System Sciences.

[4]  Willy Zwaenepoel,et al.  The performance of consistent checkpointing , 1992, [1992] Proceedings 11th Symposium on Reliable Distributed Systems.

[5]  John Rosenberg,et al.  The grand unified theory of address spaces , 1995, Proceedings 5th Workshop on Hot Topics in Operating Systems (HotOS-V).

[6]  Alan Dearle,et al.  Casper: A Cached Architecture Supporting Persistence , 1992, Comput. Syst..

[7]  Frans Henskens,et al.  A Capability-based Distributed Shared Memory , 1991 .

[8]  David B. Johnson,et al.  Recovery in Distributed Systems Using Optimistic Message Logging and Checkpointing , 1988, J. Algorithms.

[9]  Frans Henskens Addressing moved modules in a capability-based distributed shared memory , 1992, Proceedings of the Twenty-Fifth Hawaii International Conference on System Sciences.

[10]  Michael F. Challis Database Consistency and Integrity in a Multi-User Environment , 1978, JCDKB.

[11]  Colin J. Fidge,et al.  Timestamps in Message-Passing Systems That Preserve the Partial Ordering , 1988 .

[12]  Frans Henskens,et al.  A model for user-level memory management in a distributed, persistent environment , 1994 .

[13]  David B. Johnson,et al.  Efficient transparent optimistic rollback recovery for distributed application programs , 1993, Proceedings of 1993 IEEE 12th Symposium on Reliable Distributed Systems.

[14]  John Rosenberg,et al.  MONADS-PC - a capability-based workstation to support software engineering , 1985 .

[15]  Leslie Lamport,et al.  Time, clocks, and the ordering of events in a distributed system , 1978, CACM.

[16]  John Rosenberg,et al.  Grasshopper: An Orthogonally Persistent Operating System , 1994, Comput. Syst..