Distributed Multi-Level Recovery in Main-Memory Databases

The authors present two schemes for concurrency control and recovery in distributed main-memory databases. In the client-server scheme, clients ship log records to the server, which applies the updates to its database copy. In the shared disk scheme, each site broadcasts its updates to other sites. The above enable the schemes to support concurrent updates to the same page at different sites. Both schemes support an explicit multi-level recovery abstraction for high concurrency, reduced disk I/O by writing only redo log records to disk during normal processing, and use of per-transaction redo and undo logs to reduce contention. Further, they use a fuzzy checkpointing scheme that writes only dirty pages to disk, yet minimally interferes with normal processing, not requiring updaters to even acquire a latch before updating a page.

[1]  Hector Garcia-Molina,et al.  System M: A Transaction Processing Testbed for Memory Resident Data , 1990, IEEE Trans. Knowl. Data Eng..

[2]  S. Sudarshan,et al.  Dalí: A High Performance Main Memory Storage Manager , 1994, VLDB.

[3]  Erhard Rahm,et al.  Recovery concepts for data sharing systems , 1991, [1991] Digest of Papers. Fault-Tolerant Computing: The Twenty-First International Symposium.

[4]  C. Mohan,et al.  ARIES/CSA: a method for database recovery in client-server architectures , 1994, SIGMOD '94.

[5]  S. Sudarshan,et al.  Recovering from Main-Memory Lapses , 1993, VLDB.