Comparison of Aperiodic Checkpoint Placement Algorithms

In this article we summarize several aperiodic checkpoint placement algorithms for a software system operating over an infinite time horizon and compare them in terms of computational accuracy. The underlying problem is to determine the aperiodic checkpoint sequence that maximizes the steady-state system availability, under the assumption that the system failure time follows a non-exponential distribution. In a numerical example with a Weibull system failure time distribution, it is shown that the approximate solutions tend to underestimate the resulting system availability.
