Towards Development of Risk-based Checkpointing Scheme Via Parametric Bootstrapping

Optimal checkpoint placement is a widely used technique that generates the checkpoint time sequence minimizing the expected system cost, which comprises the recovery overhead incurred after a system failure and the overhead of checkpointing itself. In particular, when the system failure time follows an exponential distribution, the optimal checkpoint interval is periodic and is characterized by the unique solution of a non-linear equation. In this paper, we focus on the statistical estimation problem: given system failure time data, we consider the probability distributions of the estimators of the system failure rate, the optimal checkpoint interval, and the associated minimum expected system cost. Using parametric bootstrapping, we develop a novel approach to estimating the higher moments and confidence intervals of these estimators. This enables a risk-based optimal checkpoint placement policy that accounts for estimation error.
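The estimation procedure described above can be illustrated with a minimal sketch. The abstract does not state the cost model, so the sketch assumes a common textbook one that is not necessarily the paper's: the expected time to complete a work segment of length t followed by a checkpoint of overhead c, with exponential failures of rate lam and a restart overhead r, is (1/lam + r)(exp(lam(t + c)) - 1), and the cost is this time per unit of useful work. Under that assumption the optimal interval is the unique root of 1 - lam t - exp(-lam(t + c)) = 0 on (0, 1/lam). The function names, parameter values, and the percentile-type confidence interval are all illustrative choices.

```python
import math
import random
import statistics


def optimal_interval(lam, c):
    """Root of h(t) = 1 - lam*t - exp(-lam*(t + c)) on (0, 1/lam), for c > 0.

    h is strictly decreasing with h(0) > 0 > h(1/lam), so bisection converges
    to the unique root. This equation comes from the ASSUMED cost model.
    """
    lo, hi = 0.0, 1.0 / lam
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if 1.0 - lam * mid - math.exp(-lam * (mid + c)) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def expected_cost(lam, c, r, t):
    """ASSUMED cost: expected elapsed time per unit of useful work."""
    return (1.0 / lam + r) * math.expm1(lam * (t + c)) / t


def percentile(sorted_vals, q):
    """Nearest-rank percentile of an already sorted list."""
    idx = int(round(q * (len(sorted_vals) - 1)))
    return sorted_vals[idx]


def parametric_bootstrap(failure_times, c, r, b=2000, alpha=0.05, seed=0):
    """Bootstrap the failure rate, optimal interval, and minimum cost."""
    rng = random.Random(seed)
    n = len(failure_times)
    lam_hat = n / sum(failure_times)  # MLE of the exponential rate
    reps = {"rate": [], "interval": [], "cost": []}
    for _ in range(b):
        # Resample from the FITTED exponential model, then re-estimate.
        sample = [rng.expovariate(lam_hat) for _ in range(n)]
        lam_star = n / sum(sample)
        t_star = optimal_interval(lam_star, c)
        reps["rate"].append(lam_star)
        reps["interval"].append(t_star)
        reps["cost"].append(expected_cost(lam_star, c, r, t_star))
    summary = {}
    for name, vals in reps.items():
        vals.sort()
        mean = statistics.fmean(vals)
        sd = statistics.stdev(vals)
        skew = statistics.fmean([(v - mean) ** 3 for v in vals]) / sd ** 3
        summary[name] = {
            "mean": mean,
            "stdev": sd,
            "skewness": skew,
            "ci": (percentile(vals, alpha / 2), percentile(vals, 1 - alpha / 2)),
        }
    return lam_hat, summary


if __name__ == "__main__":
    # Synthetic failure times, for illustration only.
    rng = random.Random(1)
    data = [rng.expovariate(0.01) for _ in range(50)]
    lam_hat, summary = parametric_bootstrap(data, c=1.0, r=2.0)
    print("estimated rate:", lam_hat)
    for name, stats in summary.items():
        print(name, stats)
```

A risk-based policy in this spirit could then pick the interval from a chosen quantile of the bootstrap distribution instead of the point estimate. How the paper actually defines that policy is not stated in the abstract.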
