Robustness benchmarking for hardware maintenance events

This paper proposes a method to measure a specific aspect of a system's robustness - the handling of maintenance events. Past research in the area of robustness benchmarking has focused exclusively on injecting a realistic faultload into a system and examining the system's robustness against the set of injected faults. However, faults are not the only events that can cause a system outage. Maintenance events, such as the replacement of a failed hardware component or the installation of a software patch, may also result in a system outage. In this paper, we describe a benchmark for measuring a system's robustness against a class of hardware maintenance events.

[1]  Philip Koopman,et al.  Dependability Benchmarking: making choices in an n-dimensional problem space , 2001 .

[2]  Daniel P. Siewiorek,et al.  Measuring Software Dependability by Robustness Benchmarking , 1997, IEEE Trans. Software Eng..

[3]  Joe Marshall,et al.  Measuring robustness of a fault tolerant aerospace system , 1995, Twenty-Fifth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[4]  Jean Arlat,et al.  A Framework for Dependability Benchmarking , 2002 .

[5]  David A. Patterson,et al.  Towards Availability Benchmarks: A Case Study of Software RAID Systems , 2000, USENIX Annual Technical Conference, General Track.

[6]  Brian Randell,et al.  Fundamental Concepts of Dependability , 2000 .

[7]  Barton P. Miller,et al.  An empirical study of the reliability of UNIX utilities , 1990, Commun. ACM.

[8]  Ravishankar K. Iyer,et al.  An approach towards benchmarking of fault-tolerant commercial systems , 1996, Proceedings of Annual Symposium on Fault Tolerant Computing.

[9]  Daniel P. Siewiorek,et al.  Development of a benchmark to measure system robustness , 1993, FTCS-23 The Twenty-Third International Symposium on Fault-Tolerant Computing.

[10]  Daniel P. Siewiorek,et al.  Comparing operating systems using robustness benchmarks , 1997, Proceedings of SRDS'97: 16th IEEE Symposium on Reliable Distributed Systems.

[11]  Ji Zhu,et al.  R-cubed (R 3 ): rate, robustness, and recovery - an availability benchmark framework , 2002 .