Common Software-Aging-Related Faults in Fault-Tolerant Systems

In recent years, considerable attention has been paid to software aging, a phenomenon in which the performance of a software system degrades over time. Fault-tolerant software systems, which are designed to provide high assurance, may also suffer from aging. Based on the common software-aging-related faults in fault-tolerant systems, a behavior model of a double-version fault-tolerant software system is established using a Markov reward model. System performance measures, such as the expected steady-state service rate, are evaluated, and a sensitivity analysis of selected parameters is performed.
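To make the evaluation step concrete, the following is a minimal sketch of how an expected steady-state service rate can be computed from a Markov reward model. It is not the paper's double-version model: the three states (robust, aged, failed), the transition rates `aging`, `failure`, and `repair`, and the per-state service rates in `rewards` are all hypothetical values chosen for illustration only.

```python
def steady_state(Q):
    """Solve pi * Q = 0 with sum(pi) = 1 for a small irreducible CTMC."""
    n = len(Q)
    # Transpose Q so that each row is one balance equation, then
    # replace the last (redundant) equation with the normalization.
    A = [[Q[j][i] for j in range(n)] for i in range(n)]
    A[-1] = [1.0] * n
    b = [0.0] * (n - 1) + [1.0]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(n):
            if row != col:
                factor = A[row][col] / A[col][col]
                A[row] = [x - factor * y for x, y in zip(A[row], A[col])]
                b[row] -= factor * b[col]
    return [b[i] / A[i][i] for i in range(n)]


def expected_service_rate(aging, failure, repair, rewards):
    """Expected steady-state reward of a hypothetical 3-state aging chain.

    States (assumed for illustration): 0 = robust, 1 = aged, 2 = failed.
    """
    Q = [
        [-aging, aging, 0.0],      # robust -> aged
        [0.0, -failure, failure],  # aged -> failed
        [repair, 0.0, -repair],    # failed -> robust (after repair)
    ]
    pi = steady_state(Q)
    return sum(p * r for p, r in zip(pi, rewards))


def sensitivity(aging, failure, repair, rewards, step=1e-6):
    """Finite-difference sensitivity of the service rate to the aging rate."""
    hi = expected_service_rate(aging + step, failure, repair, rewards)
    lo = expected_service_rate(aging - step, failure, repair, rewards)
    return (hi - lo) / (2 * step)


# Illustrative parameters only; none of these values come from the paper.
rewards = [10.0, 5.0, 0.0]  # service rate earned in each state
rate = expected_service_rate(0.1, 0.05, 1.0, rewards)
slope = sensitivity(0.1, 0.05, 1.0, rewards)
print(rate, slope)
```

Because this toy chain is a simple cycle, each steady-state probability is proportional to the mean sojourn time of its state, which gives a quick hand check of the solver. A negative `slope` indicates that faster aging lowers the expected service rate under these assumed values.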
