Availability Analysis of Application Servers Using Software Rejuvenation and Virtualization

Demands on software reliability and availability have increased tremendously due to the nature of present day applications. We focus on the aspect of software for the high availability of application servers since the unavailability of servers more often originates from software faults rather than hardware faults. The software rejuvenation technique has been widely used to avoid the occurrence of unplanned failures, mainly due to the phenomena of software aging or caused by transient failures. In this paper, first we present a new way of using the virtual machine based software rejuvenation named VMSR to offer high availability for application server systems. Second we model a single physical server which is used to host multiple virtual machines (VMs) with the VMSR framework using stochastic modeling and evaluate it through both numerical analysis and SHARPE (Symbolic Hierarchical Automated Reliability and Performance Evaluator) tool simulation. This VMSR model is very general and can capture application server characteristics, failure behavior, and performability measures. Our results demonstrate that VMSR approach is a practical way to ensure uninterrupted availability and to optimize performance for aging applications.

[1]  Jordi Torres,et al.  Using Virtualization to Improve Software Rejuvenation , 2009, IEEE Trans. Computers.

[2]  Kishor S. Trivedi,et al.  A measurement-based model for estimation of resource exhaustion in operational software systems , 1999, Proceedings 10th International Symposium on Software Reliability Engineering (Cat. No.PR00443).

[3]  Kishor S. Trivedi,et al.  A methodology for detection and estimation of software aging , 1998, Proceedings Ninth International Symposium on Software Reliability Engineering (Cat. No.98TB100257).

[4]  Shigeru Chiba,et al.  A Fast Rejuvenation Technique for Server Consolidation with Virtual Machines , 2007, 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'07).

[5]  Kishor S. Trivedi,et al.  Proactive management of software aging , 2001, IBM J. Res. Dev..

[6]  Jordi Torres,et al.  High-available grid services through the use of virtualized clustering , 2007, 2007 8th IEEE/ACM International Conference on Grid Computing.

[7]  Kishor S. Trivedi,et al.  Analysis of Software Aging in a Web Server , 2006, IEEE Transactions on Reliability.

[8]  Mark Sullivan,et al.  Software defects and their impact on system availability-a study of field failures in operating systems , 1991, [1991] Digest of Papers. Fault-Tolerant Computing: The Twenty-First International Symposium.

[9]  Orran Krieger,et al.  Virtualization for high-performance computing , 2006, OPSR.

[10]  Robert S. Hanmer,et al.  Software rejuvenation , 2010, PLOP '10.

[11]  Kishor S. Trivedi,et al.  Analysis of Preventive Maintenance in Transactions Based Software Systems , 1998, IEEE Trans. Computers.

[12]  T. Chiueh,et al.  A Survey on Virtualization Technologies , 2005 .

[13]  Kishor S. Trivedi,et al.  Reliability and Performability Modeling Using SHARPE 2000 , 2000, Computer Performance Evaluation / TOOLS.

[14]  Kishor S. Trivedi SHARPE 2002: Symbolic Hierarchical Automated Reliability and Performance Evaluator , 2002, Proceedings International Conference on Dependable Systems and Networks.

[15]  Daniel P. Siewiorek,et al.  High-availability computer systems , 1991, Computer.

[16]  Katerina Goseva-Popstojanova,et al.  Modeling and analysis of software aging and rejuvenation , 2000, Proceedings 33rd Annual Simulation Symposium (SS 2000).

[17]  Yennun Huang,et al.  Software rejuvenation: analysis, module and applications , 1995, Twenty-Fifth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[18]  Christian Engelmann,et al.  Proactive fault tolerance for HPC with Xen virtualization , 2007, ICS '07.

[19]  Matthew Disney,et al.  Scalable Deployment and Configuration of High-Performance Virtual Clusters , 2008 .