A Recovery Model for Survivable Distributed Systems through the Use of Virtualization

The interest in system survivability under a variety of attacks, faults or accidents has been on the rise in recent years and survivability is receiving increasing attention as a key property of the mission critical system. Distributed systems are vital for all and it is a critical task to make such systems secure and survive. Survivability in the presence of attacks or failures depends on three key system capabilities: resistance, recognition, and recovery. Virtualization at levels (system, storage, and network) became important as a way to improve the system security, reliability, availability and survivability, reduce costs, and provide greater flexibility. In this paper, we focus attention on continued service and recovery issue. We introduce a survivability framework for distributed systems through the use of virtualization technology and software rejuvenation methodology. We present a recovery model and evaluate the steady-state system availability and survivability based on the familiar Markovian analysis through SHARPE tools.

[1]  Kishor S. Trivedi SHARPE 2002: Symbolic Hierarchical Automated Reliability and Performance Evaluator , 2002, Proceedings International Conference on Dependable Systems and Networks.

[2]  Kevin J. Sullivan,et al.  Towards a rigorous definition of information system survivability , 2003, Proceedings DARPA Information Survivability Conference and Exposition.

[3]  T. Chiueh,et al.  A Survey on Virtualization Technologies , 2005 .

[4]  Jordi Torres,et al.  Using Virtualization to Improve Software Rejuvenation , 2007, IEEE Transactions on Computers.

[5]  Nancy R. Mead,et al.  Requirements definition for survivable network systems , 1998, Proceedings of IEEE International Symposium on Requirements Engineering: RE '98.

[6]  Jordi Torres,et al.  High-available grid services through the use of virtualized clustering , 2007, 2007 8th IEEE/ACM International Conference on Grid Computing.

[7]  Kishor S. Trivedi,et al.  Reliability and Performability Modeling Using SHARPE 2000 , 2000, Computer Performance Evaluation / TOOLS.

[8]  Somesh Jha,et al.  Survivability analysis of networked systems , 2001, Proceedings of the 23rd International Conference on Software Engineering. ICSE 2001.

[9]  John C. Knight,et al.  TOWARDS A DEFINITION OF SURVIVABILITY , 2000 .