A framework for cluster availability specification and evaluation

Cluster Computing is becoming cost effective and popular for its enormous computational power. High availability features need to be included to ensure that cluster computing environments can provide continuous services. A typical availability modeling method is based on the analytical formalisms such as fault tree, Markov chains, Stochastic Petri Net (SPN), etc. However, people in system design and development may not be familiar with the analytical modeling techniques. This inevitably creates a gap between system designers and reliability engineers. Moreover, the analytical models are still primitive. As a consequence, Markov chain and Petri Net models are often large when the modeled systems are complicated. These large models may be out of the intuitive of modelers, lose the view of the system, and be error prone. We propose a framework that models cluster computing systems' availability based on UML design notations, and evaluates system availability by transforming the UML availability model into corresponding analytical models. The UML-based availability modeling framework is to bridge the gap between the two communities. With our approach, the availability analysis of cluster computing systems can be done at the design stage with ease.

[1]  강문설 [서평]「The Unified Modeling Language User Guide」 , 1999 .

[2]  Betty H. C. Cheng,et al.  A general framework for formalizing UML with formal languages , 2001, Proceedings of the 23rd International Conference on Software Engineering. ICSE 2001.

[3]  Stefan Müller,et al.  Diagram Interchange for UML , 2002, UML.

[4]  T. Y. WilliamJ,et al.  Numerical Methods in Markov Chain Modeling , 1992, Operational Research.

[5]  Yousef Saad,et al.  Numerical Methods in Markov Chain Modelling , 1996 .

[6]  Joanne Bechta Dugan,et al.  Automatic synthesis of dynamic fault trees from UML system models , 2002, 13th International Symposium on Software Reliability Engineering, 2002. Proceedings..

[7]  Boudewijn R. Haverkort,et al.  Specification techniques for Markov reward models , 1993, Discret. Event Dyn. Syst..

[8]  Tong Liu,et al.  Availability prediction and modeling of high mobility OSCAR cluster , 2003, 2003 Proceedings IEEE International Conference on Cluster Computing.

[9]  Kishor S. Trivedi,et al.  Specification and generation of Markov reward models , 1992 .

[10]  Kishor S. Trivedi,et al.  Reliability analysis of interconnection networks using hierarchical composition , 1989 .

[11]  Hany H. Ammar,et al.  Architectural-Level Risk Analysis Using UML , 2003, IEEE Trans. Software Eng..

[12]  Andrea Bondavalli,et al.  Stochastic Dependability Analysis of System Architecture Based on UML Models , 2002, WADS.

[13]  Raja Nassar,et al.  A light-weight solution for large sparse Markov processes , 2005, ACM-SE 43.