Non-Markovian Modeling of a BladeCenter Chassis Midplane

In distributed contexts such as Cloud computing, the reliability and availability of the provided resources and services have to be assured in order to meet user requirements. At the infrastructure level, this specification is translated into tighter ones on the datacenter hosting physical resources. In this paper, starting from a real case study of the IBM BladeCenter, we provide a technique for the quantitative evaluation of datacenter infrastructure availability. The proposed technique allows one to take into account both aging phenomena and multiple operating conditions. In particular, one subsystem of the BladeCenter, the chassis midplane, is studied. Indeed, based on the stochastic characterization of the midplane reliability through statistic measurements, a model dealing with the non-exponential failure time distribution thus obtained is evaluated to demonstrate the suitability and the effectiveness of the proposed technique.

[1]  Kishor S. Trivedi,et al.  Markov Dependability Models of Complex Systems: Analysis Techniques , 1996 .

[2]  Antonio Puliafito,et al.  Reliability assessment of wireless sensor nodes with non-linear battery discharge , 2010, 2010 IFIP Wireless Days.

[3]  Kishor S. Trivedi,et al.  Availability analysis of blade server systems , 2008, IBM Syst. J..

[4]  Juan Eloy Ruiz-Castro,et al.  Two models for a repairable two-system with phase-type sojourn time distributions , 2004, Reliab. Eng. Syst. Saf..

[5]  Süleyman Özekici,et al.  Reliability and Maintenance of Complex Systems , 2010, NATO ASI Series.

[6]  Francesco Longo,et al.  Two-layer symbolic representation for stochastic models with phase-type distributed events , 2015, Int. J. Syst. Sci..

[7]  Francesco Longo,et al.  Applying Symbolic Techniques to the Representation of Non-Markovian Models with Continuous PH Distributions , 2009, EPEW.

[8]  Kishor S. Trivedi,et al.  Application of semi-Markov process and CTMC to evaluation of UPS system availability , 2002, Annual Reliability and Maintainability Symposium. 2002 Proceedings (Cat. No.02CH37318).

[9]  Andrea Bondavalli,et al.  Markov Regenerative Stochastic Petri Nets to Model and Evaluate Phased Mission Systems Dependability , 2001, IEEE Trans. Computers.

[10]  Tadashi Dohi,et al.  Component Importance Analysis of Virtualized System , 2012, 2012 9th International Conference on Ubiquitous Intelligence and Computing and 9th International Conference on Autonomic and Trusted Computing.

[11]  Kishor S. Trivedi,et al.  Scalable Analytics for IaaS Cloud Availability , 2014, IEEE Transactions on Cloud Computing.

[12]  Antonio Puliafito,et al.  Evaluating wireless sensor node longevity through Markovian techniques , 2012, Comput. Networks.