Quantitative RAS Comparison of Sun CMT/Solaris and X86/Linux Servers

By incorporating the Chip Multi-Threading (CMT) and operating system predictive self-healing technologies, the Sun CMT/Solaris based servers are not only cost/performance effective, but also more robust in reliability, availability, and serviceability (RAS) than the X86/Linux servers with similar performance. The differentiators include higher levels of hardware integration, more fault tolerance provisions in processors, the Solaris memory page retirement (MPR), and the Solaris/SPARC processor offlining (PO) capabilities of the CMT/Solaris server. This study applies analytical models, with parameters calibrated by field experience, to quantitatively compare system RAS, against hardware faults, between the CMT/Solaris and X86/Linux servers. The results show significant RAS benefits of the CMT, MPR, and PO technologies.

[1]  Dong Tang,et al.  Assessment of the Effect of Memory Page Retirement on System RAS Against Hardware Faults , 2006, International Conference on Dependable Systems and Networks (DSN'06).

[2]  Thomas A. Corbi,et al.  The dawning of the autonomic computing era , 2003, IBM Syst. J..

[3]  Michael W. Shapiro Self-Healing in Modern Operating Systems , 2004, ACM Queue.

[4]  Dong Tang,et al.  Automatic generation of availability models in RAScad , 2002, Proceedings International Conference on Dependable Systems and Networks.