Closed-form solution for reliability of SCI-based multiprocessor systems using Weibull distribution and self-healing rings

Abstract This paper introduces a new closed-form solution for the reliability of large-scale multiprocessor systems. The systems are based on SCI rings interconnected in hierarchical structures. Reliability expressions using enumeration technique are derived assuming Weibull failure process. The reliability function derived in this paper is general and valid for any hierarchical ring-based system with arbitrary number of levels. The hierarchical interconnections are constructed from self-healing rings and basic rings. The analysis shows the improvement achieved in reliability when self-healing rings are used. Although we used hierarchical systems based on SCI rings, the technique followed in this work is applied for any type of rings such as slotted or token rings.

[1]  Jon M. Peha,et al.  Analyzing the fault tolerance of double-loop networks , 1994, TNET.

[2]  Michael Stumm,et al.  Performance Evaluation of Hierarchical Ring-Based Shared Memory Multiprocessors , 1994, IEEE Trans. Computers.

[3]  James K. Archibald,et al.  The two-processor reliability of hierarchical large-scale ring-based networks , 1996, Proceedings of HICSS-29: 29th Hawaii International Conference on System Sciences.

[4]  John D. Spragins Token ring reliability models , 1992, [Proceedings] IEEE INFOCOM '92: The Conference on Computer Communications.

[5]  O.W.W. Yang Terminal-pair reliability of three-type computer communication networks , 1992 .

[6]  Mirko Vujošević,et al.  Reliability analyses for a tree-structured hierarchic control system , 1992 .

[7]  Cauligi S. Raghavendra,et al.  A Survey of Multi-Connected Loop Topologies for Local Computer Networks , 1986, Comput. Networks.

[8]  Kishor S. Trivedi,et al.  Reliability analysis of the double counter-rotating ring with concentrator attachments , 1994, TNET.

[9]  James R. Goodman,et al.  The Impact of Pipelined Channels on k-ary n-Cube Networks , 1994, IEEE Trans. Parallel Distributed Syst..

[10]  Amy P. Felty,et al.  Cache Coherency in SCI: Specification and a Sketch of Correctness , 1999, Formal Aspects of Computing.

[11]  Jiahnsheng Yin,et al.  K-terminal reliability in ring networks , 1994 .

[12]  Oliver C. Ibe,et al.  Reliability comparison of token-ring network schemes , 1992 .

[13]  Alan D. George,et al.  Simulative performance analysis of distributed switching fabrics for SCI-based systems , 2000, Microprocess. Microsystems.