Reliability modeling of SCI ring-based topologies

Reliability prediction is an important factor in the study of multiprocessor and cluster interconnects. One such interconnect is the scalable coherent interface (SCI), a point-to-point, ring-based interconnect that can be connected into switched ring topologies. However, the reliability of SCI-based topologies cannot be deduced from earlier network reliability work as link failures within an SCI-based interconnect are not independent of one another. This paper presents the results of a reliability study on 1D and 2D k-ary n-cube switching fabrics for SCI based on the elimination of rings rather than of links. The reliability models were created in UltraSAN and verified using analytical modeling. The results show that the reliability of a single-ring system can be greatly enhanced by the addition of a redundant ring yet the reliability of a torus does not increase significantly with additional redundant rings. Hence, the benefit of adding redundant rings is dependent both upon the topology and the degree of reliability sought.

[1]  Andrew L. Reibman,et al.  Reliability Models for Fault-Tolerant Private Network Applications , 1994, IEEE Trans. Computers.

[2]  William T. Yost Cost Effective Fault Tolerance for Network Routing , 1995 .

[3]  R. Ramaswami,et al.  Book Review: Design and Analysis of Fault-Tolerant Digital Systems , 1990 .

[4]  José A. B. Fortes,et al.  Detailed Modeling and Reliability Analysis of Fault-Tolerant Processor Arrays , 1992, IEEE Trans. Computers.

[5]  Xian Cheng,et al.  Reliability of a Class of Multistage Interconnection Networks , 1992, IEEE Trans. Parallel Distributed Syst..

[6]  Jean Arlat,et al.  SURF-2: A program for dependability evaluation of complex hardware and software systems , 1993, FTCS-23 The Twenty-Third International Symposium on Fault-Tolerant Computing.

[7]  Kishor S. Trivedi,et al.  Dependability Evaluation of a Class of Multi-Loop Topologies for Local Area Networks , 1989, IBM J. Res. Dev..

[8]  Charles J. Colbourn,et al.  Assessing Reliability of Multistage Interconnection Networks , 1993, IEEE Trans. Computers.

[9]  Dharma P. Agrawal,et al.  Cost-Performance Trade-Offs in Manhattan Street Network Versus 2-D Torus , 1990, IEEE Trans. Computers.

[10]  Christoph Lindemann,et al.  DSPNexpress: A Software Package for the Efficient Solution of Deterministic and Stochastic Petri Nets , 1993, Perform. Evaluation.

[11]  Kishor S. Trivedi,et al.  SPNP: stochastic Petri net package , 1989, Proceedings of the Third International Workshop on Petri Nets and Performance Models, PNPM89.

[12]  Barry W. Johnson Design & analysis of fault tolerant digital systems , 1988 .

[13]  Kishor S. Trivedi,et al.  Reliability analysis of interconnection networks using hierarchical composition , 1989 .

[14]  Cauligi S. Raghavendra,et al.  A Survey of Multi-Connected Loop Topologies for Local Computer Networks , 1986, Comput. Networks.

[15]  Kishor S. Trivedi,et al.  Analysis of Typical Fault-Tolerant Architectures using HARP , 1987, IEEE Transactions on Reliability.

[16]  M.A. Qureshi,et al.  The UltraSAN Modeling Environment , 1995, Perform. Evaluation.

[17]  Cauligi S. Raghavendra,et al.  Reliability analysis of redundant-path interconnection networks , 1989 .

[18]  Toby Berger,et al.  Reliability and availability analysis of Manhattan street networks , 1993, IEEE Trans. Commun..

[19]  Bernard L. Menezes,et al.  New Bounds on the Reliability of Augmented Shuffle-Exchange Networks , 1995, IEEE Trans. Computers.

[20]  Kishor S. Trivedi,et al.  Multistage Interconnection Network Reliability , 1989, IEEE Trans. Computers.

[21]  Alan D. George,et al.  Simulative performance analysis of distributed switching fabrics for SCI-based systems , 2000, Microprocess. Microsystems.