Fault Diagnosis and Automatic Reconfiguration for a Ring Subsystem

Abstract: A ring subsystem has been developed for a full-scale, high-performance heterogeneous computer network. The design goals and implementation issues of the automatic reconfiguration of the ring subsystem are presented. Particular emphasis is placed on reliability improvement techniques based on duplication of the ring subsystem components, and on the automatic reconfiguration algorithm, which determines the network configuration when faults occur.
