Diagnosis and Communication in Distributed Systems

This paper discusses diagnosis problems in distributed systems within the context of a language- theoretic discrete event formalism. A distributed system is seen as a system with multiple spatially separated sites with each site having a diagnoser that observes some of the events generated by the system and diagnoses the faults associated with the site. We allow the diagnosers to share information by sending messages to each other. Distributed systems are classified as being centrally, decentrally, and independently diagnosable. We characterize the class of distributed systems for which there exists a centralized diagnoser but there exists no inter-diagnoser messaging scheme that can replicate the information available to a centralized diagnoser. Plant properties that are necessary and sufficient for the three kinds of diagnosability are derived. The formulation and results are motivated by a discussion on the diagnosis of failures in a wireless LAN used to support the real-time operation of automated vehicles.

[1]  Jana Kosecka,et al.  Control of Discrete Event Systems , 1992 .

[2]  Bret Foreman A SURVEY OF WIRELESS COMMUNICATIONS TECHNOLOGIES FOR AUTOMATED VEHICLE CONTROL , 1995 .

[3]  J. Hedrick,et al.  String stability of interconnected systems , 1996, IEEE Trans. Autom. Control..

[4]  J. Hedrick,et al.  String stability of interconnected systems , 1995, Proceedings of 1995 American Control Conference - ACC'95.

[5]  D. N. Pandalai,et al.  Template languages for fault monitoring of single-instance and multiple-instance discrete event processes , 1997, Proceedings of the 36th IEEE Conference on Decision and Control.

[6]  A. E. Lindsey,et al.  Design, verification and failure diagnosis of wireless communication protocols for the AHS , 1997, Proceedings of Conference on Intelligent Transportation Systems.

[7]  A. Benveniste,et al.  A Petri net approach to fault detection and diagnosis in distributed systems. I. Application to telecommunication networks, motivations, and modelling , 1997, Proceedings of the 36th IEEE Conference on Decision and Control.

[8]  Stéphane Lafortune,et al.  Failure diagnosis using discrete event models , 1994, Proceedings of 1994 33rd IEEE Conference on Decision and Control.

[9]  Raja Sengupta,et al.  Diagnosability of discrete-event systems , 1995, IEEE Trans. Autom. Control..

[10]  JardClaude,et al.  Fault Detection and Diagnosis in Distributed Systems , 1998 .

[11]  E. S. Pearson,et al.  On the Problem of the Most Efficient Tests of Statistical Hypotheses , 1933 .

[12]  Datta N. Godbole,et al.  Tools for the design of fault management systems [automated highway systems] , 1997, Proceedings of Conference on Intelligent Transportation Systems.

[13]  Rajesh Rajamani,et al.  A COMPLETE FAULT DIAGNOSTIC SYSTEM FOR LONGITUDINAL CONTROL OF AUTOMATED VEHICLES , 1997 .