Distributed Diagnosis of Networked, Embedded Systems

Abstract: Networked embedded systems are composed of a large number of physically distributed nodes that interact with the physical world through sensors and actuators, have their own computational capabilities, and communicate with each other over a wired or wireless network. Monitoring and diagnosis for such systems must address several challenges caused by the distribution of resources, communication limitations, and node and link failures. This paper presents a distributed diagnosis framework that exploits the topology of the physical system being diagnosed to limit inter-diagnoser communication and to compute diagnoses in an anytime and any-information manner, making it robust to communication and processor failures. The framework adopts the consistency-based diagnosis formalism and develops a distributed constraint satisfaction realization of the diagnosis algorithm. Each local diagnoser first computes locally consistent diagnoses, taking into account only local sensing information. The local diagnosis sets are then reduced to globally consistent diagnoses through pairwise communication between local diagnosers. The algorithm has been demonstrated on the diagnosis of paper path faults for the Xerox DC265 printer.
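
The two-phase scheme described in the abstract (local consistency first, then pairwise reduction) can be illustrated with a minimal Python sketch. It is not the paper's algorithm: the component names `A`, `B`, `C`, the diagnoser names `d1` and `d2`, the two-valued mode set, and the local consistency checks are all hypothetical, and the pairwise filter shown is a plain arc-consistency-style pass, which in general gives only a necessary condition for global consistency.

```python
from itertools import product

MODES = ("ok", "faulty")  # assumed two-valued behavioral modes


def local_diagnoses(components, consistent):
    """Enumerate mode assignments over `components` that pass a local check."""
    found = []
    for modes in product(MODES, repeat=len(components)):
        candidate = dict(zip(components, modes))
        if consistent(candidate):
            found.append(candidate)
    return found


def compatible(d1, d2):
    """Two local diagnoses agree on every component they share."""
    return all(d2[c] == m for c, m in d1.items() if c in d2)


def pairwise_reduce(sets, links):
    """Drop local diagnoses with no compatible partner at a neighbor.

    Repeats until nothing changes. Every intermediate state is still a
    superset of the final result, so the loop can be interrupted early.
    """
    changed = True
    while changed:
        changed = False
        for a, b in links:
            for src, dst in ((a, b), (b, a)):
                kept = [d for d in sets[src]
                        if any(compatible(d, e) for e in sets[dst])]
                if len(kept) != len(sets[src]):
                    sets[src] = kept
                    changed = True
    return sets


# Hypothetical example: d1 watches components A and B, d2 watches B and C.
# d1's local sensing implies at least one of A, B is faulty;
# d2's local sensing implies B and C are both behaving normally.
sets = {
    "d1": local_diagnoses(
        ["A", "B"], lambda d: "faulty" in (d["A"], d["B"])),
    "d2": local_diagnoses(
        ["B", "C"], lambda d: d["B"] == "ok" and d["C"] == "ok"),
}
local_count_d1 = len(sets["d1"])
reduced = pairwise_reduce(sets, links=[("d1", "d2")])
```

In this toy case `d1` starts with three locally consistent candidates, and one exchange with `d2` leaves only the candidate in which `A` is faulty and `B` is ok. Only diagnosers that share a component need to communicate, which is how the topology limits message traffic in this sketch.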
