Efficient symmetric comparison-based self-diagnosis using backpropagation artificial neural networks

One of the main challenging problems in distributed systems is the comparison-based self-diagnosis which aims at identifying the set faulty of nodes (or units) based on the matching and mismatching among the system's nodes. The comparison-based self-diagnosis approach assigns tasks to the nodes that need to be diagnosed. The outcomes from each pair of units performing the same task are compared, and based on such comparison their fault status is identified. In this work, we consider symmetric t-comparison-based diagnosable systems in which at most t nodes can fail permanently at the same time. We introduce a novel backpropagation neural network-based (BPNN) approach to implement a new fault identification algorithm. The BPNN diagnosis algorithm uses the comparison approach to collect the agreements and disagreements among the nodes, and then uses these comparison outcomes to identify which nodes are faulty and which ones are fault-free. The BPNN-based diagnosis algorithm needs first to undergo an extensive training phase using various faulty situations. Results from a thorough simulation study demonstrate the effectiveness of the BPNN-based self diagnosis algorithm for randomly generated diagnosable systems, making it a viable addition to existing diagnosis algorithms. The novel approach is shown to scale very well for large systems.

[1]  Azzedine Boukerche,et al.  Diagnosing mobile ad-hoc networks: two distributed comparison-based self-diagnosis protocols , 2006, MobiWac '06.

[2]  Abhijit Sengupta,et al.  On Self-Diagnosable Multiprocessor Systems: Diagnosis by the Comparison Approach , 1992, IEEE Trans. Computers.

[3]  Alex M. Andrew,et al.  Fault Diagnosis: Models, Artificial Intelligence, Applications , 2005 .

[4]  B. Yegnanarayana,et al.  Artificial Neural Networks , 2004 .

[5]  Zhi-Hua Zhou,et al.  Using neural networks for fault diagnosis , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[6]  Kang G. Shin,et al.  Probabilistic diagnosis of multiprocessor systems , 1994, CSUR.

[7]  Kyung-Yong Chwa,et al.  Schemes for Fault-Tolerant Computing: A Comparison of Modularly Redundant and t-Diagnosable Systems , 1981, Inf. Control..

[8]  Mourad ELHADEF,et al.  Efficient comparison-based fault diagnosis of multiprocessor systems using an evolutionary approach , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[9]  Azzedine Boukerche,et al.  A distributed fault identification protocol for wireless and mobile ad hoc networks , 2008, J. Parallel Distributed Comput..

[10]  Miroslaw Malek,et al.  A comparison connection assignment for diagnosis of multiprocessor systems , 1980, ISCA '80.

[11]  Andrzej Pelc,et al.  Complexity of Fault Diagnosis in Comparison Models , 1992, IEEE Trans. Computers.

[12]  Amiya Nayak,et al.  System-Level Fault Diagnosis Using Comparison Models: An Artificial-Immune-Systems-Based Approach , 2006, J. Networks.

[13]  Carl E. Landwehr,et al.  Basic concepts and taxonomy of dependable and secure computing , 2004, IEEE Transactions on Dependable and Secure Computing.

[14]  GERNOT METZE,et al.  On the Connection Assignment Problem of Diagnosable Systems , 1967, IEEE Trans. Electron. Comput..

[15]  Douglas M. Blough,et al.  The Broadcast Comparison Model for On-Line Fault Diagnosis in Multicomputer Systems , 1999, IEEE Trans. Computers.

[16]  Andrzej Pelc,et al.  Undirected Graph Models for System-Level Fault Diagnosis , 1991, IEEE Trans. Computers.

[17]  Yuan Yan Tang,et al.  Efficient Fault Identification of Diagnosable Systems under the Comparison Model , 2007, IEEE Transactions on Computers.

[18]  Mourad Elhadef A Perceptron Neural Network for Asymmetric Comparison-Based System-Level Fault Diagnosis , 2009, 2009 International Conference on Availability, Reliability and Security.

[19]  Miroslaw Malek,et al.  The consensus problem in fault-tolerant computing , 1993, CSUR.

[20]  Stefano Chessa,et al.  Comparison-based system-level fault diagnosis in ad hoc networks , 2001, Proceedings 20th IEEE Symposium on Reliable Distributed Systems.