Leader based adaptive fault diagnosis algorithm for distributed systems

This paper proposes a new algorithm, the Leader Based Adaptive Fault Diagnosis (L-AFD) algorithm, for distributed systems. The algorithm detects all faulty nodes in a network that is not fully connected, and it works for arbitrary network topologies. The t-diagnosability of the system under consideration is (n-1), where n is the total number of nodes (computer systems) in the network. The algorithm supports the entry of new nodes into the network and allows repaired faulty nodes to re-enter during the next diagnostic cycle. It can also work with more than one leader, and it executes periodically on each node.
