FDAN: Failure Detection Protocol for Mobile Ad Hoc Networks

This work deals with fault tolerance in distributed MANET (Mobile Ad hoc Networks) systems. However, the major issue for a failure detection protocol is to confound between a fault and a voluntary or an involuntary disconnection of nodes, and therefore to suspect correct nodes to be failing and conversely. Within this context, we propose in this paper a failure detection protocol that copes with MANET systems constraints. The aim of this work is to allow to the system to launch recovery process. For this effect, our protocol, called FDAN, is based on the class of heartbeat protocols. It takes into account: no preliminary knowledge of the network, the nodes disconnection and reconnection, resources limitation...Hence, we show that by using temporary lists and different timeout levels, we achieve to reduce sensibly the number of false suspicions.

[1]  Achour Mostéfaoui,et al.  A hybrid approach for building eventually accurate failure detectors , 2004, 10th IEEE Pacific Rim International Symposium on Dependable Computing, 2004. Proceedings..

[2]  Roy Friedman,et al.  Evaluating failure detection in mobile ad-hoc networks , 2009, Int. J. Pervasive Comput. Commun..

[3]  Marcos K. Aguilera,et al.  Stable Leader Election , 2001, DISC.

[4]  Pierre Sens,et al.  An Unreliable Failure Detector for Unknown and Mobile Networks , 2008, OPODIS.

[5]  Michel Raynal,et al.  The k-simultaneous consensus problem , 2010, Distributed Computing.

[6]  Sam Toueg,et al.  Unreliable failure detectors for reliable distributed systems , 1996, JACM.

[7]  Sam Toueg,et al.  The weakest failure detector for solving consensus , 1996, JACM.

[8]  Mikel Larrea,et al.  Eventually consistent failure detectors , 2005, J. Parallel Distributed Comput..

[9]  Achour Mostéfaoui,et al.  Narrowing power vs efficiency in synchronous set agreement: Relationship, algorithms and lower bound , 2010, Theor. Comput. Sci..

[10]  Denis Conan,et al.  Détection de partition pour la gestion de groupes en environnement mobile , 2005, UbiMob '05.

[11]  Marcos K. Aguilera,et al.  Using the Heartbeat Failure Detector for Quiescent Reliable Communication and Consensus in Partitionable Networks , 1999, Theor. Comput. Sci..

[12]  Ozalp Babaoglu,et al.  Consistent global states of distributed systems: fundamental concepts and mechanisms , 1993 .

[13]  Danny Dolev,et al.  On the minimal synchronism needed for distributed consensus , 1983, 24th Annual Symposium on Foundations of Computer Science (sfcs 1983).

[14]  Pierre Sens,et al.  Failure, Disconnection and Partition Detection in Mobile Environment , 2008, 2008 Seventh IEEE International Symposium on Network Computing and Applications.

[15]  Michel Raynal,et al.  Group membership failure detection: a simple protocol and its probabilistic analysis , 1999, Distributed Syst. Eng..