A Fault Tolerance Management Framework for Wireless Sensor Networks

Wireless Sensor Networks (WSN) have the potential of significantly enhancing our ability to monitor and interact with our physical environment. Realizing a fault-tolerant operation is critical to the success of WSN. The main challenge is providing fault-tolerance (FT) while conserving the limited resources of the network. Our main contribution in this paper is to propose a general framework for fault-tolerance in WSN. The proposed framework can be used to guide the design and development of FT solutions and to evaluate existing ones. We present a comparative study of the existing schemes and identify potential enhancements. A primary module of the framework is the learning and refinement module which enables a FT solution to be adaptive and self-configurable based on changes in the network conditions. We view this as vital to the resource-constrained and highly dynamic WSN. Up to our knowledge, we are the first to propose the implementation of such module in FT solutions for WSN.

[1]  Ian F. Akyildiz,et al.  Wireless sensor networks: a survey , 2002, Comput. Networks.

[2]  Mohamed F. Younis,et al.  On handling QoS traffic in wireless sensor networks , 2004, 37th Annual Hawaii International Conference on System Sciences, 2004. Proceedings of the.

[3]  Roy Friedman,et al.  Evaluating distributed checkpointing protocols , 2003, 23rd International Conference on Distributed Computing Systems, 2003. Proceedings..

[4]  Mohamed F. Younis,et al.  Safe base-station repositioning in wireless sensor networks , 2006, 2006 IEEE International Performance Computing and Communications Conference.

[5]  Iman Saleh,et al.  In-network fault tolerance in networked sensor systems , 2006, DIWANS '06.

[6]  William H. Sanders,et al.  The Mobius modeling tool , 2001, Proceedings 9th International Workshop on Petri Nets and Performance Models.

[7]  Nitin H. Vaidya,et al.  On Checkpoint Latency , 1995 .

[8]  James S. Plank,et al.  An Overview of Checkpointing in Uniprocessor and DistributedSystems, Focusing on Implementation and Performance , 1997 .

[9]  C. R. Lin On-demand QoS routing in multihop mobile networks , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[10]  Miodrag Potkonjak,et al.  Fault Tolerance in Wireless Ad-Hoc Sensor Networks , 2007 .

[11]  M. Potkonjak,et al.  Fault tolerance techniques for wireless ad hoc sensor networks , 2002, Proceedings of IEEE Sensors.

[12]  Klara Nahrstedt,et al.  Distributed quality-of-service routing in ad hoc networks , 1999, IEEE J. Sel. Areas Commun..

[13]  Chenxi Zhu,et al.  QoS routing for mobile ad hoc networks , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[14]  Chun-Di Mu,et al.  An efficient algorithm for fault tolerance in multisensor networks , 2004, Proceedings of 2004 International Conference on Machine Learning and Cybernetics (IEEE Cat. No.04EX826).

[15]  D. N. Jayasimha Fault tolerance in multisensor networks , 1996, IEEE Trans. Reliab..

[16]  Rami G. Melhem,et al.  RideSharing: Fault Tolerant Aggregation in Sensor Networks Using Corrective Actions , 2006, 2006 3rd Annual IEEE Communications Society on Sensor and Ad Hoc Communications and Networks.

[17]  S. Sitharama Iyengar,et al.  Functional characterization of fault tolerant integration in distributed sensor networks , 1991, IEEE Trans. Syst. Man Cybern..

[18]  Keith Marzullo,et al.  Tolerating failures of continuous-valued sensors , 1990, TOCS.

[19]  Deborah Estrin,et al.  Data-Centric Storage in Sensornets with GHT, a Geographic Hash Table , 2003, Mob. Networks Appl..