Troubleshooting multihop wireless networks

Effective network troubleshooting is critical for maintaining efficient and reliable network operation. Troubleshooting is especially challenging in multihop wireless networks because the behavior of such networks depends on complicated interactions between many unpredictable factors such as RF noise, signal propagation, node interference, and traffic flows. In this paper we propose a new direction for research on fault diagnosis in wireless networks. Specifically, we present a diagnostic system that employs trace-driven simulations to detect faults and perform root cause analysis. We apply this approach to diagnose performance problems caused by packet dropping, link congestion, external noise, and MAC misbehavior. In a 25 node multihop wireless network, we are able to diagnose over 10 simultaneous faults of multiple types with more than 80% coverage. Our framework is general enough for a wide variety of wireless and wired networks.

[1]  Niki Pissinou,et al.  Mobile Agents to Automate Fault Management in Wireless and Mobile Networks , 2000, IPDPS Workshops.

[2]  Nitin H. Vaidya,et al.  Detection and handling of MAC layer misbehavior in wireless networks , 2003, 2003 International Conference on Dependable Systems and Networks, 2003. Proceedings..

[3]  Martin Heusse,et al.  Performance anomaly of 802.11b , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[4]  Paramvir Bahl,et al.  Fault Detection, Isolation, and Diagnosis in Multihop Wireless Networks , 2004 .

[5]  Wenli Chen,et al.  ANMP: ad hoc network management protocol , 1999, IEEE J. Sel. Areas Commun..

[6]  Jean-Yves Le Boudec,et al.  Nodes bearing grudges: towards routing security, fairness, and robustness in mobile ad hoc networks , 2002, Proceedings 10th Euromicro Workshop on Parallel, Distributed and Network-based Processing.

[7]  Robert Tappan Morris,et al.  Link-level measurements from an 802.11b mesh network , 2004, SIGCOMM '04.

[8]  Baruch Awerbuch,et al.  Provably Secure Competitive Routing against Proactive Byzantine Adversaries via Reinforcement Learning , 2003 .

[9]  David D. Clark,et al.  A knowledge plane for the internet , 2003, SIGCOMM '03.

[10]  Donald F. Towsley,et al.  On integrating fluid models with packet simulation , 2004, IEEE INFOCOM 2004.

[11]  Jean-Yves Le Boudec,et al.  The Effect of Rumor Spreading in Reputation Systems for Mobile Ad-hoc Networks , 2003 .

[12]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[13]  Calvin Newport,et al.  The mistaken axioms of wireless-network research , 2003 .

[14]  Jennifer C. Hou,et al.  A fast simulation framework for IEEE 802.11-operated wireless LANs , 2004, SIGMETRICS '04/Performance '04.

[15]  Robert Tappan Morris,et al.  a high-throughput path metric for multi-hop wireless routing , 2003, MobiCom '03.

[16]  Yih-Chun Hu,et al.  Wormhole Detection in Wireless Ad Hoc Networks , 2002 .

[17]  Jeffrey D. Case,et al.  Simple Network Management Protocol (SNMP) , 1989, RFC.

[18]  Yechiam Yemini,et al.  Towards self-configuring networks , 2002, Proceedings DARPA Active Networks Conference and Exposition.

[19]  Jitendra Padhye,et al.  Routing in multi-radio, multi-hop wireless mesh networks , 2004, MobiCom '04.

[20]  Eugene C. Freuder,et al.  Generating Diagnositc Tools for Network Fault Management , 1997, Integrated Network Management.

[21]  Voon Chin Phua,et al.  Wireless lan medium access control (mac) and physical layer (phy) specifications , 1999 .

[22]  Paramvir Bahl,et al.  Architecture and techniques for diagnosing faults in IEEE 802.11 infrastructure networks , 2004, MobiCom '04.

[23]  Bhaskaran Raman,et al.  Turning 802.11 inside-out , 2004, Comput. Commun. Rev..

[24]  Daniel R. Simon,et al.  Secure traceroute to detect faulty or malicious routing , 2003, CCRV.

[25]  Mukesh Singhal,et al.  Low-Cost Checkpointing and Failure Recovery in Mobile Computing Systems , 1996, IEEE Trans. Parallel Distributed Syst..

[26]  A. M. Abdullah,et al.  Wireless lan medium access control (mac) and physical layer (phy) specifications , 1997 .

[27]  David A. Maltz,et al.  DSR: the dynamic source routing protocol for multihop wireless ad hoc networks , 2001 .

[28]  Mary Baker,et al.  Mitigating routing misbehavior in mobile ad hoc networks , 2000, MobiCom '00.

[29]  Chien-Chung Shen,et al.  The Guerrilla management architecture for ad hoc networks , 2002, MILCOM 2002. Proceedings.