Data Mining for Diagnostic Debugging in Sensor Networks: Preliminary Evidence and Lessons Learned

Sensor networks and pervasive computing systems intimately combine computation, communication and interactions with the physical world, thus increasing the complexity of the development effort, violating communication protocol layering, and making traditional network diagnostics and debugging less effective at catching problems. Tighter coupling between communication, computation, and interaction with the physical world is likely to be an increasing trend in emerging edge networks and pervasive systems. This paper reviews recent tools developed by the authors to understand the root causes of complex interaction bugs in edge network systems that combine computation, communication and sensing. We concern ourselves with automated failure diagnosis in the face of non-reproducible behavior, high interactive complexity, and resource constraints. Several examples are given to finding bugs in real sensor network code using the tools developed, demonstrating the efficacy of the approach.

[1]  Tarek F. Abdelzaher,et al.  The LiteOS Operating System: Towards Unix-Like Abstractions for Wireless Sensor Networks , 2008, 2008 International Conference on Information Processing in Sensor Networks (ipsn 2008).

[2]  Heikki Mannila,et al.  Principles of Data Mining , 2001, Undergraduate Topics in Computer Science.

[3]  Mohammed J. Zaki,et al.  SPADE: An Efficient Algorithm for Mining Frequent Sequences , 2004, Machine Learning.

[4]  Jian Pei,et al.  Mining frequent patterns without candidate generation , 2000, SIGMOD 2000.

[5]  Vipin Kumar,et al.  Introduction to Data Mining, (First Edition) , 2005 .

[6]  Hiroki Arimura,et al.  LCM ver. 2: Efficient Mining Algorithms for Frequent/Closed/Maximal Itemsets , 2004, FIMI.

[7]  Mohammed J. Zaki,et al.  CHARM: An Efficient Algorithm for Closed Itemset Mining , 2002, SDM.

[8]  Pavel A. Pevzner,et al.  Computational molecular biology : an algorithmic approach , 2000 .

[9]  Simon Parsons,et al.  Principles of Data Mining by David J. Hand, Heikki Mannila and Padhraic Smyth, MIT Press, 546 pp., £34.50, ISBN 0-262-08290-X , 2004, The Knowledge Engineering Review.

[10]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[11]  Tarek F. Abdelzaher,et al.  2008 International Conference on Information Processing in Sensor Networks A Practical Multi-Channel Media Access Control Protocol for Wireless Sensor Networks ∗ , 2022 .

[12]  Tarek F. Abdelzaher,et al.  Towards Diagnostic Simulation in Sensor Networks , 2008, DCOSS.

[13]  Jiawei Han,et al.  Dustminer: troubleshooting interactive complexity bugs in sensor networks , 2008, SenSys '08.

[14]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[15]  Deborah Estrin,et al.  Directed diffusion for wireless sensor networking , 2003, TNET.

[16]  Xifeng Yan,et al.  CloSpan: Mining Closed Sequential Patterns in Large Datasets , 2003, SDM.

[17]  Johannes Gehrke,et al.  Mining Very Large Databases , 1999, Computer.

[18]  Tarek F. Abdelzaher,et al.  SNTS: Sensor Network Troubleshooting Suite , 2007, DCOSS.

[19]  Ian Witten,et al.  Data Mining , 2000 .

[20]  Theodore Johnson,et al.  Exploratory Data Mining and Data Cleaning , 2003 .

[21]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[22]  Jiawei Han,et al.  Frequent pattern mining: current status and future directions , 2007, Data Mining and Knowledge Discovery.

[23]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[24]  Ramakrishnan Srikant,et al.  Mining Sequential Patterns: Generalizations and Performance Improvements , 1996, EDBT.

[25]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[26]  Jianyong Wang,et al.  Mining sequential patterns by pattern-growth: the PrefixSpan approach , 2004, IEEE Transactions on Knowledge and Data Engineering.

[27]  Zhaohui Tang,et al.  Data Mining with SQL Server 2005 , 2005 .

[28]  Gösta Grahne,et al.  Efficiently Using Prefix-trees in Mining Frequent Itemsets , 2003, FIMI.

[29]  Sholom M. Weiss,et al.  Predictive data mining - a practical guide , 1997 .

[30]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.