Hierarchical distributed data classification in wireless sensor networks

Wireless sensor networks promise an unprecedented opportunity to monitor physical environments via inexpensive wireless embedded devices. Given the sheer amount of sensed data, efficient classification of them becomes a critical task in many sensor network applications. The large scale and the stringent energy constraints of such networks however challenge the conventional classification techniques that demand enormous storage space and centralized computation. In this paper, we propose a novel decision-tree-based hierarchical distributed classification approach, in which local classifiers are built by individual sensors and merged along the routing path forming a spanning tree. The classifiers are iteratively enhanced by combining strategically generated pseudo data and new local data, eventually converging to a global classifier for the whole network. We also introduce some control factors to facilitate the effectiveness of our approach. Through extensive simulations, we study the impact of the introduced control factors, and demonstrate that our approach maintains high classification accuracy with very low storage and communication overhead. The approach also addresses a critical issue of heterogeneous data distribution among the sensors.

[1]  Parameswaran Ramanathan,et al.  Distributed target classification and tracking in sensor networks , 2003 .

[2]  Bruce H. Krogh,et al.  Lightweight detection and classification for wireless sensor networks in realistic environments , 2005, SenSys '05.

[3]  Tolga Coplu,et al.  SENDROM: Sensor networks for disaster relief operations management , 2007, Wirel. Networks.

[4]  Anantha P. Chandrakasan,et al.  An application-specific protocol architecture for wireless microsensor networks , 2002, IEEE Trans. Wirel. Commun..

[5]  Matt Welsh,et al.  Deploying a wireless sensor network on an active volcano , 2006, IEEE Internet Computing.

[6]  Zoran Obradovic,et al.  The distributed boosting algorithm , 2001, KDD '01.

[7]  Madjid Merabti,et al.  A Node Recovery Scheme for Data Dissemination in Wireless Sensor Networks , 2007, 2007 IEEE International Conference on Communications.

[8]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[9]  Vinayak S. Naik,et al.  A line in the sand: a wireless sensor network for target detection, classification, and tracking , 2004, Comput. Networks.

[10]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[11]  Ian F. Akyildiz,et al.  Sensor Networks , 2002, Encyclopedia of GIS.

[12]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[13]  Hui Xiong,et al.  Distributed classification in peer-to-peer networks , 2007, KDD '07.

[14]  John Anderson,et al.  Wireless sensor networks for habitat monitoring , 2002, WSNA '02.

[15]  Krishna M. Sivalingam,et al.  Data Gathering Algorithms in Sensor Networks Using Energy Metrics , 2002, IEEE Trans. Parallel Distributed Syst..

[16]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[17]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[18]  Yoav Freund,et al.  Boosting a weak learning algorithm by majority , 1995, COLT '90.

[19]  Ying Zhang,et al.  Combs, needles, haystacks: balancing push and pull for discovery in large-scale sensor networks , 2004, SenSys '04.

[20]  Jian Pei,et al.  Hierarchical distributed data classification inwireless sensor networks , 2009, 2009 IEEE 6th International Conference on Mobile Adhoc and Sensor Systems.

[21]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[22]  Nitesh V. Chawla,et al.  Learning Ensembles from Bites: A Scalable and Accurate Approach , 2004, J. Mach. Learn. Res..

[23]  Biswanath Mukherjee,et al.  Wireless sensor network survey , 2008, Comput. Networks.

[24]  Vipin Kumar,et al.  Introduction to Data Mining, (First Edition) , 2005 .

[25]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[26]  Satish Kumar,et al.  Next century challenges: scalable coordination in sensor networks , 1999, MobiCom.