Sensor Field Resource Management for Sensor Network Data Mining

This research is motivated by data mining for wireless sensor network applications. The authors consider applications where data is acquired in real-time, and thus data mining is performed on live streams of data rather than on stored databases. One challenge in supporting such applications is that sensor node power is a precious resource that needs to be managed as such. To conserve energy in the sensor field, the authors propose and evaluate several approaches to acquiring, and then caching data in a sensor field data server. The authors show that for true real-time applications, for which response time dictates data quality, policies that emulate cache hits by computing and returning approximate values for sensor data yield a simultaneous quality improvement and cost saving. This “win-win” is because when data acquisition response time is sufficiently important, the decrease in resource consumption and increase in data quality achieved by using approximate values outweighs the negative impact on data accuracy due to the approximation. In contrast, when data accuracy drives quality, a linear trade-off between resource consumption and data accuracy emerges. The authors then identify caching and lookup policies for which the sensor field query rate is bounded when servicing an arbitrary workload of user queries. This upper bound is achieved by having multiple user queries share the cost of a sensor field query. Finally, the authors discuss the challenges facing sensor network data mining applications in terms of data collection, warehousing, and mining techniques. DOI: 10.4018/978-1-60566-328-9.ch013

[1]  Le Gruenwald,et al.  Estimating Missing Values in Related Sensor Data Streams , 2005, COMAD.

[2]  Samuel H. Fuller,et al.  The C.mmp Multiprocessor , 1978 .

[3]  Geoff Hulten,et al.  Mining time-changing data streams , 2001, KDD '01.

[4]  Wei Hong,et al.  Model-Driven Data Acquisition in Sensor Networks , 2004, VLDB.

[5]  Dimitrios Katsaros Tree and Graph Mining , 2009, Encyclopedia of Data Warehousing and Mining.

[6]  Mohamed A. Sharaf,et al.  Balancing energy efficiency and quality of aggregate data in sensor networks , 2004, The VLDB Journal.

[7]  Sudipto Guha,et al.  Clustering Data Streams , 2000, FOCS.

[8]  Azzedine Boukerche,et al.  A Novel Data Mining Technique for Extracting Events and Inter Knowledge based Information from Wireless Sensor Networks , 2006, Proceedings. 2006 31st IEEE Conference on Local Computer Networks.

[9]  Nael B. Abu-Ghazaleh,et al.  Infrastructure tradeoffs for sensor networks , 2002, WSNA '02.

[10]  Erich M. Nahum,et al.  Data Quality and Query Cost in Pervasive Sensing Systems , 2008, 2008 Sixth Annual IEEE International Conference on Pervasive Computing and Communications (PerCom).

[11]  Chieh-Yih Wan,et al.  CODA: congestion detection and avoidance in sensor networks , 2003, SenSys '03.

[12]  Rajmohan Rajaraman,et al.  The Cougar Project: a work-in-progress report , 2003, SGMD.

[13]  Ari Luotonen,et al.  World-Wide Web Proxies , 1994, Comput. Networks ISDN Syst..

[14]  LastMark Online classification of nonstationary data streams , 2002 .

[15]  Chenyang Lu,et al.  RAP: a real-time communication architecture for large-scale wireless sensor networks , 2002, Proceedings. Eighth IEEE Real-Time and Embedded Technology and Applications Symposium.

[16]  Deborah Estrin,et al.  Directed diffusion for wireless sensor networking , 2003, TNET.

[17]  Wei Hong,et al.  TinyDB: an acquisitional query processing system for sensor networks , 2005, TODS.

[18]  Deborah Estrin,et al.  The Tenet architecture for tiered sensor networks , 2006, SenSys '06.

[19]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[20]  Ben Kao,et al.  Online Algorithms for Mining Inter-stream Associations from Large Sensor Networks , 2005, PAKDD.

[21]  Rajeev Motwani,et al.  Maintaining variance and k-medians over data stream windows , 2003, PODS.

[22]  Mani Srivastava,et al.  Energy-aware wireless microsensor networks , 2002, IEEE Signal Process. Mag..

[23]  Ramesh Govindan,et al.  Understanding packet delivery performance in dense wireless sensor networks , 2003, SenSys '03.

[24]  Richard Granger,et al.  Beyond Incremental Processing: Tracking Concept Drift , 1986, AAAI.

[25]  Philip S. Yu,et al.  On demand classification of data streams , 2004, KDD.

[26]  Shonali Krishnaswamy,et al.  Mining data streams: a review , 2005, SGMD.

[27]  Charu C. Aggarwal,et al.  A framework for diagnosing changes in evolving data streams , 2003, SIGMOD '03.

[28]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[29]  Mung Chiang,et al.  The value of clustering in distributed estimation for sensor networks , 2005, 2005 International Conference on Wireless Networks, Communications and Mobile Computing.

[30]  Srinivasan Seshan,et al.  IrisNet: An Architecture for a Worldwide Sensor Web , 2003, IEEE Pervasive Comput..

[31]  Philip Levis,et al.  The design and implementation of a declarative sensor network system , 2007, SenSys '07.

[32]  Rina Panigrahy,et al.  Better streaming algorithms for clustering problems , 2003, STOC '03.

[33]  Philip Calvert,et al.  Encyclopedia of Data Warehousing and Mining , 2006 .

[34]  Geoff Hulten,et al.  A General Method for Scaling Up Machine Learning Algorithms and its Application to Clustering , 2001, ICML.

[35]  Jennifer Widom,et al.  Models and issues in data stream systems , 2002, PODS.

[36]  Ramesh Govindan,et al.  Macro-programming Wireless Sensor Networks Using Kairos , 2005, DCOSS.

[37]  Ossama Younis,et al.  HEED: a hybrid, energy-efficient, distributed clustering approach for ad hoc sensor networks , 2004, IEEE Transactions on Mobile Computing.

[38]  Christopher M. Bishop,et al.  Neural networks for pattern recognition , 1995 .

[39]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[40]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[41]  Konstantinos Psounis,et al.  Modeling spatially correlated data in sensor networks , 2006, TOSN.

[42]  Archan Misra,et al.  CAPS: energy-efficient processing of continuous aggregate queries in sensor networks , 2006, Fourth Annual IEEE International Conference on Pervasive Computing and Communications (PERCOM'06).

[43]  Saurabh Ganeriwal,et al.  Aggregation in sensor networks: an energy-accuracy trade-off , 2003, Ad Hoc Networks.

[44]  M. Welsh,et al.  The Regiment Macroprogramming System , 2007, 2007 6th International Symposium on Information Processing in Sensor Networks.

[45]  John Scott Bridle,et al.  Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition , 1989, NATO Neurocomputing.

[46]  Herb Schwetman,et al.  Introduction to process-oriented simulation and CSIM , 1990, 1990 Winter Simulation Conference Proceedings.

[47]  Azzedine Boukerche,et al.  A New Representation Structure for Mining Association Rules from Wireless Sensor Networks , 2007, 2007 IEEE Wireless Communications and Networking Conference.

[48]  Mahadev Satyanarayanan,et al.  Scale and performance in a distributed file system , 1988, TOCS.

[49]  Christos Faloutsos,et al.  Adaptive, Hands-Off Stream Mining , 2003, VLDB.

[50]  Viktor K. Prasanna,et al.  Energy-latency tradeoffs for data gathering in wireless sensor networks , 2004, IEEE INFOCOM 2004.