Knowledge discovery for behavioral patterns in wireless sensor networks

The research consolidated in this thesis is motivated by the recent evolvement of wireless technologies and microelectronic devices, which instigated the emergence of Wireless Sensor Networks (WSNs). Many WSN-based applications have come about; these applications are related, but not limited, to the fields of military, environment and health care. Research in WSNs is still in its early stages, and efforts have been put forward to design fast, reliable, and fault-tolerant protocols that guarantee acceptable levels of quality for events delivery, to meet the limited capabilities of sensor nodes and the effects of unreliable wireless communication. In this thesis, we focus on the design of a Knowledge-based framework for extracting behavioral patterns regarding sensor nodes from WSNs. Three types of behavioral patterns are introduced: Sensor Association Rules, Coverage-based Rules and Sensor Chronological Patterns. The proper steps in the Knowledge Discovery process that pertain to the extraction of the behavioral patterns are defined. These steps are: (i) a formal definition of the required ‘knowledge’; (ii) the data preparation stage that covers the communication aspects of the process of preparing data that is needed to extract these patterns; (iii) the data mining techniques that are essential for extracting the required patterns. A set of schemes have been proposed to attain these steps, and meet the critical properties of WSNs. In contrast to other techniques, the proposed behavioral patterns are mainly about the sensor nodes, instead of the area under monitoring. The direct application of the proposed patterns is enhancing the performance of WSNs by participating in the resource management process of sensor nodes, and reducing the undesired cons of wireless communication; thus improving the Quality of Service of WSNs. Several experiments have been conducted, using synthetic and real data, to report about the performance of proposed schemes.

[1]  Philip S. Yu,et al.  An effective hash-based algorithm for mining association rules , 1995, SIGMOD '95.

[2]  Gösta Grahne,et al.  Efficiently Using Prefix-trees in Mining Frequent Itemsets , 2003, FIMI.

[3]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[4]  Liang Ding,et al.  Agent Collaborative Target Localization and Classification in Wireless Sensor Networks , 2007, Sensors (Basel, Switzerland).

[5]  Kay Römer,et al.  An Adaptive Strategy for Quality-Based Data Reduction in Wireless Sensor Networks , 2006 .

[6]  N. Noury,et al.  Monitoring behavior in home using a smart fall sensor and position sensors , 2000, 1st Annual International IEEE-EMBS Special Topic Conference on Microtechnologies in Medicine and Biology. Proceedings (Cat. No.00EX451).

[7]  Kay Römer,et al.  Distributed Mining of Spatio-Temporal Event Patterns in Sensor Networks , 2007 .

[8]  Yu-Chee Tseng,et al.  The Coverage Problem in a Wireless Sensor Network , 2003, WSNA '03.

[9]  Nathalie Japkowicz,et al.  The Class Imbalance Problem: Significance and Strategies , 2000 .

[10]  Bin Shen,et al.  Ontology-based Association Rules Retrieval using Protege Tools , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[11]  B. R. Badrinath,et al.  Energy map construction for wireless sensor network under a finite energy budget , 2004, MSWiM '04.

[12]  Wei Hong,et al.  Proceedings of the 5th Symposium on Operating Systems Design and Implementation Tag: a Tiny Aggregation Service for Ad-hoc Sensor Networks , 2022 .

[13]  Kay Römer,et al.  Middleware challenges for wireless sensor networks , 2002, MOCO.

[14]  Samuel Madden,et al.  PAQ: Time Series Forecasting for Approximate Query Answering in Sensor Networks , 2006, EWSN.

[15]  Margaret H. Dunham,et al.  Data Mining: Introductory and Advanced Topics , 2002 .

[16]  David B. Skillicorn,et al.  A Distributed Approach for Prediction in Sensor Networks , 2005 .

[17]  Shiow-Fen Hwang,et al.  A Cluster-Based Coverage-Preserved Node Scheduling Scheme in Wireless Sensor Networks , 2006, 2006 Third Annual International Conference on Mobile and Ubiquitous Systems: Networking & Services.

[18]  Carson Kai-Sang Leung,et al.  Distributed Mining of Constrained Patterns from Wireless Sensor Data , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Workshops.

[19]  Kay Römer,et al.  The design space of wireless sensor networks , 2004, IEEE Wireless Communications.

[20]  Luca Lombardi,et al.  Challenges for Data Mining in Distributed Sensor Networks , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[21]  Jian Pei,et al.  Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach , 2006, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06).

[22]  Chee-Yee Chong,et al.  Sensor networks: evolution, opportunities, and challenges , 2003, Proc. IEEE.

[23]  Mohamed F. Younis,et al.  A survey on routing protocols for wireless sensor networks , 2005, Ad Hoc Networks.

[24]  Jennifer Widom,et al.  Models and issues in data stream systems , 2002, PODS.

[25]  Wendi B. Heinzelman,et al.  Adaptive protocols for information dissemination in wireless sensor networks , 1999, MobiCom.

[26]  Lindsay I. Smith,et al.  A tutorial on Principal Components Analysis , 2002 .

[27]  Devavrat Shah,et al.  Turbo-charging vertical mining of large databases , 2000, SIGMOD '00.

[28]  John Anderson,et al.  Wireless sensor networks for habitat monitoring , 2002, WSNA '02.

[29]  Azzedine Boukerche,et al.  A low latency and energy aware event ordering algorithm for wireless actor and sensor networks , 2005, MSWiM '05.

[30]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[31]  Gianluca Bontempi,et al.  An adaptive modular approach to the mining of sensor network data , 2005 .

[32]  Hannu Toivonen,et al.  Data Mining In Bioinformatics , 2005 .

[33]  Peter Desnoyers,et al.  PRESTO: A Predictive Storage Architecture for Sensor Networks , 2005, HotOS.

[34]  Hendrik Blockeel,et al.  Web mining research: a survey , 2000, SKDD.

[35]  Mauro Birattari,et al.  Lazy Learning Meets the Recursive Least Squares Algorithm , 1998, NIPS.

[36]  Bernard Widrow,et al.  Least-mean-square adaptive filters , 2003 .

[37]  Edward Y. Chang,et al.  Adaptive stream resource management using Kalman Filters , 2004, SIGMOD '04.

[38]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery: An Overview , 1996, Advances in Knowledge Discovery and Data Mining.

[39]  Yi Hu,et al.  A data mining approach for database intrusion detection , 2004, SAC '04.

[40]  Leonidas J. Guibas,et al.  Wireless sensor networks - an information processing approach , 2004, The Morgan Kaufmann series in networking.

[41]  Panganamala Ramana Kumar Ad Hoc Wireless Networks: Analysis, Protocols, Architecture, and Convergence , 2001, Infrastructure for Mobile and Wireless Systems.

[42]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[43]  Nael B. Abu-Ghazaleh,et al.  A taxonomy of wireless micro-sensor network models , 2002, MOCO.

[44]  Rajeev Motwani,et al.  Dynamic itemset counting and implication rules for market basket data , 1997, SIGMOD '97.

[45]  Anne-Marie Kermarrec,et al.  The many faces of publish/subscribe , 2003, CSUR.

[46]  Wendi Heinzelman,et al.  Energy-efficient communication protocol for wireless microsensor networks , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[47]  Mohammed J. Zaki Scalable Algorithms for Association Mining , 2000, IEEE Trans. Knowl. Data Eng..

[48]  Silvia Santini,et al.  Adaptive model selection for time series prediction in wireless sensor networks , 2007, Signal Process..

[49]  Hongjun Lu,et al.  H-mine: hyper-structure mining of frequent patterns in large databases , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[50]  Philip S. Yu,et al.  ASAP: An Adaptive Sampling Approach to Data Collection in Sensor Networks , 2007, IEEE Transactions on Parallel and Distributed Systems.

[51]  Cauligi S. Raghavendra,et al.  PEGASIS: Power-efficient gathering in sensor information systems , 2002, Proceedings, IEEE Aerospace Conference.

[52]  Jianyong Wang,et al.  Mining sequential patterns by pattern-growth: the PrefixSpan approach , 2004, IEEE Transactions on Knowledge and Data Engineering.

[53]  Feng Zhao,et al.  Scalable Information-Driven Sensor Querying and Routing for Ad Hoc Heterogeneous Sensor Networks , 2002, Int. J. High Perform. Comput. Appl..

[54]  Osmar R. Zaïane,et al.  Non-recursive Generation of Frequent K-itemsets from Frequent Pattern Tree Representations , 2003, DaWaK.

[55]  Guang-Zhong Yang,et al.  Pervasive body sensor network: an approach to monitoring the post-operative surgical patient , 2006, International Workshop on Wearable and Implantable Body Sensor Networks (BSN'06).

[56]  Lukasz Golab,et al.  Issues in data stream management , 2003, SGMD.

[57]  Charu C. Aggarwal,et al.  A Tree Projection Algorithm for Generation of Frequent Item Sets , 2001, J. Parallel Distributed Comput..

[58]  Sharad Mehrotra,et al.  Capturing sensor-generated time series with quality guarantees , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[59]  Ian F. Akyildiz,et al.  Wireless sensor and actor networks: research challenges , 2004, Ad Hoc Networks.

[60]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[61]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[62]  Fabrizio Silvestri,et al.  Adaptive and resource-aware mining of frequent sets , 2002, 2002 IEEE International Conference on Data Mining, 2002. Proceedings..

[63]  Krishna M. Sivalingam,et al.  Learning from class-imbalanced data in wireless sensor networks , 2003, 2003 IEEE 58th Vehicular Technology Conference. VTC 2003-Fall (IEEE Cat. No.03CH37484).

[64]  Yu Hen Hu,et al.  Vehicle classification in distributed sensor networks , 2004, J. Parallel Distributed Comput..

[65]  Jie Chen,et al.  Survey on Coverage Problems in Wireless Ad Hoc Sensor Networks , 2007 .

[66]  Deborah Estrin,et al.  Modelling Data-Centric Routing in Wireless Sensor Networks , 2002 .

[67]  B. R. Badrinath,et al.  Prediction-based energy map for wireless sensor networks , 2003, Ad Hoc Networks.

[68]  Wei Hong,et al.  Approximate Data Collection in Sensor Networks using Probabilistic Models , 2006, 22nd International Conference on Data Engineering (ICDE'06).

[69]  S. Sitharama Iyengar,et al.  Distributed Bayesian algorithms for fault-tolerant event region detection in wireless sensor networks , 2004, IEEE Transactions on Computers.

[70]  C. Ordonez,et al.  Constraining and summarizing association rules in medical data , 2006 .

[71]  Chenyang Lu,et al.  SPEED: a stateless protocol for real-time communication in sensor networks , 2003, 23rd International Conference on Distributed Computing Systems, 2003. Proceedings..

[72]  Mohammed J. Zaki,et al.  Efficiently mining maximal frequent itemsets , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[73]  Peter Desnoyers,et al.  Ultra-low power data storage for sensor networks , 2006, 2006 5th International Conference on Information Processing in Sensor Networks.

[74]  Shamkant B. Navathe,et al.  An Efficient Algorithm for Mining Association Rules in Large Databases , 1995, VLDB.

[75]  Randy H. Katz,et al.  An architecture for building self-configurable systems , 2000, 2000 First Annual Workshop on Mobile and Ad Hoc Networking and Computing. MobiHOC (Cat. No.00EX444).

[76]  Azzedine Boukerche,et al.  A fast and reliable protocol for wireless sensor networks in critical conditions monitoring applications , 2004, MSWiM '04.

[77]  Deborah Estrin,et al.  Directed diffusion for wireless sensor networking , 2003, TNET.

[78]  Di Tian,et al.  A coverage-preserving node scheduling scheme for large wireless sensor networks , 2002, WSNA '02.

[79]  Ben Kao,et al.  Online Algorithms for Mining Inter-stream Associations from Large Sensor Networks , 2005, PAKDD.

[80]  Deborah Estrin,et al.  Rumor Routing Algorithm For Sensor Networks , 2002 .

[81]  Lars Schmidt-Thieme,et al.  The Relation of Closed Itemset Mining, Complete Pruning Strategies and Item Ordering in Apriori-Based FIM Algorithms , 2005, PKDD.

[82]  Jan M. Rabaey,et al.  Energy aware routing for low energy ad hoc sensor networks , 2002, 2002 IEEE Wireless Communications and Networking Conference Record. WCNC 2002 (Cat. No.02TH8609).

[83]  Mohamed Medhat Gaber,et al.  Knowledge Discovery from Sensor Data , 2008 .

[84]  Mani Srivastava,et al.  Energy efficient routing in wireless sensor networks , 2001, 2001 MILCOM Proceedings Communications for Network-Centric Operations: Creating the Information Force (Cat. No.01CH37277).

[85]  Mohammed J. Zaki Parallel and distributed association mining: a survey , 1999, IEEE Concurr..

[86]  Ian F. Akyildiz,et al.  Wireless sensor networks: a survey , 2002, Comput. Networks.

[87]  L. Yann-Ael,et al.  Round Robin Cycle for Predictions in Wireless Sensor Networks , 2005, 2005 International Conference on Intelligent Sensors, Sensor Networks and Information Processing.

[88]  Heikki Mannila,et al.  Discovering Frequent Episodes in Sequences , 1995, KDD.

[89]  Bruce H. Krogh,et al.  Lightweight detection and classification for wireless sensor networks in realistic environments , 2005, SenSys '05.

[90]  Raj P. Gopalan,et al.  CT-ITL : Efficient Frequent Item Set Mining Using a Compressed Prefix Tree with Pattern Growth , 2003, ADC.

[91]  Tomasz Imielinski,et al.  Prediction-based monitoring in sensor networks: taking lessons from MPEG , 2001, CCRV.

[92]  I. Davidson,et al.  Distributed Pre-Processing of Data on Networks of Berkeley Motes using Non-Parametric EM , 2005 .

[93]  Raj P. Gopalan,et al.  TreeITL-Mine: Mining Frequent Itemsets Using Pattern Growth, Tid Intersection, and Prefix Tree , 2002, Australian Joint Conference on Artificial Intelligence.

[94]  Le Gruenwald,et al.  Estimating Missing Values in Related Sensor Data Streams , 2005, COMAD.

[95]  Carlos Ordonez,et al.  Discovering Interesting Association Rules in Medical Data , 2000, ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery.

[96]  Wei Hong,et al.  Model-Driven Data Acquisition in Sensor Networks , 2004, VLDB.

[97]  K. Römer Temporal Message Ordering in Wireless Sensor Networks , 2002 .

[98]  Jennifer Widom,et al.  Adaptive filters for continuous queries over distributed data streams , 2003, SIGMOD '03.

[99]  Aleksandar Milenkovic,et al.  Wireless sensor networks for personal health monitoring: Issues and an implementation , 2006, Comput. Commun..