Detecting time-related changes in Wireless Sensor Networks using symbol compression and Probabilistic Suffix Trees

Our research focuses on anomaly detection problems in unknown environments using Wireless Sensor Networks (WSN). We are interested in detecting two types of abnormal events: sensory level anomalies (e.g., noise in an office without lights on) and time-related anomalies (e.g., freezing temperature in a mid-summer day).We present a novel, distributed, machine learning based anomaly detector that is able to detect time-related changes. It consists of three components. First, a Fuzzy Adaptive Resonance Theory (ART) neural network classifier is used to label multi-dimensional sensor data into discrete classes and detect sensory level anomalies. Over time, the labeled classes form a sequence of classes. Next, a symbol compressor is used to extract the semantic meaning of the temporal sequence. Finally, a Variable Memory Markov (VMM) model in the form of a Probabilistic Suffix Tree (PST) is used to model and detect time-related anomalies in the environment. To our knowledge, this is the first work that analyzes/models the temporal sensor data using a symbol compressor and PST in a WSN. Our proposed detection algorithm is distributed, “light-weight”, and practical for resource constrained sensor nodes. We verify the proposed approach using a volcano monitoring dataset. Our results show that this approach yields the same performance as the traditional Markov models with much less cost.

[1]  Alberto Apostolico,et al.  Optimal amnesic probabilistic automata or how to learn and classify proteins in linear time and space , 2000, RECOMB '00.

[2]  Lynne E. Parker,et al.  Detecting and monitoring time-related abnormal events using a wireless sensor network and mobile robot , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Samuel Madden,et al.  PAQ: Time Series Forecasting for Approximate Query Answering in Sensor Networks , 2006, EWSN.

[4]  G. Blelloch Introduction to Data Compression * , 2022 .

[5]  Lynne E. Parker,et al.  A spatial-temporal imputation technique for classification with missing data in a wireless sensor network , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[6]  Richard Ford,et al.  Probabilistic suffix models for API sequence analysis of Windows XP applications , 2008, Pattern Recognit..

[7]  Dana Ron,et al.  The power of amnesia: Learning probabilistic automata with variable memory length , 1996, Machine Learning.

[8]  Ming-Syan Chen,et al.  Exploring Group Moving Pattern for an Energy-Constrained Object Tracking Sensor Network , 2007, PAKDD.

[9]  Alex Wang,et al.  Signal Interpretation of Multifunction Radars: Modeling and Statistical Signal Processing With Stochastic Context Free Grammar , 2008, IEEE Transactions on Signal Processing.

[10]  Edward Y. Chang,et al.  Adaptive stream resource management using Kalman Filters , 2004, SIGMOD '04.

[11]  Matt Welsh,et al.  Fidelity and yield in a volcano monitoring sensor network , 2006, OSDI '06.

[12]  Danco Davcev,et al.  Tracking of unusual events in wireless sensor networks based on artificial neural-networks algorithms , 2005, International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II.

[13]  Douglas A. Reynolds,et al.  Speaker Verification Using Adapted Gaussian Mixture Models , 2000, Digit. Signal Process..