A Service Selection Framework for Anomaly Detection in IoT Stream Data

Many anomaly detection algorithms have been provided as services for more convenient and efficient utilization in IoT era. Due to concept drift existed in dynamic IoT stream data, it is a changeling task to apply proper anomaly detection services at run time. For effective on-line anomalous data discovery, this paper proposes a service selection framework to dynamically select and configure anomaly detection services. A fast classification model based on XGBoost is trained to identify the pattern of various stream data, so that suitable service can be selected and configured according to the pattern of stream data. Extensive experiments on real and synthetic data sets show that our framework can select suitable service for different scenarios, and the accuracy of the chosen services outperforms state-of-the-art methods.

[1]  Chao Yi,et al.  Time-Series Anomaly Detection Service at Microsoft , 2019, KDD.

[2]  Subutai Ahmad,et al.  Evaluating Real-Time Anomaly Detection Algorithms -- The Numenta Anomaly Benchmark , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[3]  Lovekesh Vig,et al.  Long Short Term Memory Networks for Anomaly Detection in Time Series , 2015, ESANN.

[4]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[5]  Takehisa Yairi,et al.  Anomaly Detection Using Autoencoders with Nonlinear Dimensionality Reduction , 2014, MLSDA'14.

[6]  J. Ma,et al.  Time-series novelty detection using one-class support vector machines , 2003, Proceedings of the International Joint Conference on Neural Networks, 2003..

[7]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[8]  Shen Su,et al.  An Approach to Modeling and Discovering Event Correlation for Service Collaboration , 2017, ICSOC.

[9]  Nenad Stojanovic,et al.  Big-data-driven anomaly detection in industry (4.0): An approach and a case study , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[10]  Yanbo Han,et al.  A Data-Driven Service Creation Approach for Effectively Capturing Events from Multiple Sensor Streams , 2019, 2019 IEEE International Conference on Web Services (ICWS).

[11]  Guoqiang Peter Zhang,et al.  Time series forecasting using a hybrid ARIMA and neural network model , 2003, Neurocomputing.

[12]  Vipin Kumar,et al.  Comparative Evaluation of Anomaly Detection Techniques for Sequence Data , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[13]  Jehn-Ruey Jiang,et al.  Anomaly Detection for Univariate Time Series with Statistics and Deep Learning , 2019, 2019 IEEE Eurasia Conference on IOT, Communication and Engineering (ECICE).

[14]  Slim Abdennadher,et al.  Enhancing one-class support vector machines for unsupervised anomaly detection , 2013, ODD '13.

[15]  Shen Su,et al.  A Proactive Service Model Facilitating Stream Data Fusion and Correlation , 2017, Int. J. Web Serv. Res..

[16]  Lovekesh Vig,et al.  LSTM-based Encoder-Decoder for Multi-sensor Anomaly Detection , 2016, ArXiv.

[17]  Fei Tony Liu,et al.  Isolation-Based Anomaly Detection , 2012, TKDD.

[18]  Saeed Amizadeh,et al.  Generic and Scalable Framework for Automated Time-series Anomaly Detection , 2015, KDD.

[19]  Shenglin Zhang,et al.  Rapid Deployment of Anomaly Detection Models for Large Number of Emerging KPI Streams , 2018, 2018 IEEE 37th International Performance Computing and Communications Conference (IPCCC).

[20]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[21]  Tsuyoshi Idé Why Does Subsequence Time-Series Clustering Produce Sine Waves? , 2006, PKDD.

[22]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[23]  Sebastian Wagner,et al.  Anomaly Detection in Univariate Time-series: A Survey on the State-of-the-Art , 2020, ArXiv.

[24]  Valeria De Antonellis,et al.  A Relevance-based approach for Big Data Exploration , 2019, Future Gener. Comput. Syst..

[25]  B. Nelson,et al.  Statistical methodology: V. Time series analysis using autoregressive integrated moving average (ARIMA) models. , 1998, Academic emergency medicine : official journal of the Society for Academic Emergency Medicine.