An Intelligent Outlier Detection Method With One Class Support Tucker Machine and Genetic Algorithm Toward Big Sensor Data in Internet of Things

Various types of sensor data can be collected by the Internet of Things (IoT). Each sensor node has spatial attributes and may also be associated with a large number of measurement data that evolve over time; therefore, these high-dimensional sensor data are inherently large scale. Detecting outliers in large-scale IoT sensor data is a challenging task. Most existing anomaly detection methods are based on a vector representation. However, large-scale IoT sensor data have characteristics that make tensor methods more efficient for extracting information. The vector-based methods can destroy original structural information and correlation within large-scale sensor data, resulting in the problem of the “curse of dimensionality,” and some outliers hence cannot be detected. In this paper, we propose a one-class support Tucker machine (OCSTuM) and an OCSTuM based on tensor Tucker factorization and a genetic algorithm called GA-OCSTuM. These methods extend one-class support vector machines to tensor space. OCSTuM and GA-OCSTuM are unsupervised anomaly detection approaches for big sensor data. They retain the structural information of data while improving the accuracy and efficiency of anomaly detection. The experimental evaluations on real data sets demonstrate that our proposed method improves the accuracy and efficiency of anomaly detection while retaining the intrinsic structure of big sensor data.

[1]  Minqiang Xu,et al.  A fault diagnosis scheme for planetary gearboxes using adaptive multi-scale morphology filter and modified hierarchical permutation entropy , 2018 .

[2]  João Gama,et al.  Event detection from traffic tensors: A hybrid model , 2016, Neurocomputing.

[3]  Irene Kotsia,et al.  Support tucker machines , 2011, CVPR 2011.

[4]  M. Gallo,et al.  Three-way compositional analysis of water quality monitoring data , 2014, Environmental and Ecological Statistics.

[5]  James Bailey,et al.  R1STM: One-class Support Tensor Machine with Randomised Kernel , 2016, SDM.

[6]  Johan A. K. Suykens,et al.  A kernel-based framework to tensorial data analysis , 2011, Neural Networks.

[7]  Cheng-Lung Huang,et al.  A GA-based feature selection and parameters optimizationfor support vector machines , 2006, Expert Syst. Appl..

[8]  Jiawei Han,et al.  Learning with Tensor Representation , 2006 .

[9]  Xiaowei Yang,et al.  A Linear Support Higher-Order Tensor Machine for Classification , 2013, IEEE Transactions on Image Processing.

[10]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..

[11]  Minqiang Xu,et al.  A Combined Model-Based and Intelligent Method for Small Fault Detection and Isolation of Actuators , 2016, IEEE Transactions on Industrial Electronics.

[12]  Xiaowei Yang,et al.  A Low-Rank Approximation-Based Transductive Support Tensor Machine for Semisupervised Classification , 2015, IEEE Transactions on Image Processing.

[13]  Jimeng Sun,et al.  Limestone: High-throughput candidate phenotype generation via tensor factorization , 2014, J. Biomed. Informatics.

[14]  Jie Yu,et al.  Quality relevant nonlinear batch process performance monitoring using a kernel based multiway non-Gaussian latent subspace projection approach , 2014 .

[15]  Haiping Lu,et al.  MPCA: Multilinear Principal Component Analysis of Tensor Objects , 2008, IEEE Transactions on Neural Networks.

[16]  Philip S. Yu,et al.  DuSK: A Dual Structure-preserving Kernel for Supervised Tensor Learning with Applications to Neuroimages , 2014, SDM.

[17]  Minqiang Xu,et al.  A fault diagnosis scheme for planetary gearboxes using modified multi-scale symbolic dynamic entropy and mRMR feature selection , 2017 .

[18]  Jianmin Jiang,et al.  Tucker decomposition-based tensor learning for human action recognition , 2015, Multimedia Systems.

[19]  Yanyan Chen,et al.  One-Class Support Tensor Machine , 2016, Knowl. Based Syst..

[20]  Weiwei Guo,et al.  Higher rank Support Tensor Machines for visual recognition , 2012, Pattern Recognit..

[21]  Douglas M. Hawkins Identification of Outliers , 1980, Monographs on Applied Probability and Statistics.

[22]  Xuelong Li,et al.  Supervised Tensor Learning , 2005, ICDM.

[23]  Jian Xu,et al.  Real time contextual collective anomaly detection over multiple data streams , 2014 .

[24]  Peng Jiang,et al.  Support high-order tensor data description for outlier detection in high-dimensional big sensor data , 2018, Future Gener. Comput. Syst..

[25]  Hans-Peter Kriegel,et al.  Angle-based outlier detection in high-dimensional data , 2008, KDD.

[26]  Clara Pizzuti,et al.  Distance-based detection and prediction of outliers , 2006, IEEE Transactions on Knowledge and Data Engineering.