Data driven hyperparameter optimization of one-class support vector machines for anomaly detection in wireless sensor networks

One-class support vector machines (OCSVM) have been recently applied to detect anomalies in wireless sensor networks (WSNs). Typically, OCSVM is kernelized by radial bais functions (RBF, or Gausian kernel) whereas selecting Gaussian kernel hyperparameter is based upon availability of anomalies, which is rarely applicable in practice. This article investigates the application of OCSVM to detect anomalies in WSNs with data-driven hyperparameter optimization. Specifically, the information of the farthest and the nearest neighbors of each sample is used to construct the objective cost instead of labeling based metrics such as geometric mean accuracy (G-mean) or area under the receiver operating characteristic (AUROC). The efficiency of this method is illustrated over the IBRL dataset whereas the resulting estimated boundary as well as anomaly detection performance are comparable with existing methods.

[1]  David A. Clifton,et al.  Novelty Detection with Multivariate Extreme Value Statistics , 2011, J. Signal Process. Syst..

[2]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[3]  Andrew S. Miner,et al.  Anomaly intrusion detection using one class SVM , 2004, Proceedings from the Fifth Annual IEEE SMC Information Assurance Workshop, 2004..

[4]  Kim Phuc Tran,et al.  The Efficiency of the 4-Out-of-5 Runs Rules Scheme for Monitoring the Ratio of Population Means of a Bivariate Normal Distribution , 2016 .

[5]  Marimuthu Palaniswami,et al.  Hyperspherical cluster based distributed anomaly detection in wireless sensor networks , 2014, J. Parallel Distributed Comput..

[6]  Biming Tian,et al.  Anomaly detection in wireless sensor networks: A survey , 2011, J. Netw. Comput. Appl..

[7]  Giovanni Celano,et al.  Monitoring the Ratio of Two Normal Variables Using EWMA Type Control Charts , 2016, Qual. Reliab. Eng. Int..

[8]  Philippe Castagliola,et al.  On the Performance of Shewhart median Chart in the Presence of Measurement Errors , 2017, Qual. Reliab. Eng. Int..

[9]  Kim Phuc Tran,et al.  The efficiency of CUSUM schemes for monitoring the coefficient of variation , 2016 .

[10]  Asma Rabaoui,et al.  Using One-Class SVMs and Wavelets for Audio Surveillance , 2008, IEEE Transactions on Information Forensics and Security.

[11]  Saad B. Qaisar,et al.  One-class support vector machines: analysis of outlier detection for wireless sensor networks in harsh environments , 2013, Artificial Intelligence Review.

[12]  Giovanni Celano,et al.  Monitoring the ratio of population means of a bivariate normal distribution using CUSUM type control charts , 2016, Statistical Papers.

[13]  Zhen Feng,et al.  A new approach of anomaly detection in wireless sensor networks using support vector data description , 2017, Int. J. Distributed Sens. Networks.

[14]  Ramesh Govindan,et al.  Sensor faults: Detection methods and prevalence in real-world datasets , 2010, TOSN.

[15]  Kim Phuc Tran,et al.  Run Rules median control charts for monitoring process mean in manufacturing , 2017, Qual. Reliab. Eng. Int..

[16]  Lin Zhang,et al.  Two methods of selecting Gaussian kernel parameters for one-class SVM and their application to fault detection , 2014, Knowl. Based Syst..

[17]  Bernhard Schölkopf,et al.  Estimating the Support of a High-Dimensional Distribution , 2001, Neural Computation.

[18]  O. Kalinina,et al.  Detection of atypical genes in virus families using a one-class SVM , 2014, BMC Genomics.

[19]  Joni-Kristian Kämäräinen,et al.  Gaussian mixture pdf in one-class classification: computing and utilizing confidence values , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[20]  Giovanni Celano,et al.  Monitoring the ratio of two normal variables using Run Rules type control charts , 2016 .

[21]  Wei Hong,et al.  TASK: sensor network in a box , 2005, Proceeedings of the Second European Workshop on Wireless Sensor Networks, 2005..