Anomaly detection in earth dam and levee passive seismic data using support vector machines and automatic feature selection

We investigate techniques for earth dam and levee health monitoring and for the automatic detection of anomalous events in passive seismic data. We have developed a novel data-driven workflow specific to our domain, which could be generalized to monitoring other systems that produce time series data. We apply machine learning to geophysical data collected from sensors located on the surface of the levee in order to identify internal erosion events. In this paper, we describe our experiments with two-class and one-class support vector machines (SVMs). We use two different data sets from experimental laboratory earth embankments (each containing approximately 80% normal and 20% anomalous observations) to ensure that our workflow is robust across multiple data sets and different types of anomalous events (e.g., cracks and piping). We apply wavelet-denoising techniques and extract nine spectral features from decomposed segments of the time series data. The two-class SVM with 10-fold cross-validation achieved over 94% overall accuracy and a 96% F1-score. For the one-class SVM (trained without labeled anomaly data), using the top features selected by our automatic feature selection algorithm improved overall accuracy from 83% to over 91% and the F1-score from 89% to over 95%. These results show that we can successfully separate normal from anomalous observations.
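
The one-class setting described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes scikit-learn's OneClassSVM, uses synthetic nine-dimensional vectors in place of the spectral features, and the RBF kernel, the value nu=0.1, the 6-sigma shift of the anomalies, and the choice of "normal" as the positive class for the F1-score are all illustrative assumptions. Only the 80%/20% test split and the number of features follow the abstract.

```python
import numpy as np
from sklearn.metrics import accuracy_score, f1_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import OneClassSVM

rng = np.random.default_rng(0)
N_FEATURES = 9  # nine features per segment, as in the abstract; values are synthetic

# Synthetic stand-ins for per-segment feature vectors (NOT the paper's data).
X_train = rng.normal(0.0, 1.0, size=(400, N_FEATURES))        # normal data only
X_test_normal = rng.normal(0.0, 1.0, size=(80, N_FEATURES))   # 80% normal
X_test_anomaly = rng.normal(6.0, 1.0, size=(20, N_FEATURES))  # 20% anomalous
X_test = np.vstack([X_test_normal, X_test_anomaly])

# scikit-learn convention: +1 = inlier (normal), -1 = outlier (anomaly).
y_test = np.concatenate([np.ones(80, dtype=int), -np.ones(20, dtype=int)])

# Fit on normal observations only; no anomaly labels are used for training.
# nu (assumed 0.1) upper-bounds the fraction of training points treated as outliers.
model = make_pipeline(
    StandardScaler(),
    OneClassSVM(kernel="rbf", gamma="scale", nu=0.1),
)
model.fit(X_train)

y_pred = model.predict(X_test)

# Labels are used only for evaluation. Treating "normal" as the positive
# class for the F1-score is an assumption of this sketch.
accuracy = accuracy_score(y_test, y_pred)
f1_normal = f1_score(y_test, y_pred, pos_label=1)
print(f"accuracy={accuracy:.3f}  F1(normal)={f1_normal:.3f}")
```

In this sketch the labels enter only at evaluation time, which mirrors the "no labeled data for anomalies" condition. Restricting X_train and X_test to a subset of columns before fitting would be the analogue of training on the top-ranked features.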
