Unsupervised Learning Used in Automatic Detection and Classification of Ambient‐Noise Recordings from a Large‐N Array

Cite this article as Chamarczuk, M., Y. Nishitsuji, M. Malinowski, and D. Draganov (2019). Unsupervised Learning Used in Automatic Detection and Classification of Ambient-Noise Recordings from a Large-N Array, Seismol. Res. Lett. 91, 370–389, doi: 10.1785/ 0220190063. We present a method for automatic detection and classification of seismic events from continuous ambient-noise (AN) recordings using an unsupervised machine-learning (ML) approach. We combine classic and recently developed array-processing techniques with ML enabling the use of unsupervised techniques in the routine processing of continuous data. We test our method on a dataset from a large-number (large-N) array, which was deployed over the Kylylahti undergroundmine (Finland), and show the potential to automatically process and cluster the volumes of AN data. Automatic sorting of detected events into different classes allows faster data analysis and facilitates the selection of desired parts of the wavefield for imaging (e.g., using seismic interferometry) and monitoring. First, using array-processing techniques, we obtain directivity, location, velocity, and frequency representations of AN data. Next, we transform these representations into vector-shapedmatrices. The transformeddata are input into a clustering algorithm (called k-means) to define groups of similar events, and optimizationmethods are used to obtain the optimal number of clusters (called elbow and silhouette tests). We use these techniques to obtain the optimal number of classes that characterize the AN recordings and consequently assign the proper class membership (cluster) to each data sample. For the Kylylahti AN, the unsupervised clustering produced 40 clusters. After visual inspection of events belonging to different clusters that were quality controlled by the silhouette method, we confirm the reliability of 10 clusters with a prediction accuracy higher than 90%. The obtained division into separate seismic-event classes proves the feasibility of the unsupervised ML approach to advance the automation of processing and the utilization of array AN data. Our workflow is very flexible and can be easily adapted for other input features and classification algorithms. Introduction Arrays with ever-increasing station counts are fundamental in seismology (Rost and Thomas, 2002). Developments of the nodal technology to acquire seismic data by the oil and gas industry brought the concept of large-number (large-N) arrays to academia (Hand, 2014). Their applications include structural imaging, studies of seismicity, and monitoring (e.g., Lin et al., 2013; Ben-Zion et al., 2015; Quiros et al., 2015; Karplus and Schmandt, 2018). Large-N arrays are often combined with long recording times, which facilitates seismic ambient-noise (AN) recordings (Karplus and Schmandt, 2018). AN is generally defined as a complex wavefield composed of the superposition of signals from natural and anthropogenic sources that are not generated specifically for the purpose of a study. Here, we address the issue of characterizing the AN content by developing automatic detection and classification of the various seismic events in the recorded wavefield. Because AN is recorded and stored during every regular continuous acquisition campaign (in particular, using nodal systems; Dean et al., 2015), our methodology is a step forward to maximize the information from passive recordings. The key technique using AN is called seismic interferometry (SI; Schuster et al., 2004; Wapenaar and Fokkema, 2006; Draganov et al., 2007; Wapenaar et al., 2008; Schuster, 2009). SI allows the retrieval of virtual-source records by correlating noise recordings between pairs of receivers. SI is considered a cost-effective alternative for controlled-source operations, especially when terrain access is an issue. Successful applications 1. Institute of Geophysics PAS, Warsaw, Poland; 2. Faculty of Civil Engineering and Geosciences, Delft University of Technology, CN Delft, Netherlands; 3. Petro Summit E&P Corporation, Tokyo, Japan *Corresponding author: mchamarczuk@igf.edu.pl © Seismological Society of America 370 Seismological Research Letters www.srl-online.org • Volume 91 • Number 1 • January 2020 Downloaded from https://pubs.geoscienceworld.org/ssa/srl/article-pdf/91/1/370/4910542/srl-2019063.1.pdf by Instytut Nauk-Geologicznych user on 03 January 2020 of AN SI can provide the velocity and structural information at the exploration scale (Draganov et al., 2009), and at the crustal scale (Ruigrok et al., 2010). Imaging and monitoring of the shallow crust require highfrequency data, that is, high-frequency sources recorded with high sampling rate (Niu and Yamaoka, 2018). Moreover, dense geophone arrays deployed in areas of abundant noise activity such as operating mine sites or volcanoes enable unaliased spatial sampling of the noise-source distribution characteristic for such high-seismicity areas (Rost and Thomas, 2002). For such continuous event-rich AN recordings, the preferred processing approach should be automatic and require minimum human interaction (Hansen and Schmandt, 2015). At the same time, most conventional array-processing techniques require high signal coherency across the array, implying important constraints on the array geometry, spatial extent, and data quality (Almendros et al., 1999). Therefore, for an existing dataset, the main interest is to optimize the processing time (including tuning array-processing parameters), especially in cases where months of human work could be necessary. Detection and classification of seismic signals using machinelearning (ML) already has awell-established history (Dowla et al., 1990; Dysart and Pulli, 1990; Wang and Teng, 1995; Del Pezzo et al., 2003; Wiszniowski et al., 2014). These studies were a step toward developing effective ML techniques for distinguishing tremors and earthquakes (Nakano et al., 2019), geyser-eruption signals detection (Yuan et al., 2019), earthquake early warning (Li, Meier, et al., 2018; Kong et al., 2019), and many automatic approaches for accurate earthquake-parameter estimation (Böse et al., 2008; Meier et al., 2015; Cuéllar et al., 2018; Ochoa et al., 2018), including the almost separate branch of accurate phasepicking methods (Chen, 2018; Zhu and Beroza, 2018). From the previously mentioned applications of seismic arrays, imaging studies (Araya-Polo et al., 2018), monitoring volcanic tremors (Malfante et al., 2018), and earthquake early warning (Kong et al., 2016; Li, Meier, et al., 2018) are a few of the most interesting and challenging areas for employing ML techniques. An inherent processing part related to some of the ML applications in seismology is the detection and classification of specific event types (Rouet-Leduc et al., 2017; Zhou et al., 2019), for example, signals (tremors) in volcano monitoring (Bhatti et al., 2016), specific precursors observed in the seismograms in earthquake early warning (Minson et al., 2018), and bodyor surface-wave events for AN SI imaging studies (Nakata et al., 2016). For some of these signals such as the volcanic tremors or body-wave events, the sole detection might not suffice because the waveforms of the different event types are similar to each other in the time domain. They might be differentiated only by minor features when transformed using signal representations (i.e., transformed to other domains) such as Fourier transform, envelope, autocorrelation, and kurtosis to name just a few from the long list of signal transformations used in the ML detection context (see the review in Malfante et al., 2018). These applications demand more detailed classification and assessment of AN event types performed in real time. Such processing appears to be a good target for combined ML and array-processing techniques. A hybrid approach that combines ML and array processing is an emphasized and anticipated development in seismology (Kong et al., 2018; Bergen et al., 2019). Motivation Here, we focus on the performance of ML for detecting and assessing different categories of seismic events present in continuous AN recordings. We rely on the fact that the appearance of AN in seismic records differs in amplitude and frequency and that the various kinds of signal transformations derived therefrom provide varying characteristics (Bormann, 1998). We aim for evaluation of the feasibility of unsupervised clustering methods using these differences to track down various AN events without the need to know their exact representation in different domains. These characteristics can be retrieved using arrayprocessing techniques (Rost and Thomas, 2002). This idea is applied to data recorded using a large-N array above the Kylylahti active undergroundmine in eastern Finland for testing AN SI for mineral exploration. The same dataset has already been used to demonstrate extraction of body-wave events using supervised ML (support-vector machine) in combination with a two-step wavefield evaluation and detection method (Chamarczuk et al., 2019). With this hybrid approach, the authors made a binary classification to discriminate between bodyand surface-wave events. They applied two-step wavefield evaluation and detection method on a small portion of the AN recordings to obtain labels (in this study, label is the type of recorded seismic event) and then used the labels as input to a support-vector machine to classify the remaining part of the data. In our study, we do not provide input labels and aim to discriminate between several (>2) classes of seismic events and thus provide a more detailed description of the recorded AN data. In addition to the lack of labels, typical for unsupervised clustering, we treat the number of clusters as an unknown parameter to be established (we refer to this approach as “blind clustering”). In blind clustering, we estimate the range of optimal number of clusters by comparing the results of k-means for a broad range o

[1]  J. Ebel,et al.  Aftershock Imaging with Dense Arrays (AIDA) following the Mw 4.0 Waterboro earthquake of 16 October 2012 Maine, U.S.A. , 2015 .

[2]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[3]  Mustafa Erdik,et al.  PreSEIS: A Neural Network-Based Approach to Earthquake Early Warning for Finite Faults , 2008 .

[4]  Néstor Becerra Yoma,et al.  Automatic detection of volcano-seismic events by modeling state and event duration in hidden Markov models , 2016 .

[5]  Ting Chen,et al.  Preface to the Focus Section on Machine Learning in Seismology , 2019, Seismological Research Letters.

[6]  G. N. Lance,et al.  A General Theory of Classificatory Sorting Strategies: 1. Hierarchical Systems , 1967, Comput. J..

[7]  Yue Wu,et al.  DeepDetect: A Cascaded Region-Based Densely Connected Network for Seismic Event Detection , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[8]  K. Wapenaar,et al.  Reflection imaging of the Moon's interior using deep‐moonquake seismic interferometry , 2016 .

[9]  Qingkai Kong,et al.  MyShake: A smartphone seismic network for earthquake early warning and beyond , 2016, Science Advances.

[10]  Robert W. Clayton,et al.  High-resolution 3D shallow crustal structure in Long Beach, California: Application of ambient noise tomography on a dense seismic array , 2013 .

[11]  J. H. Ward Hierarchical Grouping to Optimize an Objective Function , 1963 .

[12]  Zefeng Li,et al.  High-resolution seismic event detection using local similarity for Large-N arrays , 2018, Scientific Reports.

[13]  Arno Puder,et al.  Machine Learning Aspects of the MyShake Global Smartphone Seismic Network , 2018, Seismological Research Letters.

[14]  G. Bellefleur,et al.  Feasibility of virtual source reflection seismology using interferometry for mineral exploration: A test study in the Lalor Lake volcanogenic massive sulphide mining area, Manitoba, Canada , 2015 .

[15]  Yingjie Yang,et al.  Processing seismic ambient noise data to obtain reliable broad-band surface wave dispersion measurements , 2007 .

[16]  J. Peterson,et al.  Observations and modeling of seismic background noise , 1993 .

[17]  Kees Wapenaar,et al.  Reflection images from ambient seismic noise , 2009 .

[18]  Gerardo Suárez,et al.  A Fast Earthquake Early Warning Algorithm Based on the First 3 s of the P‐Wave Coda , 2018, Bulletin of the Seismological Society of America.

[20]  Ta-Liang Teng,et al.  Artificial neural network-based seismic detector , 1995, Bulletin of the Seismological Society of America.

[21]  Andreas Rietbrock,et al.  Convolutional Neural Network for Seismic Phase Classification, Performance Demonstration over a Local Seismic Network , 2019, Seismological Research Letters.

[22]  Amir Adler,et al.  Deep-learning tomography , 2018 .

[23]  Masaru Nakano,et al.  Discrimination of Seismic Signals from Earthquakes and Tectonic Tremor by Applying a Convolutional Neural Network to Running Spectral Images , 2019, Seismological Research Letters.

[24]  Zefeng Li,et al.  Machine Learning Seismic Wave Discrimination: Application to Earthquake Early Warning , 2018, Geophysical Research Letters.

[25]  E. Roots,et al.  Interferometric seismic imaging around the active Lalor mine in the Flin Flon greenstone belt, Canada , 2017 .

[26]  Vipin Kumar,et al.  The Challenges of Clustering High Dimensional Data , 2004 .

[27]  Alma Leora Culén,et al.  Preface to the Focus Section , 2015, IxD&A.

[28]  Farid U. Dowla,et al.  Seismic discrimination with artificial neural networks: Preliminary results with regional spectral data , 1990 .

[29]  Chris H. Q. Ding,et al.  K-means clustering via principal component analysis , 2004, ICML.

[30]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[31]  Ajay Rana,et al.  K-means with Three different Distance Metrics , 2013 .

[32]  P. Bormann Conversion and comparability of data presentations on seismic background noise , 1998 .

[33]  Men-Andrin Meier,et al.  The limits of earthquake early warning: Timeliness of ground motion estimates , 2018, Science Advances.

[34]  Peter Gerstoft,et al.  A year of microseisms in southern California , 2007 .

[35]  R. L. Thorndike Who belongs in the family? , 1953 .

[36]  Kees Wapenaar,et al.  Seismic exploration‐scale velocities and structure from ambient seismic noise (>1 Hz) , 2013 .

[37]  C. Humphreys,et al.  Machine Learning Predicts Laboratory Earthquakes , 2017, Geophysical Research Letters.

[38]  T. Dean,et al.  The coherency of ambient seismic noise recorded during land surveys and the resulting implications for the effectiveness of geophone arrays , 2015 .

[39]  M. Taner,et al.  SEMBLANCE AND OTHER COHERENCY MEASURES FOR MULTICHANNEL DATA , 1971 .

[40]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[41]  Felix J. Herrmann,et al.  Seismic denoising with nonuniformly sampled curvelets , 2006, Computing in Science & Engineering.

[42]  Kees Wapenaar,et al.  High-resolution lithospheric imaging with seismic interferometry , 2010 .

[43]  Peter Gerstoft,et al.  Phase velocities from seismic noise using beamforming and cross correlation in Costa Rica and Nicaragua , 2008 .

[44]  P. Willmore,et al.  Manual of seismological observatory practice , 1979 .

[45]  Peter Gerstoft,et al.  Machine Learning in Seismology: Turning Data into Insights , 2018, Seismological Research Letters.

[46]  Paul L. Stoffa,et al.  The traveltime equation, tau‐p mapping, and inversion of common midpoint data , 1981 .

[47]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[48]  Yehuda Ben-Zion,et al.  Basic data features and results from a spatially dense seismic array on the San Jacinto fault zone , 2015 .

[49]  Luis Hernan Ochoa,et al.  Fast magnitude determination using a single seismological station record implementing machine learning techniques , 2017 .

[50]  C. Thomas,et al.  Improving Seismic Resolution Through Array Processing Techniques , 2009 .

[51]  Nasser M. Nasrabadi,et al.  Pattern Recognition and Machine Learning , 2006, Technometrics.

[52]  Brandon Schmandt,et al.  Automated detection and location of microseismicity at Mount St. Helens with a large‐N geophone array , 2015 .

[53]  N. Nakata,et al.  Toward 4D Noise-Based Seismic Probing of Volcanoes: Perspectives from a Large-N Experiment on Piton de la Fournaise Volcano , 2016 .

[54]  D. Davies,et al.  Vespa Process for Analysis of Seismic Signals , 1971 .

[55]  Michaël Gharbi,et al.  Convolutional neural network for earthquake detection and location , 2017, Science Advances.

[56]  Brandon Schmandt,et al.  Preface to the Focus Section on Geophone Array Seismology , 2018, Seismological Research Letters.

[57]  Thomas H. Heaton,et al.  The Gutenberg Algorithm: Evolutionary Bayesian Magnitude Estimates for Earthquake Early Warning with a Filter Bank , 2015 .

[58]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[59]  W. Mooney,et al.  Reflection imaging with earthquake sources and dense arrays , 2017 .

[60]  Anna Esposito,et al.  Discrimination of Earthquakes and Underwater Explosions Using Neural Networks , 2003 .

[61]  Edoardo Del Pezzo,et al.  Array analysis using circular-wave-front geometry: an application to locate the nearby seismo-volcanic source , 1999 .

[62]  Kesari Verma,et al.  Investigations on Impact of Feature Normalization Techniques on Classifier's Performance in Breast Tumor Classification , 2015 .

[63]  Jay J. Pulli,et al.  Regional seismic event classification at the NORESS array: Seismological measurements and the use of trained neural networks , 1990 .

[64]  N. Nakata,et al.  Body and surface wave reconstruction from seismic noise correlations between arrays at Piton de la Fournaise volcano , 2016 .

[65]  Michal Malinowski,et al.  Automatic 3D illumination-diagnosis method for large-N arrays: robust data scanner and machine-learning feature provider , 2019 .

[66]  J. Schmittbuhl,et al.  Reservoir Imaging Using Ambient Noise Correlation From a Dense Seismic Network , 2018, Journal of Geophysical Research: Solid Earth.

[67]  Sebastian Rost,et al.  ARRAY SEISMOLOGY: METHODS AND APPLICATIONS , 2002 .

[68]  J. Sheng,et al.  Interferometric/daylight seismic imaging , 2004 .

[69]  Marielle Malfante,et al.  Machine Learning for Volcano-Seismic Signals: Challenges and Perspectives , 2018, IEEE Signal Processing Magazine.

[70]  Deng Cai,et al.  Unsupervised feature selection for multi-cluster data , 2010, KDD.

[71]  K. Wapenaar,et al.  Green's function representations for seismic interferometry , 2006 .

[72]  Kees Wapenaar,et al.  Retrieval of reflections from seismic background‐noise measurements , 2007 .

[73]  Satish Karra,et al.  Using Machine Learning to Discern Eruption in Noisy Environments: A Case Study using CO2-driven Cold-Water Geyser in Chimayo, New Mexico , 2018, Seismological Research Letters.

[74]  Paul Johnson,et al.  Earthquake Detection in 1D Time‐Series Data with Feature Selection and Dictionary Learning , 2018, Seismological Research Letters.

[75]  Kees Wapenaar,et al.  Seismic interferometry : history and present status , 2008 .

[76]  Don H. Johnson,et al.  Array Signal Processing: Concepts and Techniques , 1993 .

[77]  Yangkang Chen,et al.  Automatic microseismic event picking via unsupervised machine learning , 2020, Geophysical Journal International.

[78]  K. Florek,et al.  Sur la liaison et la division des points d'un ensemble fini , 1951 .

[79]  Jan Wiszniowski,et al.  Application of real time recurrent neural network for detection of small natural earthquakes in Poland , 2014, Acta Geophysica.