Seismic waves that are recorded by near-surface sensors are usually disturbed by strong noise. Hence, the recorded seismic data are sometimes of poor quality; this phenomenon can be characterized as a low signal-to-noise ratio (SNR). The low SNR of the seismic data may lower the quality of many subsequent seismological analyses, such as inversion and imaging. Thus, the removal of unwanted seismic noise has significant importance. In this article, we intend to improve the SNR of many seismological datasets by developing new denoising framework that is based on an unsupervised machine-learning technique. We leverage the unsupervised learning philosophy of the autoencoding method to adaptively learn the seismic signals from the noisy observations. This could potentially enable us to better represent the true seismic-wave components. To mitigate the influence of the seismic noise on the learned features and suppress the trivial components associated with low-amplitude neurons in the hidden layer, we introduce a sparsity constraint to the autoencoder neural network. The sparse autoencoder method introduced in this article is effective in attenuating the seismic noise. More importantly, it is capable of preserving subtle features of the data, while removing the spatially incoherent random noise. We apply the proposed denoising framework to a reflection seismic image, depth-domain receiver function gather, and an earthquake stack dataset. The purpose of this study is to demonstrate the framework’s potential in real-world applications. INTRODUCTION Seismic phases from the discontinuities in the Earth’s interior contain significant constraints for high-resolution deep Earth imaging; however, they sometimes arrive as weak-amplitude waveforms (Rost and Weber, 2001; Rost and Thomas, 2002; Deuss, 2009; Saki et al., 2015; Guan and Niu, 2017, 2018; Schneider et al., 2017; Chai et al., 2018). The detection of these weak-amplitude seismic phases is sometimes challenging because of three main reasons: (1) the amplitude of these phases is very small and can be neglected easily when seen next to the amplitudes of neighboring phases that are much larger; (2) the coherency of the weak-amplitude seismic phases is seriously degraded because of insufficient array coverage and spatial sampling; and (3) the strong random background noise that is even stronger than the weak phases in amplitude makes the detection even harder. As an example of the challenges presented, the failure in detecting the weak reflection phases from mantle discontinuities could result in misunderstanding of the mineralogy or temperature properties of the Earth interior. To conquer the challenges in detecting weak seismic phases, we need to develop specific processing techniques. In earthquake seismology, in order to highlight a specific weak phase, recordings in the seismic arrays are often shifted and stacked for different slowness and back-azimuth values (Rost and Thomas, 2002). Stacking serves as one of the most widely used approaches in enhancing the energy of target signals. Shearer (1991a) stacked long-period seismograms of shallow earthquakes that were recorded from the Global Digital Seismograph Network for 5 yr and obtained a gather that shows typical arrivals clearly from the deep Earth. Morozov and Dueker (2003) investigated the effectiveness of stacking in enhancing the signals of the receiver functions. They defined a signal-to-noise ratio (SNR) metric that was based on the multichannel coherency of the signals and the incoherency of the random noise, and they showed that the stacking can significantly improve the SNR of the stacked seismic trace. However, stacking methods have some drawbacks. First, they do not necessarily remove the noise present in the signal. Second, they require a large array of seismometers. Third, they require coherency of arrivals across the array, which are not always about earthquake seismology. From this point of view, a single-channel method seems to be a better substitute for improving the SNR of seismograms (Mousavi and Langston, 2016; 2017). In the reflection seismology community, many noise attenuation methods have been proposed and implemented in field applications over the past several decades. Prediction-based methods utilize the predictive property of the seismic signal to construct a predictive filter that prevents noise. Median filters and their variants use the statistical principle to reject Gaussian white noise or impulsive noise (Mi et al., 2000; Bonar and Sacchi, 2012). The dictionary-learning-based methods adaptively learn the basis from the data to sparsify the noisy seismic data, which will in turn suppress the noise (Zhang, van der Baan, et al., 2018). These methods require experimenters to solve the dictionary-updating and sparse-coding methods and can be very 1552 Seismological Research Letters Volume 90, Number 4 July/August 2019 doi: 10.1785/0220190028 Downloaded from https://pubs.geoscienceworld.org/ssa/srl/article-pdf/90/4/1552/4790732/srl-2019028.1.pdf by Seismological Society of America, Mattie Adam on 09 July 2019 expensive, computationally speaking. Decomposition-based methods decompose the noisy data into constitutive components, so that one can easily select the components that primarily represent the signal and remove those associated with noise. This category includes singular value decomposition (SVD)-based methods (Bai et al., 2018), empirical-mode decomposition (Chen, 2016), continuous wavelet transform (Mousavi et al., 2016), morphological decomposition (Huang et al., 2017), and so on. Rank-reduction-based methods assume that seismic data have a low-rank structure (Kumar et al., 2015; Zhou et al., 2017). If the data consist of κ complex linear events, the constructed Hankel matrix of the frequency data is a matrix of rank κ (Hua, 1992). Noise will increase the rank of theHankel matrix of the data, which can be attenuated via rank reduction. Such methods include Cadzow filtering (Cadzow, 1988; Zu et al., 2017) and SVD (Vautard et al., 1992). Most of the denoising methods are largely effective in processing reflection seismic images. The applications in more general seismological datasets are seldom reported, partially because of the fact that many seismological datasets have extremely low data quality. That is, they are characterized by low SNR and poor spatial sampling. Besides, most traditional denoising algorithms are based on carefully tuned parameters to obtain satisfactory performance. These parameters are usually data dependent and require a great deal of experiential knowledge. Thus, they are not flexible enough to use in application to many real-world problems. More research efforts have been dedicated to using machine-learning methods for seismological data processing (Chen, 2018a,b; Zhang, Wang, et al., 2018; Bergen et al., 2019; Lomax et al., 2019; McBrearty et al., 2019). Recently, supervised learning (Zhu et al., 2018) has been successfully applied for denoising of the seismic signals. However, supervised methods with deep networks require very large training datasets (sometimes to an order of a billion) of clean signals and their noisy contaminated realizations. In this article, we develop a new automatic denoising framework for improving the SNR of the seismological datasets based on an unsupervised machine-learning (UML) approach; this would be the autoencoder method. We leverage the autoencoder neural network to adaptively learn the features from the raw noisy seismological datasets during the encoding process, and then we optimally represent the data using these learned features during the decoding process. To effectively suppress the random noise, we use the sparsity constraint to regularize the neurons in the hidden layer. We apply the proposed UML-based denoising framework to a group of seismological datasets, including a reflection seismic image, a receiver function gather, and an earthquake stack. We observe a very encouraging performance, which demonstrates its great potential in a wide range of applications. METHOD Unsupervised Autoencoder Method Wewill first introduce the autoencoder neural network that we use for denoising seismological datasets. Autoencoders are specific neural networks that consist of two connected parts (decoder and encoder) that try to copy their input to the output layer. Hence, they can automatically learn the main features of the data in an unsupervised manner. In this article, the network is simply a three-layer architecture with an input layer, a hidden layer, and an output layer. The encoding process in the autoencoder neural network can be expressed as follows: EQ-TARGET;temp:intralink-;df1;323;673 ξ W1x b1 ; 1 in which x is the training sample (x∈Rn), ξ is the activation function. The decoding process can be expressed as follows: EQ-TARGET;temp:intralink-;df2;323;608 x ⌢ ξ W2x b2 : 2 In equations (1) and (2), W1 is the weighting matrix between the input layer and the hidden layer; b1 is the forward bias vector; W2 is the weighting matrix between the hidden layer and output layer; b2 is the backward bias vector; and ξ is the activation function. In this study, we use the softplus function as the activation function: EQ-TARGET;temp:intralink-;df3;323;505 ξ x log 1 e : 3 Sparsity Regularized Autoencoder To mitigate the influence of the seismic noise on the learned features and suppress the trivial components associated with low-amplitude neurons in the hidden layer, we apply a sparsity constraint to the hidden layer; that is, the output or last layer of the encoder. The sparsity constraint can help dropout the extracted nontrivial features that correspond to the noise and a small value in the hidden units. It can thus highlight the most dominant features in the data—the useful signals. The sparse penalty term can be written as follows: EQ-TARGET;temp:intralink-;df4;323;335~ R p ; 4 in which R is the penalty function: EQ-TARGET;temp:
[1]
Yangkang Chen,et al.
Fast waveform detection for microseismic imaging using unsupervised machine learning
,
2018
.
[2]
Yangkang Chen,et al.
Deep learning for seismic lithology prediction
,
2018
.
[3]
Christine Thomas,et al.
Topography of upper mantle seismic discontinuities beneath the North Atlantic: The Azores, Canary and Cape Verde plumes
,
2014
.
[4]
Guochang Liu,et al.
Stacking seismic data using local correlation
,
2009
.
[5]
Ken Dueker,et al.
Signal‐to‐noise ratios of teleseismic receiver functions and effectiveness of stacking for their enhancement
,
2003
.
[6]
R. A. Leibler,et al.
On Information and Sufficiency
,
1951
.
[7]
Peter M. Shearer,et al.
Constraints on upper mantle discontinuities from observations of long-period reflected and converted phases
,
1991
.
[8]
Ting Chen,et al.
Preface to the Focus Section on Machine Learning in Seismology
,
2019,
Seismological Research Letters.
[9]
Z. Guan,et al.
Using Fast Marching Eikonal Solver to Compute 3‐D Pds Traveltime for Deep Receiver‐Function Imaging
,
2018,
Journal of Geophysical Research: Solid Earth.
[10]
Min Bai,et al.
A structural rank reduction operator for removing artifacts in least-squares reverse time migration
,
2018,
Comput. Geosci..
[11]
R. Vautard,et al.
Singular-spectrum analysis: a toolkit for short, noisy chaotic signals
,
1992
.
[12]
Yingbo Hua.
Estimating two-dimensional frequencies by matrix enhancement and matrix pencil
,
1992,
IEEE Trans. Signal Process..
[13]
S. Mostafa Mousavi,et al.
Automatic noise-removal/signal-removal based on general cross-validation thresholding in synchrosqueezed domain and its application on earthquake data
,
2017
.
[14]
S. Mostafa Mousavi,et al.
Automatic microseismic denoising and onset detection using the synchrosqueezed continuous wavelet transform
,
2016
.
[15]
James A. Cadzow,et al.
Signal enhancement-a composite property mapping algorithm
,
1988,
IEEE Trans. Acoust. Speech Signal Process..
[16]
Shaohuan Zu,et al.
Low-frequency noise attenuation in seismic and microseismic data using mathematical morphological filtering
,
2017,
Geophysical Journal International.
[17]
Paul A. Johnson,et al.
Pairwise Association of Seismic Arrivals with Convolutional Neural Networks
,
2019,
Seismological Research Letters.
[18]
Mauricio D. Sacchi,et al.
Denoising seismic data using the nonlocal means algorithm
,
2012
.
[19]
Mirko van der Baan,et al.
Unsupervised Dictionary Learning for Signal‐to‐Noise Ratio Enhancement of Array Data
,
2018,
Seismological Research Letters.
[20]
Yangkang Chen,et al.
Random noise attenuation using local signal-and-noise orthogonalization
,
2015
.
[21]
S. Mostafa Mousavi,et al.
Hybrid Seismic Denoising Using Higher-Order Statistics and Improved Wavelet Block Thresholding
,
2016
.
[22]
Monica Maceira,et al.
Interactive Visualization of Complex Seismic Data and Models Using Bokeh
,
2018
.
[23]
Peter M. Shearer,et al.
Imaging global body wave phases by stacking long‐period seismograms
,
1991
.
[24]
Mauricio D. Sacchi,et al.
Sharp mantle transition from cratons to Cordillera in southwestern Canada
,
2015
.
[25]
L. Canales.
Random Noise Reduction
,
1984
.
[26]
Yu Jeffrey Gu,et al.
Improvement of coda phase detectability and reconstruction of global seismic data using frequency-wavenumber methods
,
2018
.
[27]
A. K. Rigler,et al.
Accelerating the convergence of the back-propagation method
,
1988,
Biological Cybernetics.
[28]
M. Weber,et al.
A reflector at 200 km depth beneath the northwest Pacific
,
2001
.
[29]
Z. Guan,et al.
An investigation on slowness‐weighted CCP stacking and its application to receiver function imaging
,
2017
.
[30]
Yangkang Chen,et al.
Automatic microseismic event picking via unsupervised machine learning
,
2020,
Geophysical Journal International.
[31]
A. Deuss,et al.
Global Observations of Mantle Discontinuities Using SS and PP Precursors
,
2009
.
[32]
Yangkang Chen,et al.
Dip-separated structural filtering using seislet transform and adaptive empirical mode decomposition based dip filter
,
2016
.
[33]
Zefeng Li,et al.
High-resolution seismic event detection using local similarity for Large-N arrays
,
2018,
Scientific Reports.
[34]
Anthony Lomax,et al.
An Investigation of Rapid Earthquake Characterization Using Single‐Station Waveforms and a Convolutional Neural Network
,
2019,
Seismological Research Letters.
[35]
S. Mostafa Mousavi,et al.
Seismic Signal Denoising and Decomposition Using Deep Neural Networks
,
2018,
IEEE Transactions on Geoscience and Remote Sensing.
[36]
Gary F. Margrave,et al.
Median filtering in Kirchhoff migration for noisy data
,
2000
.
[37]
Dong Zhang,et al.
Seismic noise attenuation using an online subspace tracking algorithm
,
2018,
Geophysical Journal International.
[38]
Xiao Pan,et al.
Iterative deblending of simultaneous-source data using a coherency-pass shaping operator
,
2017
.
[39]
Hassan Mansour,et al.
Efficient matrix completion for seismic data reconstruction
,
2015
.
[40]
Yangkang Chen,et al.
Fast dictionary learning for noise attenuation of multidimensional seismic data
,
2017,
Geophysical Journal International.
[41]
Sebastian Rost,et al.
ARRAY SEISMOLOGY: METHODS AND APPLICATIONS
,
2002
.