Rapid and Robust Cross‐Correlation‐Based Seismic Signal Identification Using an Approximate Nearest Neighbor Method

Abstract The matched filtering technique that uses the cross correlation of a waveform of interest with archived signals from a template library has proven to be a powerful tool for detecting events in regions with repeating seismicity. However, waveform correlation is computationally expensive and therefore impractical for large template sets unless dedicated distributed computing hardware and software are used. In this study, we introduce an approximate nearest neighbor (ANN) approach that enables the use of very large template libraries for waveform correlation. Our method begins with a projection into a reduced dimensionality space, based on correlation with a randomized subset of the full template archive. Searching for a specified number of nearest neighbors for a query waveform is accomplished by iteratively comparing it with the neighbors of its immediate neighbors. We used the approach to search for matches to each of ∼2300 analyst‐reviewed signal detections reported in May 2010 for the International Monitoring System station MKAR. The template library in this case consists of a data set of more than 200,000 analyst‐reviewed signal detections for the same station from February 2002 to July 2016 (excluding May 2010). Of these signal detections, 73% are teleseismic first P and 17% regional phases ( Pn , Pg , Sn , and Lg ). The analyses performed on a standard desktop computer show that the proposed ANN approach performs a search of the large template libraries about 25 times faster than the standard full linear search and achieves recall rates greater than 80%, with the recall rate increasing for higher correlation thresholds.

[1]  Kai Li,et al.  Efficient k-nearest neighbor graph construction for generic similarity measures , 2011, WWW.

[2]  Kristen Grauman,et al.  Kernelized locality-sensitive hashing for scalable image search , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[3]  Zhigang Peng,et al.  Migration of early aftershocks following the 2004 Parkfield earthquake , 2009 .

[4]  Quan Wang,et al.  Kernel Principal Component Analysis and its Applications in Face Recognition and Active Shape Models , 2012, ArXiv.

[5]  Christopher John Young,et al.  Detection of the Wenchuan aftershock sequence using waveform correlation with a composite regional network , 2016 .

[6]  Jon Louis Bentley,et al.  Multidimensional binary search trees used for associative searching , 1975, CACM.

[7]  Clara E Yoon,et al.  Earthquake detection through computationally efficient similarity search , 2015, Science Advances.

[8]  F. Waldhauser,et al.  One Magnitude Unit Reduction in Detection Threshold by Cross Correlation Applied to Parkfield (California) and China Seismicity , 2010 .

[9]  Avery Wang,et al.  An Industrial Strength Audio Search Algorithm , 2003, ISMIR.

[10]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[11]  David B. Harris,et al.  An Autonomous System for Grouping Events in a Developing Aftershock Sequence , 2011 .

[12]  Enhong Chen,et al.  Real-time earthquake monitoring using a search engine method , 2014, Nature Communications.

[13]  Monika Henzinger,et al.  Finding near-duplicate web pages: a large-scale evaluation of algorithms , 2006, SIGIR.

[14]  Michael A. Casey,et al.  Locality-Sensitive Hashing for Finding Nearest Neighbors , 2008 .

[15]  William R. Walter,et al.  Initial Global Seismic Cross-Correlation Results: Implications for Empirical Signal Detectors , 2014 .

[16]  Alexandr Andoni,et al.  Near-Optimal Hashing Algorithms for Approximate Nearest Neighbor in High Dimensions , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[17]  F. Waldhauser,et al.  Large-scale relocation of two decades of Northern California seismicity using cross-correlation and double-difference methods , 2008 .

[18]  F. Ringdal,et al.  The detection of low magnitude seismic events using array-based waveform correlation , 2006 .

[19]  David P. Schaff,et al.  Broad‐scale applicability of correlation detectors to China seismicity , 2009 .

[20]  C. Young,et al.  Applying Waveform Correlation to Three Aftershock Sequences , 2012 .

[21]  M. Slaney,et al.  Locality-Sensitive Hashing for Finding Nearest Neighbors [Lecture Notes] , 2008, IEEE Signal Processing Magazine.

[22]  Miao Zhang,et al.  Seismological Evidence for a Low‐Yield Nuclear Test on 12 May 2010 in North Korea , 2015 .

[23]  Christopher John Young,et al.  A comparison of select trigger algorithms for automated global seismic phase and event detection , 1998, Bulletin of the Seismological Society of America.

[24]  David G. Lowe,et al.  Fast Approximate Nearest Neighbors with Automatic Algorithm Configuration , 2009, VISAPP.

[25]  David P. Schaff,et al.  Improvements to detection capability by cross-correlating for similar events: a case study of the 1999 Xiuyan, China, sequence and synthetic sensitivity tests , 2010 .