Reply to: One neuron versus deep learning in aftershock prediction

Forecasting the spatial distribution of aftershocks in the aftermath of large seismic events is of great importance for improving both our understanding of earthquake triggering and post-disaster management. Recently, DeVries et al.1 attempted to solve this scientific problem by deep learning. Using the area under the curve (AUC) of receiver operating characteristic curves, the authors showed that a deep neural network (DNN) considerably outperformed classical Coulomb stress. Here we first clarify that similar performances had already been obtained (by the same authors, in 2017)2 for various scalar stress metrics, suggesting that deep learning does not actually improve prediction. Second, we reformulate the 2017 results2 using two-parameter logistic regression (that is, one neuron) and obtain the same performance as that of the 13,451-parameter DNN. We further show that a logistic regression based on the measured distance and mainshock average slip (instead of derived stresses) performs better than the DNN. This demonstrates that so far the proposed deep learning strategy does not provide any new insight (predictive or inferential) in this domain. Operational aftershock forecasting has been possible for decades thanks to well established empirical laws3–5. Spatial patterns of aftershocks are often described as a power-law decay5–8. Physical models based on the Coulomb stress paradigm only outperform statistical methods when considered in physical/statistical hybrids with high-quality mainshock rupture data9. Coulomb stress models can on their own be as performant as statistical methods when additionally including receiver fault heterogeneities10,11. Meade et al.2 performed a thorough analysis of various scalar stress metrics and showed that several of them outperform classic Coulomb failure stress. We are here concerned with their follow-up article1, which presented similar results, but via deep learning. In this study, we aim to demonstrate that while defining larger and deeper DNNs usually does not hurt model

[1]  P. Reasenberg,et al.  Earthquake Hazard After a Mainshock in California , 1989, Science.

[2]  T. Jordan,et al.  The Forecasting Skill of Physics‐Based Seismicity Models during the 2010–2012 Canterbury, New Zealand, Earthquake Sequence , 2018, Seismological Research Letters.

[3]  Arnaud Mignan,et al.  Utsu aftershock productivity law explained from geometric operations on the permanent static stress field of mainshocks , 2017, 1706.06490.

[4]  Lifeng Wang,et al.  Propagation of Coulomb stress uncertainties in physics‐based aftershock models , 2014 .

[5]  K. Felzer,et al.  Decay of aftershock density with distance indicates triggering by dynamic stress , 2006, Nature.

[6]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[7]  Shinji Toda,et al.  Decay of aftershock density with distance does not indicate triggering by dynamic stress , 2010, Nature.

[8]  Peter Gerstoft,et al.  Machine Learning in Seismology: Turning Data into Insights , 2018, Seismological Research Letters.

[9]  Matthew C. Gerstenberger,et al.  Real-time forecasts of tomorrow's earthquakes in California , 2005, Nature.

[10]  P. Reasenberg,et al.  Earthquake aftershocks: update. , 1994, Science.

[11]  F. Viégas,et al.  Deep learning of aftershock patterns following large earthquakes , 2018, Nature.

[12]  Martin Wattenberg,et al.  What Is Better Than Coulomb Failure Stress? A Ranking of Scalar Static Stress Triggering Mechanisms from 105 Mainshock‐Aftershock Pairs , 2017 .

[13]  Marco Broccardo,et al.  One neuron is more informative than a deep neural network for aftershock pattern forecasting , 2019, 1904.01983.

[14]  David A. Rhoades,et al.  A new hybrid Coulomb/statistical model for forecasting aftershock rates , 2014 .

[15]  Michael I. Jordan,et al.  Machine learning: Trends, perspectives, and prospects , 2015, Science.