Machine Learning Predictors of Extreme Events Occurring in Complex Dynamical Systems

The ability to characterize and predict extreme events is vital in fields ranging from finance to ocean engineering. Typically, the most extreme events are also the rarest, and it is this property that makes data collection and direct simulation challenging. We consider the problem of deriving optimal predictors of extremes directly from data characterizing a complex system, by formulating the problem in the context of binary classification. Specifically, we assume that a training dataset consists of: (i) indicator time series specifying whether or not an extreme event occurs; and (ii) observable time series, which are employed to formulate efficient predictors. We employ and assess standard binary classification criteria for the selection of optimal predictors, such as total and balanced error and area under the curve, in the context of extreme event prediction. For physical systems with sufficient separation between the extreme and regular events, i.e., systems in which extremes are distinguishably larger than regular events, we prove the existence of optimal extreme event thresholds that lead to efficient predictors. Moreover, motivated by the special character of extreme events, i.e., their very low rate of occurrence, we formulate a new objective function for the selection of predictors. This objective is constructed from the same principles as receiver operating characteristic curves, and exhibits a geometric connection to the regime separation property. We demonstrate the application of the new selection criterion to the advance prediction of intermittent extreme events in two challenging complex systems: the Majda–McLaughlin–Tabak model, a 1D nonlinear, dispersive wave model, and the 2D Kolmogorov flow model, which exhibits extreme dissipation events.
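As a rough illustration of the standard criteria named in the abstract, the sketch below computes total error, balanced error, and the area under the ROC curve for a thresholded scalar predictor. It is a minimal example under stated assumptions, not the authors' implementation: the function names, the rule "predict an extreme when the predictor exceeds a threshold", and the toy data are all hypothetical, and the paper's new objective function is not reproduced here because the abstract does not define it.

```python
import numpy as np


def confusion_counts(indicator, predictor, threshold):
    """Return (TP, FP, TN, FN), predicting an extreme wherever predictor > threshold.

    The thresholding rule is an assumption made for this illustration.
    """
    actual = np.asarray(indicator, dtype=bool)
    predicted = np.asarray(predictor, dtype=float) > threshold
    tp = int(np.sum(predicted & actual))
    fp = int(np.sum(predicted & ~actual))
    tn = int(np.sum(~predicted & ~actual))
    fn = int(np.sum(~predicted & actual))
    return tp, fp, tn, fn


def total_error(indicator, predictor, threshold):
    """Fraction of all samples that are misclassified."""
    tp, fp, tn, fn = confusion_counts(indicator, predictor, threshold)
    return (fp + fn) / (tp + fp + tn + fn)


def balanced_error(indicator, predictor, threshold):
    """Mean of the miss rate and the false alarm rate.

    Assumes both classes are present, otherwise a rate is undefined.
    """
    tp, fp, tn, fn = confusion_counts(indicator, predictor, threshold)
    return 0.5 * (fn / (tp + fn) + fp / (fp + tn))


def roc_auc(indicator, predictor):
    """Area under the ROC curve, found by sweeping the threshold over all values."""
    values = np.sort(np.unique(np.asarray(predictor, dtype=float)))[::-1]
    thresholds = np.concatenate(([np.inf], values, [-np.inf]))
    tpr, fpr = [], []
    for t in thresholds:
        tp, fp, tn, fn = confusion_counts(indicator, predictor, t)
        tpr.append(tp / (tp + fn))
        fpr.append(fp / (fp + tn))
    tpr, fpr = np.array(tpr), np.array(fpr)
    # Trapezoidal rule over the (false positive rate, true positive rate) points.
    return float(np.sum(np.diff(fpr) * (tpr[1:] + tpr[:-1]) / 2.0))


# Hypothetical toy data: 2 extremes among 10 samples (rare-event imbalance).
indicator = [0, 0, 0, 0, 0, 0, 0, 1, 0, 1]
predictor = [0.1, 0.2, 0.15, 0.3, 0.25, 0.4, 0.35, 0.9, 0.5, 0.8]

print("total error:   ", total_error(indicator, predictor, 0.45))
print("balanced error:", balanced_error(indicator, predictor, 0.45))
print("ROC AUC:       ", roc_auc(indicator, predictor))
```

With rare extremes, a predictor that never signals an event already attains a small total error, which is one reason the balanced error and ROC-based criteria are usually examined alongside it.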
