The data-driven physical-based equations discovery using evolutionary approach

The modern machine learning methods allow one to obtain the data-driven models in various ways. However, the more complex the model is, the harder it is to interpret. In the paper, we describe the algorithm for the mathematical equations discovery from the given observations data. The algorithm combines genetic programming with the sparse regression. This algorithm allows obtaining different forms of the resulting models. As an example, it could be used for governing analytical equation discovery as well as for partial differential equations (PDE) discovery. The main idea is to collect a bag of the building blocks (it may be simple functions or their derivatives of arbitrary order) and consequently take them from the bag to create combinations, which will represent terms of the final equation. The selected terms pass to the evolutionary algorithm, which is used to evolve the selection. The evolutionary steps are combined with the sparse regression to pick only the significant terms. As a result, we obtain a short and interpretable expression that describes the physical process that lies beyond the data. In the paper, two examples of the algorithm application are described: the PDE discovery for the metocean processes and the function discovery for the acoustics.

[1]  Steven L. Brunton,et al.  Data-driven discovery of partial differential equations , 2016, Science Advances.

[2]  G. Madec NEMO ocean engine , 2008 .

[3]  Sergey V. Kovalchuk,et al.  Deadline-driven approach for multi-fidelity surrogate-assisted environmental model calibration: SWAN wind wave model case study , 2019, GECCO.

[4]  Sergio Escalera,et al.  Analysis of the AutoML Challenge Series 2015-2018 , 2019, Automated Machine Learning.

[5]  H. Schaeffer,et al.  Learning partial differential equations via data discovery and sparse optimization , 2017, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[6]  Hod Lipson,et al.  Distilling Free-Form Natural Laws from Experimental Data , 2009, Science.

[7]  Anna V. Kaluzhnaya,et al.  Data-Driven Partial Derivative Equations Discovery with Evolutionary Approach , 2019, ICCS.

[8]  Brandon M. Greenwell,et al.  Interpretable Machine Learning , 2019, Hands-On Machine Learning with R.

[9]  Maziar Raissi,et al.  Deep Hidden Physics Models: Deep Learning of Nonlinear Partial Differential Equations , 2018, J. Mach. Learn. Res..

[10]  S. Brunton,et al.  Discovering governing equations from data by sparse identification of nonlinear dynamical systems , 2015, Proceedings of the National Academy of Sciences.

[11]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[12]  Kaj Nyström,et al.  Data-driven discovery of PDEs in complex datasets , 2018, J. Comput. Phys..

[13]  Zachary Chase Lipton The mythos of model interpretability , 2016, ACM Queue.

[14]  Michael Ghil,et al.  Data-driven non-Markovian closure models , 2014, 1411.4700.

[15]  Sergey V. Kovalchuk,et al.  A Conceptual Approach to Complex Model Management with Generalized Modelling Patterns and Evolutionary Identification , 2018, Complex..

[16]  Arvind Satyanarayan,et al.  The Building Blocks of Interpretability , 2018 .