Improving AI System Awareness of Geoscience Knowledge: Symbiotic Integration of Physical Approaches and Deep Learning

Modeling dynamic geophysical phenomena is at the core of Earth and environmental studies. The geoscientific community relying mainly on physical representations may want to consider much deeper adoption of artificial intelligence (AI) instruments in the context of AI's global success and emergence of big Earth data. A new perspective of using hybrid physics-AI approaches is a grand vision, but actualizing such approaches remains an open question in geoscience. This study develops a general approach to improving AI geoscientific awareness, wherein physical approaches such as temporal dynamic geoscientific models are included as special recurrent neural layers in a deep learning architecture. The illustrative case of runoff modeling across the conterminous United States demonstrates that the physics-aware DL model has enhanced prediction accuracy, robust transferability, and good intelligence for inferring unobserved processes. This study represents a firm step toward realizing the vision of tackling Earth system challenges by physics-AI integration.

[1]  Jeff P. Raffensperger,et al.  Estimation of Base Flow by Optimal Hydrograph Separation for the Conterminous United States and Implications for National-Extent Hydrologic Models , 2019, Water.

[2]  Philip G Brodrick,et al.  Uncovering Ecological Patterns with Convolutional Neural Networks. , 2019, Trends in ecology & evolution.

[3]  David Walling,et al.  Combining Physically Based Modeling and Deep Learning for Fusing GRACE Satellite Data: Can We Learn From Mismatch? , 2019, Water Resources Research.

[4]  S. Hochreiter,et al.  Toward Improved Predictions in Ungauged Basins: Exploiting the Power of Machine Learning , 2019, Water Resources Research.

[5]  Tim R. McVicar,et al.  Global patterns in base flow index and recession based on streamflow observations from 3394 catchments , 2013 .

[6]  Martyn P. Clark,et al.  The CAMELS data set: catchment attributes and meteorology for large-sample studies , 2017 .

[7]  Genshiro Kitagawa,et al.  State-space modeling for seismic signal analysis , 2014 .

[8]  Karsten Schulz,et al.  Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks , 2018, Hydrology and Earth System Sciences.

[9]  Martyn P. Clark,et al.  Development of a large-sample watershed-scale hydrometeorological data set for the contiguous USA: data set characteristics and assessment of regional variability in hydrologic model performance , 2014 .

[10]  Joachim Denzler,et al.  Deep learning and process understanding for data-driven Earth system science , 2019, Nature.

[11]  Bettina K. Gier,et al.  Taking climate model evaluation to the next level , 2019, Nature Climate Change.

[12]  Jichun Wu,et al.  Deep Convolutional Encoder‐Decoder Networks for Uncertainty Quantification of Dynamic Multiphase Flow in Heterogeneous Media , 2018, Water Resources Research.

[13]  Murugesu Sivapalan,et al.  Dominant flood generating mechanisms across the United States , 2016 .

[14]  Jeffrey G. Arnold,et al.  Regional estimation of base flow for the conterminous United States by hydrologic landscape regions , 2008 .

[15]  Martyn P. Clark,et al.  Benchmarking of a Physically Based Hydrologic Model , 2017 .

[16]  David G. Tarboton,et al.  An overview of current applications, challenges, and future trends in distributed process-based models in hydrology , 2016 .

[17]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[18]  Jim E. Freer,et al.  Technical note: Inherent benchmark or not? Comparing Nash–Sutcliffe and Kling–Gupta efficiency scores , 2019, Hydrology and Earth System Sciences.

[19]  Chaopeng Shen,et al.  A Transdisciplinary Review of Deep Learning Research and Its Relevance for Water Resources Scientists , 2017, Water Resources Research.

[20]  Zhenlong Li,et al.  Big Data and cloud computing: innovation opportunities and challenges , 2017, Int. J. Digit. Earth.

[21]  Justin A. Sirignano,et al.  DGM: A deep learning algorithm for solving partial differential equations , 2017, J. Comput. Phys..

[22]  Vazken Andréassian,et al.  Spatial proximity, physical similarity, regression and ungaged catchments: A comparison of regionalization approaches based on 913 French catchments , 2008 .

[23]  Patrick M. Reed,et al.  Time‐varying sensitivity analysis clarifies the effects of watershed model formulation on model behavior , 2013 .

[24]  Hoshin Vijai Gupta,et al.  A process‐based diagnostic approach to model evaluation: Application to the NWS distributed hydrologic model , 2008 .

[25]  D. Legates,et al.  Evaluating the use of “goodness‐of‐fit” Measures in hydrologic and hydroclimatic model validation , 1999 .

[26]  Imme Ebert-Uphoff,et al.  Thoughtfully Using Artificial Intelligence in Earth Science , 2019, Eos.

[27]  J. Nash,et al.  River flow forecasting through conceptual models part I — A discussion of principles☆ , 1970 .

[28]  Nagiza F. Samatova,et al.  Theory-Guided Data Science: A New Paradigm for Scientific Discovery from Data , 2016, IEEE Transactions on Knowledge and Data Engineering.

[29]  Gilles R. C. Essou,et al.  Comparison of climate datasets for lumped hydrological modeling over the continental United States , 2016 .

[30]  Peter Bauer,et al.  The quiet revolution of numerical weather prediction , 2015, Nature.

[31]  J. McDonnell,et al.  A decade of Predictions in Ungauged Basins (PUB)—a review , 2013 .

[32]  Klaus-Robert Müller,et al.  Towards Explainable Artificial Intelligence , 2019, Explainable AI.

[33]  Avi Ostfeld,et al.  Data-driven modelling: some past experiences and new approaches , 2008 .

[34]  Anuj Karpatne,et al.  Machine Learning for the Geosciences: Challenges and Opportunities , 2017, IEEE Transactions on Knowledge and Data Engineering.

[35]  Jongho Kim,et al.  Streamflow, stomata, and soil pits: Sources of inference for complex models with fast, robust uncertainty quantification , 2019, Advances in Water Resources.

[36]  Dimitri Solomatine,et al.  A novel method to estimate model uncertainty using machine learning techniques , 2009 .

[37]  Kunihiko Fukushima,et al.  Neocognitron: A new algorithm for pattern recognition tolerant of deformations and shifts in position , 1982, Pattern Recognit..

[38]  R. Woods,et al.  Catchment Classification and Hydrologic Similarity , 2006, Geography Compass.

[39]  Markus Reichstein,et al.  Physics‐Constrained Machine Learning of Evapotranspiration , 2019, Geophysical Research Letters.

[40]  Paulin Coulibaly,et al.  Streamflow Prediction in Ungauged Basins: Review of Regionalization Methods , 2013 .

[41]  Pierre Gentine,et al.  Deep learning to represent subgrid processes in climate models , 2018, Proceedings of the National Academy of Sciences.

[42]  Alexander Y. Sun,et al.  Monthly streamflow forecasting using Gaussian Process Regression , 2014 .

[43]  Chris North,et al.  Intelligent systems for geosciences , 2018, Communications of the ACM.

[44]  Michael Ghil,et al.  A Century of Nonlinearity in the Geosciences , 2019, Earth and Space Science.

[45]  Cynthia Rudin,et al.  Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead , 2018, Nature Machine Intelligence.

[46]  Xubin Zeng,et al.  Snowpack Change From 1982 to 2016 Over Conterminous United States , 2018, Geophysical Research Letters.

[47]  Heng Li,et al.  Deep Learning of Subsurface Flow via Theory-guided Neural Network , 2019, ArXiv.

[48]  Gregory J. McCabe,et al.  Independent effects of temperature and precipitation on modeled runoff in the conterminous United States , 2011 .

[49]  Paul A. Johnson,et al.  Similarity of fast and slow earthquakes illuminated by machine learning , 2018, Nature Geoscience.

[50]  Marc Stieglitz,et al.  Modelling daily streamflow at ungauged catchments: what information is necessary? , 2011 .

[51]  Patrick Gallinari,et al.  Deep learning for physical processes: incorporating prior scientific knowledge , 2017, ICLR.

[52]  Hans Kaper,et al.  Chapter 12: Zonal Energy Budget , 2013 .

[53]  Alexander Y. Sun,et al.  How can Big Data and machine learning benefit environment and water management: a survey of methods, applications, and future directions , 2019, Environmental Research Letters.