Benchmarking data-driven rainfall-runoff models in Great Britain: a comparison of LSTM-based models with four lumped conceptual models

Long short-term memory models (LSTMs) are recurrent neural networks from the emerging field of Deep Learning (DL), which have shown recent promise when predicting time-series especially when data are abundant. Rainfall-runoff modelling presents a challenge, yet accurate hydrological models are vital for flood forecasting, hazard impact assessment, and to assess the potential effects of climate change on floods and water resources. In this study, we compare the performance of two DL-based models, a LSTM and an Entity Aware LSTM (EA LSTM). The DL models were trained using a newly published data set, CAMELS-GB, for a sample of 518 catchments across Great Britain. To identify spatial and seasonal patterns in model performance, we compare the DL models against benchmark outputs from four lumped conceptual models recently configured for rainfall-runoff modelling in Great Britain. Our findings show that the LSTM models simulate discharge with consistently high model performance scores, including in catchments typically considered difficult to model. The LSTM achieves a mean catchment NSE of 0.88 (0.86 for the EALSTM), which represents a performance improvement of 10 %–16 % compared with the benchmark conceptual models. Seasonal and spatial patterns indicate that the largest performance improvement relative to the benchmark is in the drier summer months and in drier catchments in the South East of England. By comparing LSTMs with conceptual models, we diagnose possible reasons for their different performance. We suggest that LSTMs offer useful predictive capability for rainfall-runoff modelling in Great Britain and elsewhere and note their value to support process understanding in locations where processes are less well understood.

[1]  V. Bell,et al.  From Catchment to National Scale Rainfall-Runoff Modelling: Demonstration of a Hydrological Modelling Framework , 2014 .

[2]  T. McMahon,et al.  Historical development of rainfall‐runoff modeling , 2020, WIREs Water.

[3]  P. E. O'connell,et al.  An introduction to the European Hydrological System — Systeme Hydrologique Europeen, “SHE”, 1: History and philosophy of a physically-based, distributed modelling system , 1986 .

[4]  Dimitri Solomatine,et al.  Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - Part 2: Application , 2009 .

[5]  Rainfall–runoff prediction at multiple timescales with a single Long Short-Term Memory network , 2021 .

[6]  Louise J. Bracken,et al.  The concept of hydrological connectivity and its contribution to understanding runoff‐dominated geomorphic systems , 2007 .

[7]  Pascal Côté,et al.  Comparison of Stochastic Optimization Algorithms in Hydrological Model Calibration , 2014 .

[8]  Hoshin Vijai Gupta,et al.  A process‐based diagnostic approach to model evaluation: Application to the NWS distributed hydrologic model , 2008 .

[9]  Keith Beven,et al.  Deep learning, hydrological processes and the uniqueness of place , 2020, Hydrological Processes.

[10]  Vladan Babovic,et al.  Hydrologically Informed Machine Learning for Rainfall‐Runoff Modeling: A Genetic Programming‐Based Toolkit for Automatic Model Induction , 2020, Water Resources Research.

[11]  Dimitri Solomatine,et al.  Improving AI System Awareness of Geoscience Knowledge: Symbiotic Integration of Physical Approaches and Deep Learning , 2020, Geophysical Research Letters.

[12]  Kuolin Hsu,et al.  HESS Opinions: Deep learning as a promising avenue toward knowledge discovery in water sciences , 2018 .

[13]  Hoshin Vijai Gupta,et al.  Does Information Theory Provide a New Paradigm for Earth Science? Hypothesis Testing , 2020, Water Resources Research.

[14]  Y. Liu,et al.  What is the hydrologically effective area of a catchment? , 2020, Environmental Research Letters.

[15]  Joachim Denzler,et al.  Deep learning and process understanding for data-driven Earth system science , 2019, Nature.

[16]  Martyn P. Clark,et al.  The CAMELS data set: catchment attributes and meteorology for large-sample studies , 2017 .

[17]  J. Freer,et al.  CAMELS-GB: hydrometeorological time series and landscape attributes for 671 catchments in Great Britain , 2020 .

[18]  Martyn P. Clark,et al.  Framework for Understanding Structural Errors (FUSE): A modular framework to diagnose differences between hydrological models , 2008 .

[19]  Sepp Hochreiter,et al.  Towards learning universal, regional, and local hydrological behaviors via machine learning applied to large-sample datasets , 2019 .

[20]  J. Nash,et al.  River flow forecasting through conceptual models part I — A discussion of principles☆ , 1970 .

[21]  Karsten Schulz,et al.  Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks , 2018, Hydrology and Earth System Sciences.

[22]  Hoshin Vijai Gupta,et al.  Large-sample hydrology: a need to balance depth with breadth , 2013 .

[23]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..