Input variable selection for water resources systems using a modified minimum redundancy maximum relevance (mMRMR) algorithm

Input variable selection (IVS) is a necessary step in modeling water resources systems. Neglecting this step may lead to unnecessary model complexity and reduced model accuracy. In this paper, we apply the minimum redundancy maximum relevance (MRMR) algorithm to identifying the most relevant set of inputs in modeling a water resources system. We further introduce two modified versions of the MRMR algorithm (α-MRMR and β-MRMR), where α and β are correction factors that are found to increase and decrease as a power-law function, respectively, with the progress of the input selection algorithms and the increase of the number of selected input variables. We apply the proposed algorithms to 22 reservoirs in California to predict daily releases based on a set from a 121 potential input variables. Results indicate that the two proposed algorithms are good measures of model inputs as reflected in enhanced model performance. The α-MRMR and β-MRMR values exhibit strong negative correlation to model performance as depicted in lower root-mean-square-error (RMSE) values.

[1]  Ximing Cai,et al.  The role of hydrologic information in reservoir operation – Learning from historical releases , 2008 .

[2]  Chris H. Q. Ding,et al.  Minimum Redundancy Feature Selection from Microarray Gene Expression Data , 2005, J. Bioinform. Comput. Biol..

[3]  Praveen Kumar,et al.  Ecohydrologic process networks: 1. Identification , 2009 .

[4]  Wesley W. Wallender,et al.  Irrigation Hydrology: Crossing Scales , 2002 .

[5]  Michel Verleysen,et al.  Mutual information for the selection of relevant variables in spectrometric nonlinear modelling , 2006, ArXiv.

[6]  Germán Poveda,et al.  Feedbacks between Hydrological Processes in Tropical South America and Large-Scale Ocean–Atmospheric Phenomena , 1997 .

[7]  Dirk Tomandl,et al.  A Modified General Regression Neural Network (MGRNN) with new, efficient training algorithms as a robust 'black box'-tool for data analysis , 2001, Neural Networks.

[8]  Michel Verleysen,et al.  Resampling methods for parameter-free and robust feature selection with mutual information , 2007, Neurocomputing.

[9]  J. Venn,et al.  . On the diagrammatic and mechanical representation of propositions and reasonings , 2022 .

[10]  Donald F. Specht,et al.  A general regression neural network , 1991, IEEE Trans. Neural Networks.

[11]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[12]  Ashish Sharma,et al.  Selection of a kernel bandwidth for measuring dependence in hydrologic time series using the mutual information criterion , 2001 .

[13]  Ashish Sharma,et al.  Seasonal to interannual rainfall probabilistic forecasts for improved water supply management: Part 1 — A strategy for system predictor identification , 2000 .

[14]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[15]  Holger R. Maier,et al.  Input determination for neural network models in water resources applications. Part 1—background and methodology , 2005 .

[16]  Can Ozan Tan,et al.  Modeling complex nonlinear responses of shallow lakes to fish and hydrology using artificial neural networks , 2006 .

[17]  A. Sankarasubramanian,et al.  Hydroclimatology of the continental United States , 2003 .

[18]  M. Celia,et al.  Effects of heterogeneities on capillary pressure-saturation-relative permeability relationships. , 2002, Journal of contaminant hydrology.

[19]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  R. Golob,et al.  Enhanced artificial neural network inflow forecasting algorithm for run-of-river hydropower plants , 2002 .

[21]  Eric F. Wood,et al.  Effects of soil moisture aggregation on surface evaporative fluxes , 1997 .

[22]  R. O. Strobl,et al.  Artificial neural network exploration of the influential factors in drainage network derivation , 2007 .

[23]  Toby N. Carlson,et al.  The impact of land use — land cover changes due to urbanization on surface microclimate and hydrology: a satellite perspective , 2000 .