Discrete principal‐monotonicity inference for hydro‐system analysis under irregular nonlinearities, data uncertainties, and multivariate dependencies. Part I: methodology development

Hydrological system analyses are challenged by complexities of irregular nonlinearities, data uncertainties, and multivariate dependencies. Among them, the irregular nonlinearities mainly represent inexistence of regular functions for robustly simulating highly complicated relationships between variables. Few existing studies can enable reliable simulation of hydrological processes under these complexities. This may lead to decreased robustness of the constructed models, unfeasibility of suggestions for human activities, and damages to socio-economy and eco-environment. In the first of two companion papers, a discrete principal-monotonicity inference (DPMI) method is proposed for hydrological systems analysis under these complexities. Normalization of non-normally distributed samples and invertible restoration of modelling results are enabled through a discrete distribution transformation approach. To mitigate data uncertainties, statistical inference is employed to assess the significance of differences among samples. The irregular nonlinearity between the influencing factors (i.e. predictors) and the hydrological variable of interest (i.e. the predictand) is interpreted as piecewise monotonicity. Monotonicity is further represented as principal monotonicity under multivariate dependencies. Based on stepwise classification and cluster analyses, all paired samples representing the responsive relationship between the predictors and the predictand are discretized as a series of end nodes. A prediction approach is advanced for estimating the predictand value given any combination of predictors. The DPMI method can reveal evolvement rules of hydrological systems under these complexities. Reliance of existing hydro-system analysis methods on predefined functional forms is removed, avoiding artificial disturbances, e.g. empiricism in selecting model functions under irregular nonlinearities, on the modelling process. Both local and global significances of predictors in driving the evolution of hydrological variables are identified. An analysis of interactions among these complexities is also achieved. The understanding obtained from the DPMI process and associated results can facilitate hydrological prediction, guide water resources management, improve hydro-system analysis methods, or support hydrological systems analysis in other cases. The effectiveness and advantages of DPMI will be demonstrated through a case study of streamflow simulation in Xingshan Watershed, China, in another paper. Copyright © 2016 John Wiley & Sons, Ltd.

[1]  Guohe Huang,et al.  Convex contractive interval linear programming for resources and environmental systems management , 2015, Stochastic Environmental Research and Risk Assessment.

[2]  U. Filobello-Niño,et al.  High Accurate Simple Approximation of Normal Distribution Integral , 2012 .

[3]  Linda See,et al.  Applying soft computing approaches to river level forecasting , 1999 .

[4]  R. Srikanthan,et al.  Log Pearson III distribution — Is it applicable to flood frequency analysis of Australian streams? , 1981 .

[5]  Guanhui Cheng,et al.  Synchronic interval Gaussian mixed-integer programming for air quality management. , 2015, The Science of the total environment.

[6]  John R. Williams,et al.  LARGE AREA HYDROLOGIC MODELING AND ASSESSMENT PART I: MODEL DEVELOPMENT 1 , 1998 .

[7]  J. C. Wüst Current Prediction for Shipping Guidance , 1995, SNN Symposium on Neural Networks.

[8]  Wei Li,et al.  Fuzzy interval programming for energy and environmental systems management under constraint-violation and energy-substitution effects: A case study for the City of Beijing , 2014 .

[9]  P. E. O'connell,et al.  An introduction to the European Hydrological System — Systeme Hydrologique Europeen, “SHE”, 2: Structure of a physically-based, distributed modelling system , 1986 .

[10]  Philip W. Gassman,et al.  Simulation of Nutrient Losses from Chicken Litter Applications in East Central Texas with APEX and SWAT , 2001 .

[11]  S. de Bruin,et al.  Soil-landscape modelling using fuzzy c-means clustering of attribute data derived from a Digital Elevation Model (DEM) , 1998 .

[12]  Bellie Sivakumar,et al.  Chaos theory in hydrology: important issues and interpretations , 2000 .

[13]  P. E. O'connell,et al.  An introduction to the European Hydrological System — Systeme Hydrologique Europeen, “SHE”, 1: History and philosophy of a physically-based, distributed modelling system , 1986 .

[14]  Lei Chen,et al.  Effect of Rainfall Measurement Errors on Nonpoint-Source Pollution Model Uncertainty , 2016 .

[15]  L. F. Huggins,et al.  ANSWERS: A Model for Watershed Planning , 1980 .

[16]  José A. Revilla,et al.  USE OF NEURAL NETWORKS IN DESIGN OF COASTAL SEWAGE SYSTEMS , 1998 .

[17]  Philip Jordan,et al.  A comparison of SWAT, HSPF and SHETRAN/GOPC for modelling phosphorus export from three catchments in Ireland. , 2007, Water research.

[18]  Jeffrey G. Arnold,et al.  APPLICATION OF SWAT FOR THE UPPER NORTH BOSQUE RIVER WATERSHED , 2000 .

[19]  E. J. Monke,et al.  A Mathematical Model for Simulating the Hydrologic Response of a Watershed , 1968 .

[20]  Guohe Huang,et al.  A robust optimization modelling approach for managing water and farmland use between anthropogenic modification and ecosystems protection under uncertainties , 2015 .

[21]  S. Shapiro,et al.  An Analysis of Variance Test for Normality (Complete Samples) , 1965 .

[22]  M. Mendeş,et al.  Type I Error Rate and Power of Three Normality Tests , 2003 .

[23]  Guohe Huang,et al.  Robust planning of energy management systems with environmental and constraint-conservative considerations under multiple uncertainties , 2013 .

[24]  M. C. Jones,et al.  A reliable data-based bandwidth selection method for kernel density estimation , 1991 .

[25]  Guohe Huang,et al.  A dual-inexact fuzzy stochastic model for water resources management and non-point source pollution mitigation under multiple uncertainties , 2014 .

[26]  Julian D. Olden,et al.  A framework for hydrologic classification with a review of methodologies and applications in ecohydrology , 2012 .

[27]  Denis A. Hughes,et al.  A semi-distributed, variable time interval model of catchment hydrology—structure and parameter estimation procedures , 1994 .

[28]  M. Scardi Artificial neural networks as empirical models for estimating phytoplankton production , 1996 .

[29]  D. Legates,et al.  Evaluating the use of “goodness‐of‐fit” Measures in hydrologic and hydroclimatic model validation , 1999 .

[30]  G. S. Stelling A Numerical Method for Inundation Simulations , 2002 .

[31]  肖尚斌,et al.  Phosphorus Fractions and Its Summer’s Release Flux From Sediment in the China’s Three Gorges Reservoir , 2015 .

[32]  Vijay P. Singh,et al.  Characterizing the spatial variability of groundwater quality using the entropy theory: I. Synthetic data , 2004 .

[33]  Guohe Huang,et al.  Coupled planning of water resources and agricultural landuse based on an inexact-stochastic programming model , 2014, Frontiers of Earth Science.

[34]  G. Box,et al.  On the Experimental Attainment of Optimum Conditions , 1951 .

[35]  H. Lilliefors On the Kolmogorov-Smirnov Test for Normality with Mean and Variance Unknown , 1967 .

[36]  B. Bobée,et al.  Multivariate hydrological frequency analysis using copulas , 2004 .

[37]  F. Recknagel,et al.  Artificial neural network approach for modelling and prediction of algal blooms , 1997 .

[38]  F. Massey The Kolmogorov-Smirnov Test for Goodness of Fit , 1951 .

[39]  Y. Hong,et al.  Precipitation Estimation from Remotely Sensed Imagery Using an Artificial Neural Network Cloud Classification System , 2004 .

[40]  Muni S. Srivastava,et al.  A measure of skewness and kurtosis and a graphical method for assessing multivariate normality , 1984 .

[41]  M. Rosenblatt A CENTRAL LIMIT THEOREM AND A STRONG MIXING CONDITION. , 1956, Proceedings of the National Academy of Sciences of the United States of America.

[42]  Guohe Huang,et al.  An inexact optimization modeling approach for supporting energy systems planning and air pollution mitigation in Beijing city , 2012 .

[43]  Michael Kessler,et al.  A human‐induced downward‐skewed elevational abundance distribution of pteridophytes in the Bolivian Andes , 2007 .

[44]  S. L. Jacoby A mathematical model for nonlinear hydrologic systems , 1966 .

[45]  Peter C. Young,et al.  Data-based mechanistic modelling and the rainfall-flow non-linearity. , 1994 .

[46]  M. Levent Kavvas,et al.  Nonlinear Hydrologic Processes: Conservation Equations for Determining Their Means and Probability Distributions , 2003 .

[47]  Klaus J. Miescke,et al.  Statistical Decision Theory , 2007 .

[48]  Vahid Nourani,et al.  Application of Entropy Concept for Input Selection of Wavelet-ANN Based Rainfall-Runoff Modeling , 2016 .

[49]  Vazken Andréassian,et al.  Spatial proximity, physical similarity, regression and ungaged catchments: A comparison of regionalization approaches based on 913 French catchments , 2008 .

[50]  J. Dooge THE HYDROLOGIC CYCLE AS A CLOSED SYSTEM , 1968 .

[51]  Y. C. Wang,et al.  Phosphorus Fractions and Its Summer's Release Flux From Sediment in the China's Three Gorges Reservoir , 2015 .

[52]  Jean-Marie Dufour,et al.  Simulation�?Based Finite Sample Normality Tests in Linear Regressions , 1998 .

[53]  Guohe Huang,et al.  An interval-parameter minimax regret programming approach for power management systems planning under uncertainty , 2011 .

[54]  A. Elhan,et al.  Investigation of Four Different Normality Tests in Terms of Type 1 Error Rate and Power under Different Distributions , 2006 .

[55]  H. Scheffé Practical Solutions of the Behrens-Fisher Problem , 1970 .

[56]  null null,et al.  Artificial Neural Networks in Hydrology. II: Hydrologic Applications , 2000 .

[57]  P. Royston Approximating the Shapiro-Wilk W-test for non-normality , 1992 .

[58]  Dawei Han,et al.  Uncertainty Assessment in Environmental Risk through Bayesian Networks , 2015 .

[59]  Mahdi Zarghami,et al.  The Use of Statistical Weather Generator, Hybrid Data Driven and System Dynamics Models for Water Resources Management under Climate Change , 2015 .

[60]  Guohe Huang,et al.  Planning of municipal solid waste management systems under dual uncertainties: a hybrid interval stochastic programming approach , 2008 .

[61]  Guangtao Fu,et al.  Classified real-time flood forecasting by coupling fuzzy clustering and neural network , 2010 .

[62]  V. Klemeš,et al.  Operational Testing of Hydrological Simulation Models , 2022 .