Streamflow estimation at partially gaged sites using multiple-dependence conditions via vine copulas

Abstract. Reliable estimates of missing streamflow values are relevant for water resource planning and management. This study proposes a multiple-dependence condition model via vine copulas for the purpose of estimating streamflow at partially gaged sites. The proposed model is attractive in modeling the high-dimensional joint distribution by building a hierarchy of conditional bivariate copulas when provided a complex streamflow gage network. The usefulness of the proposed model is firstly highlighted using a synthetic streamflow scenario. In this analysis, the bivariate copula model and a variant of the vine copulas are also employed to show the ability of the multiple-dependence structure adopted in the proposed model. Furthermore, the evaluations are extended to a case study of 54 gages located within the Yadkin–Pee Dee River basin in the eastern USA. Both results inform that the proposed model is better suited for infilling missing values. To be specific, the proposed multiple-dependence model shows the improvement of 9.2 % on average compared to the bivariate model from the historical case study. The performance of the vine copula is further compared with six other infilling approaches to confirm its applicability. Results demonstrate that the proposed model produces more reliable streamflow estimates than the other approaches. In particular, when applied to partially gaged sites with sufficient available data, the proposed model clearly outperforms the other models. Even though the model is illustrated by a specific case, it can be extended to other regions with diverse hydro-climatological variables for the objective of infilling.

[1]  S. F. Railsback,et al.  Comparison of regression and time-series methods for synthesizing missing streamflow records , 1989 .

[2]  Duc Khuong Nguyen,et al.  Global financial crisis and dependence risk analysis of sector portfolios: a vine copula approach , 2017 .

[3]  Y. Dinpashoh,et al.  Modeling flood event characteristics using D-vine structures , 2017, Theoretical and Applied Climatology.

[4]  A. Daneshkhah,et al.  Probabilistic modeling of flood characterizations with parametric and minimum information pair-copula model , 2016 .

[5]  Giovanni Ravazzani,et al.  Regionalization of Flow-Duration Curves through Catchment Classification with Streamflow Signatures and Physiographic–Climate Indices , 2016 .

[6]  Richard M. Vogel,et al.  On the deterministic and stochastic use of hydrologic models , 2016 .

[7]  Rochus Niemierko,et al.  A D-vine copula quantile regression approach for the prediction of residential heating energy consumption based on historical data , 2019, Applied Energy.

[8]  Vijay P. Singh,et al.  Modeling multisite streamflow dependence with maximum entropy copula , 2013 .

[9]  P. Embrechts,et al.  Dependence modeling with copulas , 2007 .

[10]  Claudia Czado,et al.  Pair-Copula Constructions of Multivariate Copulas , 2010 .

[11]  Abdul Aziz Jemain,et al.  IDF relationships using bivariate copula for storm events in Peninsular Malaysia , 2012 .

[12]  Ximing Cai,et al.  Prediction of regional streamflow frequency using model tree ensembles , 2014 .

[13]  Gianfausto Salvadori,et al.  Frequency analysis via copulas: Theoretical aspects and applications to hydrological events , 2004 .

[14]  Richard M. Vogel,et al.  On the probability distribution of daily streamflow in the United States , 2017 .

[15]  Kuk-Hyun Ahn,et al.  Use of a nonstationary copula to predict future bivariate low flow frequency in the Connecticut river basin , 2016 .

[16]  Christian Genest,et al.  Beyond simplified pair-copula constructions , 2012, J. Multivar. Anal..

[17]  Ozgur Kisi,et al.  A wavelet-support vector machine conjunction model for monthly streamflow forecasting , 2011 .

[18]  Peder Hjorth,et al.  Imputation of missing values in a precipitation–runoff process database , 2009 .

[19]  Hoshin Vijai Gupta,et al.  Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling , 2009 .

[20]  S. Steinschneider,et al.  Hierarchical Bayesian Model for Streamflow Estimation at Ungauged Sites via Spatial Scaling in the Great Lakes Basin , 2019, Journal of Water Resources Planning and Management.

[21]  S. Simonovic,et al.  Bivariate flood frequency analysis. Part 2: a copula‐based approach with mixed marginal distributions , 2009 .

[22]  Ton H. Snelder,et al.  Comparing methods for estimating flow duration curves at ungauged sites , 2012 .

[23]  Claudia Czado,et al.  D-vine copula based quantile regression , 2015, Comput. Stat. Data Anal..

[24]  Claudia Czado,et al.  R‐vine models for spatial time series with an application to daily mean temperature , 2014, Biometrics.

[25]  Zhiyong Liu,et al.  A multivariate conditional model for streamflow prediction and spatial precipitation refinement , 2015 .

[26]  Fateh Chebana,et al.  Multivariate missing data in hydrology – Review and applications , 2017 .

[27]  David M. Zimmer Analyzing Comovements in Housing Prices Using Vine Copulas , 2015 .

[28]  Attilio Castellarin,et al.  Geostatistical prediction of flow–duration curves in an index-flow framework , 2014 .

[29]  W. Asquith,et al.  Copula Theory as a Generalized Framework for Flow‐Duration Curve Based Streamflow Estimates in Ungaged and Partially Gaged Catchments , 2019, Water Resources Research.

[30]  Eike Christian Brechmann,et al.  Conditional copula simulation for systemic risk stress testing , 2013 .

[31]  Shuo Wang,et al.  Short-term power load probability density forecasting method using kernel-based support vector quantile regression and Copula theory , 2017 .

[32]  Qing Xu,et al.  Evaluating forecast performances of the quantile autoregression models in the present global crisis in international equity markets , 2013 .

[33]  Gery Geenens,et al.  Probit Transformation for Kernel Density Estimation on the Unit Interval , 2013, 1303.4121.

[34]  M. Bhatti,et al.  Recent development in copula and its applications to the energy, forestry and environmental sciences , 2019, International Journal of Hydrogen Energy.

[35]  S. Vicente‐Serrano,et al.  Gap Filling of Monthly Temperature Data and Its Effect on Climatic Variability and Trends , 2019, Journal of Climate.

[36]  A. Frigessi,et al.  Pair-copula constructions of multiple dependence , 2009 .

[37]  Guangtao Fu,et al.  Copula-based frequency analysis of overflow and flooding in urban drainage systems , 2014 .

[38]  Claudia Czado,et al.  Analyzing Dependent Data with Vine Copulas , 2019, Lecture Notes in Statistics.

[39]  Attilio Castellarin,et al.  Regional flow-duration curves: reliability for ungauged basins , 2004 .

[40]  A. Bárdossy,et al.  Interpolation of precipitation under topographic influence at different time scales , 2013 .

[41]  Friedrich Schmid,et al.  Multivariate conditional versions of Spearman's rho and related measures of tail dependence , 2007 .

[42]  Vladimir U. Smakhtin,et al.  Daily flow time series patching or extension: a spatial interpolation approach based on flow duration curves , 1996 .

[43]  Thibault Vatter,et al.  Generalized additive models for conditional dependence structures , 2015, J. Multivar. Anal..

[44]  Claudia Czado,et al.  Simplified pair copula constructions - Limitations and extensions , 2013, J. Multivar. Anal..

[45]  N. Verhoest,et al.  A continuous rainfall model based on vine copulas , 2015 .

[46]  T. Ouarda,et al.  Regional flood-duration frequency modeling in the changing environment , 2006 .

[47]  Roger M. Cooke,et al.  Probability Density Decomposition for Conditionally Dependent Random Variables Modeled by Vines , 2001, Annals of Mathematics and Artificial Intelligence.

[48]  Wang Lu A high-dimensional vine copula approach to comovement of China's financial markets , 2013, 2013 International Conference on Management Science and Engineering 20th Annual Conference Proceedings.

[49]  Kuk-Hyun Ahn,et al.  Regional flood frequency analysis using spatial proximity and basin characteristics: Quantile regression vs. parameter regression technique , 2016 .

[50]  G. Mendicino,et al.  Evaluation of parametric and statistical approaches for the regionalization of flow duration curves in intermittent regimes , 2013 .

[51]  Rui Kang,et al.  Multivariate Degradation Modeling of Smart Electricity Meter with Multiple Performance Characteristics via Vine Copulas , 2017, Qual. Reliab. Eng. Int..

[52]  Claudia Czado,et al.  Selecting and estimating regular vine copulae and application to financial returns , 2012, Comput. Stat. Data Anal..

[53]  Soroosh Sorooshian,et al.  Toward improved calibration of hydrologic models: Combining the strengths of manual and automatic methods , 2000 .

[54]  Lu Zhang,et al.  A new regionalization approach and its application to predict flow duration curve in ungauged basins , 2010 .

[55]  V. Singh,et al.  Copula-based method for multisite monthly and daily streamflow simulation , 2014 .

[56]  T. Bedford,et al.  Vines: A new graphical model for dependent random variables , 2002 .

[57]  Ataur Rahman,et al.  Regional flood frequency analysis in arid regions : a case study for Australia , 2012 .