A log‐sinh transformation for data normalization and variance stabilization

[1] When quantifying model prediction uncertainty, it is statistically convenient to represent model errors that are normally distributed with a constant variance. The Box-Cox transformation is the most widely used technique to normalize data and stabilize variance, but it is not without limitations. In this paper, a log-sinh transformation is derived based on a pattern of errors commonly seen in hydrological model predictions. It is suited to applications where prediction variables are positively skewed and the spread of errors is seen to first increase rapidly, then slowly, and eventually approach a constant as the prediction variable becomes greater. The log-sinh transformation is applied in two case studies, and the results are compared with one- and two-parameter Box-Cox transformations.

[1]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[2]  C. Perrin,et al.  Improvement of a parsimonious model for streamflow simulation , 2003 .

[3]  B. Bates,et al.  A Markov Chain Monte Carlo Scheme for parameter estimation and inference in conceptual rainfall‐runoff modeling , 2001 .

[4]  R. Sakia The Box-Cox transformation technique: a review , 1992 .

[5]  Benjamin Renard,et al.  Evaluation of statistical models for forecast errors from the HBV model , 2010 .

[6]  A. Brath,et al.  A stochastic approach for assessing the uncertainty of rainfall‐runoff simulations , 2004 .

[7]  R. L. Winkler,et al.  Scoring Rules for Continuous Probability Distributions , 1976 .

[8]  P. Reichert,et al.  Hydrological modelling of the Chaohe Basin in China: Statistical model formulation and Bayesian inference , 2007 .

[9]  Martin Vingron,et al.  Variance stabilization applied to microarray data calibration and to the quantification of differential expression , 2002, ISMB.

[10]  George Kuczera,et al.  Critical evaluation of parameter consistency and predictive uncertainty in hydrological modeling: A case study using Bayesian total error analysis , 2009 .

[11]  G. Kuczera Improved parameter inference in catchment models: 1. Evaluating parameter uncertainty , 1983 .

[12]  Quan J. Wang,et al.  Quantifying parameter uncertainty in stochastic models using the Box–Cox transformation , 2002 .

[13]  Q. J. Wang,et al.  Multisite probabilistic forecasting of seasonal flows for streams with zero value occurrences , 2011 .

[14]  Thomas C. Pagano,et al.  Monthly versus daily water balance models in simulating monthly runoff , 2011 .

[15]  J. Vrugt,et al.  A formal likelihood function for parameter and predictive inference of hydrologic models with correlated, heteroscedastic, and non‐Gaussian errors , 2010 .

[16]  Quan J. Wang,et al.  A Bayesian joint probability modeling approach for seasonal forecasting of streamflows at multiple sites , 2009 .