Sampling properties and empirical estimates of extreme events

Abstract The statistical characteristics of the largest observations in a sample are highly uncertain. In this work we consider the problem of how to define empirical estimates of exceedance probabilities and return periods associated with an ordered sample of observations. Understanding the sampling properties of these quantities is important for assessing the fit of a statistical model and also for placing confidence bounds on estimates of extreme events from Monte Carlo simulations. The empirical distribution function (EDF) is often defined as the expected non-exceedance probability (NEP) associated with sample order statistics. Yet, due to the non-linearity of the relations between return periods, quantiles and NEP, the return period (or quantile) associated with the expected NEP is not equal to the expected return period (or quantile), leading to ambiguity. However, the sampling distributions of exceedance probabilities, return periods and quantiles are, in fact, linked by a simple relation. From this relation, it follows that defining the EDF in terms of the median NEP of the order statistics gives a consistent framework for defining empirical estimates of all three quantities. We demonstrate that the median value of the return period of the largest observation is 44% larger than the return period calculated using the common definition of the EDF in terms of the expected NEP of the order statistics. We also derive some new results about the size of the confidence intervals for exceedance probabilities and return periods.

[1]  G. H. Yu,et al.  A distribution free plotting position , 2001 .

[2]  L. Haan,et al.  Extreme value theory : an introduction , 2006 .

[3]  L. R. Beard Statistical Analysis in Hydrology , 1943 .

[4]  H. Harter,et al.  A monte carlo of plotting positions , 1985 .

[5]  S. Kotz,et al.  Parameter estimation of the generalized Pareto distribution—Part II , 2010 .

[6]  A. Bernard,et al.  The plotting of observations on probability-paper , 1955 .

[7]  J. Teugels,et al.  Statistics of Extremes , 2004 .

[8]  Antonio Lepore An Integrated Approach to shorten wind potential assessment , 2009 .

[10]  M. De,et al.  A new unbiased plotting position formula for Gumbel distribution , 2000 .

[11]  K. Adamowski Plotting Formula for Flood Frequency , 1981 .

[12]  E. J. Gumbel,et al.  Statistics of Extremes. , 1960 .

[13]  María Dolores Estudillo-Martínez,et al.  A proposal for plotting positions in probability plots , 2014 .

[14]  J. G. Bryan,et al.  Introduction to probability and random variables , 1961 .

[15]  Jonathan R. M. Hosking,et al.  A Comparison of Unbiased and Plotting‐Position Estimators of L Moments , 1995 .

[16]  A. Lepore A Note on the Plotting Position Controversy and a New Distribution-free Formula , 2010 .

[17]  F. David,et al.  Statistical Estimates and Transformed Beta-Variables. , 1960 .

[18]  FuglemM.,et al.  Plotting positions for fitting distributions and extreme value analysis , 2013 .

[19]  Nicholas J. Cook,et al.  Rebuttal of “Problems in the extreme value analysis” , 2012 .

[20]  H. Hong,et al.  Plotting positions and approximating first two moments of order statistics for Gumbel distribution: estimating quantiles of wind speed , 2014 .

[21]  Irene A. Stegun,et al.  Handbook of Mathematical Functions. , 1966 .

[22]  Niels C. Lind,et al.  Methods of structural safety , 2006 .

[23]  G. Trenkler Continuous univariate distributions , 1994 .

[24]  H. Leon Harter,et al.  Another look at plotting positions , 1984 .

[25]  A. C. Davison Statistical models: Name Index , 2003 .

[26]  On the plotting of flood‐discharges , 1943 .

[27]  C. Cunnane Unbiased plotting positions — A review , 1978 .

[28]  Nigel W. Arnell,et al.  Unbiased plotting positions for the general extreme value distribution , 1986 .

[29]  A. Lepore Nearly Unbiased Probability Plots for Extreme Value Distributions , 2017, New Statistical Developments in Data Science.

[30]  Ying Luo,et al.  Assessing the mobility benefits of proactive optimal variable speed limit control during recurrent and non-recurrent congestion , 2015 .

[31]  Pasquale Erto,et al.  New Distribution-Free Plotting Position Through an Approximation to the Beta Median , 2013 .

[32]  Clive Anderson,et al.  Estimating Changing Extremes Using Empirical Ranking Methods , 2002 .

[33]  AbuBakr S. Bahaj,et al.  A comparison of estimators for the generalised Pareto distribution , 2011 .

[34]  Irving I. Gringorten,et al.  A plotting rule for extreme probability paper , 1963 .

[35]  A. Takemura,et al.  An objective look at obtaining the plotting positions for QQ-plots , 2014, 1409.6885.

[36]  Nico M. Temme Asymptotic inversion of the incomplete beta function , 1991 .

[37]  W. Weibull A statistical theory of the strength of materials , 1939 .

[38]  S. L. Guo A discussion on unbiased plotting positions for the general extreme value distribution , 1990 .