Modeling actuarial data with a composite lognormal-Pareto model

The actuarial and insurance industries frequently use the lognormal and the Pareto distributions to model their payments data. These types of payment data are typically very highly positively skewed. Pareto model with a longer and thicker upper tail is used to model the larger loss data, while the larger data with lower frequencies as well as smaller data with higher frequencies are usually modeled by the lognormal distribution. Even though the lognormal model covers larger data with lower frequencies, it fades away to zero more quickly than the Pareto model. Furthermore, the Pareto model does not provide a reasonable parametric fit for smaller data due to its monotonic decreasing shape of the density. Therefore, taking into account the tail behavior of both small and large losses, we were motivated to look for a new avenue to remedy the situation. Here we introduce a two-parameter smooth continuous composite lognormal-Pareto model that is a two-parameter lognormal density up to an unknown threshold value and a two-parameter Pareto density for the remainder. The resulting two-parameter smooth density is similar in shape to the lognormal density, yet its upper tail is larger than the lognormal density and the tail behavior is quite similar to the Pareto density. Parameter estimation techniques and properties of this new composite lognormal-Pareto model are discussed and we compare its performance with the other commonly used models. A simulated example and a well-known fire insurance data set are analyzed to show the importance and applicability of this newly proposed composite lognormal-Pareto model.

[1]  Michel Denuit,et al.  Generalized Pareto Fit to the Society of Actuaries’ Large Claims Database , 2003 .

[2]  A. McNeil Estimating the Tails of Loss Severity Distributions Using Extreme Value Theory , 1997, ASTIN Bulletin.

[3]  J. Teugels,et al.  Practical Analysis of Extreme Values , 1996 .

[4]  P. Berg,et al.  Hogg R. V. and Klugman S. A. (1984). Loss Distributions, xii + 235, £28.45. Chichester: John Wiley & Sons Limited , 1984 .

[5]  Johan Segers,et al.  “Generalized Pareto Fit to the Society of Actuaries’ Large Claims Database,” Ana C. Cebrián, Michel Denuit, and Philippe Lambert, July 2003 , 2004 .

[6]  Sidney I. Resnick,et al.  Discussion of the Danish Data on Large Fire Insurance Losses , 1997, ASTIN Bulletin.

[7]  Henrik Ramlau-Hansen A solvency study in non-life insurance: Part 1. Analyses of fire, windstorm, and glass claims , 1988 .

[8]  J. H. Pollard,et al.  Introductory statistics with applications in general insurance , 1999 .

[9]  Henrik Ramlau-Hansen A solvency study in non-life insurance , 1988 .

[10]  Stuart A. Klugman,et al.  Loss Models: From Data to Decisions , 1998 .

[11]  J. Corcoran Modelling Extremal Events for Insurance and Finance , 2002 .

[12]  Lena Jaeger,et al.  Loss Models: From Data to Decisions , 2006 .

[13]  Howard R. Waters,et al.  Loss Models: from Data to Decisions. By Stuart Klugman, Harry Panjer and Gordon Willmot [John Wiley & Sons, New York, 1998] , 1999 .

[14]  Johan Segers,et al.  Discussion of "Generalized Pareto fit to the Society of Actuaries large claims database" , 2004 .