Analyzing Large Workers’ Compensation Claims Using Generalized Linear Models and Monte Carlo Simulation

Insurance practitioners rely on statistical models to predict future claims in order to provide financial protection. Proper predictive statistical modeling is more challenging when analyzing claims with lower frequency, but high costs. The paper investigated the use of predictive generalized linear models (GLMs) to address this challenge. Workers’ compensation claims with costs equal to or more than US$100,000 were analyzed in agribusiness industries in the Midwest of the USA from 2008 to 2016. Predictive GLMs were built with gamma, Weibull, and lognormal distributions using the lasso penalization method. Monte Carlo simulation models were developed to check the performance of predictive models in cost estimation. The results show that the GLM with gamma distribution has the highest predictivity power (R2 = 0.79). Injury characteristics and worker’s occupation were predictive of large claims’ occurrence and costs. The conclusions of this study are useful in modifying and estimating insurance pricing within high-risk agribusiness industries. The approach of this study can be used as a framework to forecast workers’ compensation claims amounts with rare, high-cost events in other industries. This work is useful for insurance practitioners concerned with statistical and predictive modeling in financial risk analysis.

[1]  Kumer Pial Das,et al.  Understanding extreme stock trading volume by generalized Pareto distribution , 2016 .

[2]  H. Zou The Adaptive Lasso and Its Oracle Properties , 2006 .

[3]  Peng Shi,et al.  Actuarial Applications of a Hierarchical Insurance Claims Model , 2009, ASTIN Bulletin.

[4]  Roberto Luzzi,et al.  Large Occupational Accidents Data Analysis with a Coupled Unsupervised Algorithm: The S.O.M. K-Means Method. An Application to the Wood Industry , 2018, Safety.

[5]  Mahdi Hasanipanah,et al.  Risk Assessment and Prediction of Flyrock Distance by Combined Multiple Regression Analysis and Monte Carlo Simulation of Quarry Blasting , 2016, Rock Mechanics and Rock Engineering.

[6]  Adel Badri,et al.  Development of a Preliminary Model for Evaluating Occupational Health and Safety Risk Management Maturity in Small and Medium-Sized Enterprises , 2018 .

[7]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[8]  S. Haneuse,et al.  On the Assessment of Monte Carlo Error in Simulation-Based Statistical Analyses , 2009, The American statistician.

[9]  David R. Anderson,et al.  Multimodel Inference , 2004 .

[10]  Qihe Tang Heavy Tails of Discounted Aggregate Claims in the Continuous-Time Renewal Model , 2007, Journal of Applied Probability.

[11]  CALCULATION OF THE CAPITAL REQUIREMENT USING THE MONTE CARLO SIMULATION FOR NON-LIFE INSURANCE , 2016 .

[12]  Yuvin Chinniah,et al.  Design of a Self-Audit Tool for the Application of Lockout on Machinery in the Province of Quebec, Canada to Control Hazardous Energies , 2019, Safety.

[13]  U. Epa,et al.  Guiding Principles for Monte Carlo Analysis , 1997 .

[14]  Trevor Hastie,et al.  Linear Model Selection and Regularization , 2021, Springer Texts in Statistics.

[15]  Ferry Butar Butar,et al.  An Insight Into Heavy-Tailed Distribution , 2010 .

[16]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[17]  Peng Shi,et al.  Long-tail longitudinal modeling of insurance company expenses , 2010 .

[18]  Rodrigo S. Targino,et al.  Bayesian Modelling, Monte Carlo Sampling and Capital Allocation of Insurance Risks , 2017 .

[19]  Søren Asmussen,et al.  Conditional Monte Carlo for sums, with applications to insurance and finance , 2018, Annals of Actuarial Science.

[20]  Ishita Sengupta,et al.  Workers' Compensation: Benefits, Coverage, and Costs, 2007 , 2009 .

[21]  Steven Haberman,et al.  Generalized linear models and actuarial science , 1996 .

[22]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[23]  Leo Guelman,et al.  Gradient boosting trees for auto insurance loss cost modeling and prediction , 2012, Expert Syst. Appl..

[24]  Laurel J. Fish,et al.  Statistical Analysis Of A Class: Monte Carlo And Multiple Imputation Spreadsheet Methods For Estimation And Extrapolation , 2017 .

[25]  David R. Anderson,et al.  Bayesian Methods in Cosmology: Model selection and multi-model inference , 2009 .

[26]  V. Packová,et al.  Loss Distributions in Insurance Risk Management , 2015 .

[27]  D. C. Nath,et al.  Modeling of Insurance Data through Two Heavy Tailed Distributions: Computations of Some of Their Actuarial Quantities through Simulation from Their Equilibrium Distributions and the Use of Their Convolutions , 2016 .

[28]  Clive L. Keatinge MODELING LOSSES WITH THE MIXED EXPONENTIAL DISTRIBUTION , 2000 .

[29]  F. Lindskog,et al.  Insurance valuation : A computable multi-period cost-of-capital approach , 2016, 1607.04100.

[30]  Lukas Josef Hahn Multi-Year Non-Life Insurance Risk of Dependent Lines of Business in the Multivariate Additive Loss Reserving Model , 2017 .

[31]  GLM in Life Insurance 1 GENERALIZED LINEAR MODELS IN LIFE INSURANCE : DECREMENTS AND RISK FACTOR ANALYSIS UNDER SOLVENCY II , 2008 .

[32]  Edward W. Frees,et al.  Predictive Modeling Applications in Actuarial Science , 2014 .

[33]  W. L. Dunn,et al.  Monte Carlo methods for design and analysis of radiation detectors , 2009 .

[34]  S. Mingoti,et al.  Clustering Algorithms for Categorical Data: A Monte Carlo Study , 2012 .

[35]  Michelle Xia Bayesian Adjustment for Insurance Misrepresentation in Heavy-Tailed Loss Regression , 2018 .

[36]  Scott D. Szymendera Workers’ Compensation: Overview and Issues , 2017 .

[37]  A. Atherly,et al.  Health risk factors as predictors of workers' compensation claim occurrence and cost , 2016, Occupational and Environmental Medicine.