Fraud Detection Using a Multinomial Logit Model with Missing Information

Recently, Artis, Ayuso, and Guillen (2002, Journal of Risk and Insurance 69: 325-340; henceforth AAG) estimate a logit model using claims data. Some of the claims are categorized as 'honest' and other claims are known to be fraudulent. Using the approach of Hausman, Abrevaya, and Scott-Morton (1998 Journal of Econometrics 87: 239-269), AAG estimate a modified logit model allowing for the possibility that some claims classified as 'honest' might actually be fraudulent. Applying this model to data on Spanish automobile insurance claims, AGG find that 5 percent of the fraudulent claims go undetected. The purpose of this article is to estimate the model of AAG using a logit model with missing information. A constrained version of this model is used to reexamine the Spanish insurance claim data. The results indicate how to identify misclassified claims. We also show how misclassified claims can be identified using the AAG approach. We show that both approaches can be used to probabilistically identify misclassified claims.

[1]  Bronwyn H Hall,et al.  Estimation and Inference in Nonlinear Structural Models , 1974 .

[2]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[3]  The Effects of Insurance on the Possibilities of Fraud , 1984 .

[4]  Geert Ridder,et al.  POOLING STATES IN THE MULTINOMIAL LOGIT MODEL , 1991 .

[5]  John A. Major,et al.  EFD: A hybrid knowledge/statistical‐based system for the detection of fraud , 1992, Int. J. Intell. Syst..

[6]  J. Poterba,et al.  Unemployment Benefits and Labor Market Transitions: A Multinomial Logit Model with Errors in Classification , 1995 .

[7]  P. Picard,et al.  Auditing claims in the insurance market with fraud: The credibility issue , 1996 .

[8]  L. Magder,et al.  Logistic regression when the outcome is measured with uncertainty. , 1997, American journal of epidemiology.

[9]  C. Bollinger,et al.  Modeling Discrete Choice with Response Error: Food Stamp Participation , 1997 .

[10]  J. Hausman,et al.  Misclassification of the dependent variable in a discrete-response setting , 1998 .

[11]  P. Brockett,et al.  Using Kohonen's Self-Organizing Feature Map to Uncover Automobile Bodily Injury Claims Fraud , 1998 .

[12]  Mercedes Ayuso,et al.  Modelling different types of automobile insurance fraud behaviour in the Spanish market , 1999 .

[13]  Guido Dedene,et al.  A Comparison of State-of-The-Art Classification Techniques for Expert Automobile Insurance Claim Fraud Detection , 2002 .

[14]  G. Dionne,et al.  Deductible Contracts Against Fraudulent Claims: Evidence from Automobile Insurance , 2001, Review of Economics and Statistics.

[15]  Patrick L. Brockett,et al.  Fraud Classification Using Principal Component Analysis of Ridits , 2002 .

[16]  R. Derrig Insurance Fraud , 1996 .

[17]  Mercedes Ayuso,et al.  Detection of Automobile Insurance Fraud with Discrete Choice Models and Misclassified Claims , 2002 .

[18]  Montserrat Guillen,et al.  Selection Bias and Auditing Policies for Insurance Claims , 2007 .