A time-efficient model for detecting fraudulent health insurance claims using Artificial neural networks

Health insurance has come in rescue for people, in reducing their medical expenditure, which otherwise would have taken a high toll on their income. There are both private and government-funded agencies serving in the health insurance sector. With soaring high demand among the public, healthcare is not safe from the fraudsters. The usage of computerized techniques has proved this area even more vulnerable. It has become highly essential to detect this fraud at the earliest, such that the impact of loss could be minimized. This paper throws light on a framework in detecting fraud with faster learning and identifying the maximum number of fraud instances. The usual problems, like data heterogeneity and imbalanced classification of classes, have also been discussed in this paper. As a part of developing an efficient framework for fraud detection, we applied several learners and optimization techniques. The framework has evaluated with claims dataset obtained from the CMS Medicare facility. We finally reached to a conclusion that the application of Multi-Layer Perceptron, a feed-forward Neural Network with genetic algorithm optimization had helped in enhancing the results and gain higher accuracy. PCA was also applied to pick the most significant variables. The use of PCA and other appropriate pre-processing techniques has also helped us in reducing the training time, thereby achieving efficiency in terms of accuracy and speed.

[1]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[2]  AlahakoonDamminda,et al.  Minority report in fraud detection , 2004 .

[3]  Taghi M. Khoshgoftaar,et al.  A Novel Method for Fraudulent Medicare Claims Detection from Expected Payment Deviations (Application Paper) , 2016, 2016 IEEE 17th International Conference on Information Reuse and Integration (IRI).

[4]  Eşref Adalı,et al.  Multilayer perceptron neural network technique for fraud detection , 2017, 2017 International Conference on Computer Science and Engineering (UBMK).

[5]  Shiguo Wang,et al.  A Comprehensive Survey of Data Mining-Based Accounting-Fraud Detection Research , 2010, 2010 International Conference on Intelligent Computation Technology and Automation.

[6]  Rashedur M. Rahman,et al.  Improving accuracy of students’ final grade prediction model using optimal equal width binning and synthetic minority over-sampling technique , 2015, Decis. Anal..

[7]  Taghi M. Khoshgoftaar,et al.  Medicare Fraud Detection Using Machine Learning Methods , 2017, 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA).

[8]  Taghi M. Khoshgoftaar,et al.  Medicare Fraud Detection Using Random Forest with Class Imbalanced Big Data , 2018, 2018 IEEE International Conference on Information Reuse and Integration (IRI).

[9]  Taghi M. Khoshgoftaar,et al.  Predicting Medical Provider Specialties to Detect Anomalous Insurance Claims , 2016, 2016 IEEE 28th International Conference on Tools with Artificial Intelligence (ICTAI).

[10]  Robin Aly,et al.  Categorizing and Describing the Types of Fraud in Healthcare , 2015 .

[11]  Taghi M. Khoshgoftaar,et al.  Big Data fraud detection using multiple medicare data sources , 2018, J. Big Data.

[12]  Taghi M. Khoshgoftaar,et al.  Multivariate outlier detection in medicare claims payments applying probabilistic programming methods , 2017, Health Services and Outcomes Research Methodology.

[13]  Absalom E. Ezugwu,et al.  Ant colony optimization edge selection for support vector machine speed optimization , 2019, Neural Computing and Applications.

[14]  Dean Zhao,et al.  A novel optimized GA–Elman neural network algorithm , 2017, Neural Computing and Applications.

[15]  Chengqi Zhang,et al.  Cost-Sensitive Parallel Learning Framework for Insurance Intelligence Operation , 2019, IEEE Transactions on Industrial Electronics.

[16]  Darrell Whitley,et al.  A genetic algorithm tutorial , 1994, Statistics and Computing.

[17]  Damminda Alahakoon,et al.  Minority report in fraud detection: classification of skewed data , 2004, SKDD.

[18]  Bouabid El Ouahidi,et al.  Using Genetic Algorithm to Improve Classification of Imbalanced Datasets for Credit Card Fraud Detection , 2018, 2018 2nd Cyber Security in Networking Conference (CSNet).

[19]  Taghi M. Khoshgoftaar,et al.  A survey on addressing high-class imbalance in big data , 2018, Journal of Big Data.

[20]  Anthony Brabazon,et al.  Identifying online credit card fraud using Artificial Immune Systems , 2010, IEEE Congress on Evolutionary Computation.

[21]  Shamik Sural,et al.  Credit card fraud detection: A fusion approach using Dempster-Shafer theory and Bayesian learning , 2009, Inf. Fusion.

[22]  Chang-Tien Lu,et al.  Survey of fraud detection techniques , 2004, IEEE International Conference on Networking, Sensing and Control, 2004.

[23]  Anazida Zainal,et al.  Fraud detection system: A survey , 2016, J. Netw. Comput. Appl..

[24]  Dustin van der Haar,et al.  A Comparison of Machine Learning Methods Applicable to Healthcare Claims Fraud Detection , 2019, ICITS.

[25]  Alair Pereira do Lago,et al.  Credit Card Fraud Detection with Artificial Immune System , 2008, ICARIS.

[26]  Mohammad Kazem Akbari,et al.  A novel model for credit card fraud detection using Artificial Immune Systems , 2014, Appl. Soft Comput..

[27]  William J Rudman,et al.  Healthcare fraud and abuse. , 2009, Perspectives in health information management.

[28]  Raju S. Bapi,et al.  An Unbalanced Data Classification Model Using Hybrid Sampling Technique for Fraud Detection , 2007, PReMI.

[29]  Vadlamani Ravi,et al.  A novel hybrid undersampling method for mining unbalanced datasets in banking and insurance , 2015, Eng. Appl. Artif. Intell..