The term fraud refers to an intentional deception or misrepresentation made by a person or an entity, with the knowledge that the deception could result in some kinds of unauthorized benefits to that person or entity. Fraud detection, being part of the overall fraud control, should be automated as much as possible to reduce the manual steps of a screening/checking process. In the health care systems, fraud has led to significant additional expenses. Development of a cost-effective health care system requires effective ways to detect fraud. It is impossible to be certain about the legitimacy of and intention behind an application or transaction. Given the reality, the best cost effective option is to infer potential fraud from the available data using mathematical models and suitable algorithms. Among these, in recent years coclustering has emerged as a powerful data mining tool for analysis of dyadic data connecting two entities. An important data mining task pertinent to dyadic data is to obtain a clustering of each entity. In this paper application of Bayesian ideas in healthcare fraud detection will be presented. The emphasis will be on the use of Bayesian co-clustering and link analysis methodologies to identify potentially fraudulent providers and beneficiaries who have unusual group memberships. Detection of such unusual memberships will be helpful to decision makers for auditing of providers and beneficiaries.
[1]
Melissa Popkoski.
Statistical Issues in Insurance/Payor Processes
,
2012
.
[2]
Deepak Agarwal,et al.
Predictive discrete latent factor models for large scale dyadic data
,
2007,
KDD '07.
[3]
John A. Major,et al.
EFD: A Hybrid Knowledge/Statistical-Based System for the Detection of Fraud
,
2002
.
[4]
David J. Hand,et al.
Statistical fraud detection: A review
,
2002
.
[5]
P E Kalb,et al.
Health care fraud and abuse.
,
1999,
JAMA.
[6]
Michael I. Jordan,et al.
Latent Dirichlet Allocation
,
2001,
J. Mach. Learn. Res..
[7]
Dario Gregori,et al.
Quality of Electronic Medical Records
,
2012
.
[8]
G. Casella,et al.
Explaining the Gibbs Sampler
,
1992
.
[9]
George M. Church,et al.
Biclustering of Expression Data
,
2000,
ISMB.
[10]
Rasim Muzaffer Musal.
Two models to investigate Medicare fraud within unsupervised databases
,
2010,
Expert Syst. Appl..
[11]
Graham J. Williams,et al.
On-line unsupervised outlier detection using finite mixtures with discounting learning algorithms
,
2000,
KDD '00.
[12]
D. Binder.
Bayesian cluster analysis
,
1978
.
[13]
Jionghua Jin,et al.
A survey on statistical methods for health care fraud detection
,
2008,
Health care management science.