Health Insurance Anomaly Detection Based on Dynamic Heterogeneous Information Network

With the development of health insurance, the consequent problem of health insurance fraud has become increasingly serious. Outlier detection is a common method for health insurance anomaly detection. Researchers use some prior knowledge to assume patterns and indicators of interest. However, fraud pattern is concealed and changeable. The method of mining anomalies from fixed patterns is difficult to meet the current needs. In order to overcome this limitation, this paper combines the rich expression ability of heterogeneous information networks to model the complex relationships between entities, and establish a health insurance business representation model. The paper explores all possible business patterns, interrelated business portfolio patterns and related indicators in the field of health insurance. Considering the dynamic nature of a network, anomaly mining is carried out from both horizontal and vertical perspectives. Among them, the horizontal comparison adopts a fixed time period. The vertical comparison dynamically adjusts the time period according to the frequency of occurrence of the health insurance pattern instance, and then performs indicator calculation and outlier detection. Finally, the experimental results on the real data set show that our approach can narrow the scope of professional review and find more records of possible fraud than traditional methods.

[1]  Marina E. Johnson,et al.  Multi-stage methodology to detect health insurance claim fraud , 2016, Health care management science.

[2]  Venkatesan Guruswami,et al.  CopyCatch: stopping group attacks by spotting lockstep behavior in social networks , 2013, WWW.

[3]  Jiawei Han,et al.  AspEm: Embedding Learning by Aspects in Heterogeneous Information Networks , 2018, SDM.

[4]  Hyun Ah Song,et al.  FRAUDAR: Bounding Graph Fraud in the Face of Camouflage , 2016, KDD.

[5]  Jos van Hillegersberg,et al.  Outlier-based Health Insurance Fraud Detection for U.S. Medicaid Data , 2014, ICEIS.

[6]  Philip S. Yu,et al.  HeteSim: A General Framework for Relevance Measure in Heterogeneous Networks , 2013, IEEE Transactions on Knowledge and Data Engineering.

[7]  Christos Faloutsos,et al.  M-Zoom: Fast Dense-Block Detection in Tensors with Quality Guarantees , 2016, ECML/PKDD.

[8]  Melih Kirlidog,et al.  A Fraud Detection Approach with Data Mining in Health Insurance , 2012 .

[9]  Christos Faloutsos,et al.  D-Cube: Dense-Block Detection in Terabyte-Scale Tensors , 2017, WSDM.

[10]  Xiang Li,et al.  Meta Structure: Computing Relevance in Large Heterogeneous Information Networks , 2016, KDD.

[11]  Yizhou Sun,et al.  RankClus: integrating clustering with ranking for heterogeneous information network analysis , 2009, EDBT '09.

[12]  Yizhou Sun,et al.  Mining Heterogeneous Information Networks: Principles and Methodologies , 2012, Mining Heterogeneous Information Networks: Principles and Methodologies.

[13]  Christos Faloutsos,et al.  CatchSync: catching synchronized behavior in large directed graphs , 2014, KDD.

[14]  Philip S. Yu,et al.  Semantic Path based Personalized Recommendation on Weighted Heterogeneous Information Networks , 2015, CIKM.

[15]  Jiawei Han,et al.  Easing Embedding Learning by Comprehensive Transcription of Heterogeneous Information Networks , 2018, KDD.

[16]  Dik Lun Lee,et al.  Meta-Graph Based Recommendation Fusion over Heterogeneous Information Networks , 2017, KDD.

[17]  Philip S. Yu,et al.  PathSim , 2011, Proc. VLDB Endow..