Mining Anomalies in Medicare Big Data Using Patient Rule Induction Method

The public health infrastructure delivers proper health care services as part of the basic needs of the general population. The health care system in the United States is rapidly changing in order to provide a better and convenient healthcare system to the public. Unfortunately, this comprehensive expand has also given rise to healthcare frauds in recent years where losses surge up to $1.8 billion in the country. Organizations such as the Center for Medicare Services (CMS) have started providing accesses to comprehensive medical big data to promote the identification of healthcare frauds as an important research topic. In this paper, we will use the Patient Rule Induction Method (PRIM) based bump hunting method to identify the spaces of higher modes and masses to indicate the peak anomalies in the CMS 2014 dataset. By applying our framework, we can find a way to observe anomalies, which can be attributed to frauds in legal medical practices or other interesting insights in the CMS dataset. This will enable us to characterize the attribute space and explain the events incurring losses to the medicare/medicaid program. The proposed framework is compared with several methods to illustrate the efficiency and effectiveness of the proposed framework for fraud detection.

[1]  Nicholas I. Fisher,et al.  Bump hunting in high-dimensional data , 1999, Stat. Comput..

[2]  Rangasami L. Kashyap,et al.  Augmented transition networks as video browsing models for multimedia databases and multimedia information systems , 1999, Proceedings 11th International Conference on Tools with Artificial Intelligence.

[3]  Shu-Ching Chen,et al.  Video Semantic Concept Discovery using Multimodal-Based Association Classification , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[4]  Min Chen,et al.  Video Semantic Event/Concept Detection Using a Subspace-Based Multimedia Data Mining Framework , 2008, IEEE Transactions on Multimedia.

[5]  Rangasami L. Kashyap,et al.  Semantic Models for Multimedia Database Searching and Browsing , 2000, Advances in Database Systems.

[6]  Chengcui Zhang,et al.  An intelligent framework for spatio-temporal vehicle tracking , 2001, ITSC 2001. 2001 IEEE Intelligent Transportation Systems. Proceedings (Cat. No.01TH8585).

[7]  Chao Chen,et al.  Web media semantic concept retrieval via tag removal and model fusion , 2013, ACM Trans. Intell. Syst. Technol..

[8]  Shu-Ching Chen,et al.  Network intrusion detection through Adaptive Sub-Eigenspace Modeling in multiagent systems , 2007, ACM Trans. Auton. Adapt. Syst..

[9]  Min Chen,et al.  A multimodal data mining framework for soccer goal detection based on decision tree logic , 2006, Int. J. Comput. Appl. Technol..

[10]  Xiuqi Li,et al.  Image Retrieval By Color , Texture , And Spatial Information , 2002 .

[11]  Choochart Haruechaiyasak,et al.  Collaborative Filtering by Mining Association Rules from User Access Sequences , 2005, International Workshop on Challenges in Web Information Retrieval and Integration.

[12]  Mei-Ling Shyu,et al.  Leveraging Concept Association Network for Multimedia Rare Concept Mining and Retrieval , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[13]  Choochart Haruechaiyasak,et al.  Category cluster discovery from distributed WWW directories , 2003, Inf. Sci..

[14]  Shu-Ching Chen,et al.  Video semantic concept detection via associative classification , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[15]  Chengcui Zhang,et al.  Innovative Shot Boundary Detection for Video Indexing , 2005 .

[16]  Misop Han,et al.  Variability in Medicare utilization and payment among urologists. , 2015, Urology.

[17]  Mei-Ling Shyu,et al.  Handling nominal features in anomaly intrusion detection problems , 2005, 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications (RIDE-SDMA'05).

[18]  Si Wu,et al.  Improving support vector machine classifiers by modifying kernel functions , 1999, Neural Networks.

[19]  Varun Chandola,et al.  Knowledge discovery from massive healthcare claims data , 2013, KDD.

[20]  Xin Huang,et al.  User Concept Pattern Discovery Using Relevance Feedback And Multiple Instance Learning For Content-Based Image Retrieval , 2002, MDM/KDD.

[21]  Chengcui Zhang,et al.  Learning-based spatio-temporal vehicle tracking and indexing for transportation multimedia database systems , 2003, IEEE Trans. Intell. Transp. Syst..

[22]  Mark R. Segal,et al.  Machine Learning Benchmarks and Random Forest Regression , 2004 .

[23]  Mei-Ling Shyu,et al.  Effective Feature Space Reduction with Imbalanced Data for Semantic Concept Detection , 2008, 2008 IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing (sutc 2008).

[24]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[25]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[26]  Xiuqi Li,et al.  Web document classification based on fuzzy association , 2002, Proceedings 26th Annual International Computer Software and Applications.

[27]  Rangasami L. Kashyap,et al.  Temporal And Spatial Semantic Models For Multimedia Presentations , 1997 .

[28]  Jean-Eudes Dazard,et al.  Local Sparse Bump Hunting , 2010, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[29]  Shu-Ching Chen,et al.  Effective supervised discretization for classification based on correlation maximization , 2011, 2011 IEEE International Conference on Information Reuse & Integration.

[30]  Rangasami L. Kashyap,et al.  Augmented Transition Network as a Semantic Model for Video Data , 2001 .

[31]  Nitesh V. Chawla,et al.  Does Medical School Training Relate to Practice? Evidence from Big Data , 2015, Big Data.

[32]  Stuart Harvey Rubin,et al.  A Human-Centered Multiple Instance Learning Framework for Semantic Video Retrieval , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[33]  Joseph E. Beck,et al.  Naive Bayes Classifiers for User Modeling , 1999 .

[34]  Min Chen,et al.  A latent semantic indexing based method for solving multiple instance learning problem in region-based image retrieval , 2005, Seventh IEEE International Symposium on Multimedia (ISM'05).

[35]  Mei-Ling Shyu,et al.  Weighted Association Rule Mining for Video Semantic Detection , 2010, Int. J. Multim. Data Eng. Manag..

[36]  Rangasami L. Kashyap,et al.  Identifying Overlapped Objects for Video Indexing and Modeling in Multimedia Database Systems , 2001, Int. J. Artif. Intell. Tools.

[37]  Lewis Morris,et al.  Combating fraud in health care: an essential component of any cost containment strategy. , 2009, Health affairs.

[38]  Rangasami L. Kashyap,et al.  Generalized Affinity-Based Association Rule Mining for Multimedia Database Queries , 2001, Knowledge and Information Systems.