MeTA

Physicians and health care organizations always collect large amounts of data during patient care. These large and high-dimensional datasets are usually characterized by an inherent sparseness. Hence, analyzing these datasets to figure out interesting and hidden knowledge is a challenging task. This article proposes a new data mining framework based on generalized association rules to discover multiple-level correlations among patient data. Specifically, correlations among prescribed examinations, drugs, and patient profiles are discovered and analyzed at different abstraction levels. The rule extraction process is driven by a taxonomy to generalize examinations and drugs into their corresponding categories. To ease the manual inspection of the result, a worthwhile subset of rules (i.e., nonredundant generalized rules) is considered. Furthermore, rules are classified according to the involved data features (medical treatments or patient profiles) and then explored in a top-down fashion: from the small subset of high-level rules, a drill-down is performed to target more specific rules. The experiments, performed on a real diabetic patient dataset, demonstrate the effectiveness of the proposed approach in discovering interesting rule groups at different abstraction levels.

[1]  Mobyen Uddin Ahmed,et al.  Mining rare cases in post-operative pain by means of outlier detection , 2011, 2011 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT).

[2]  Asha Gowda Karegowda,et al.  Cascading K-means Clustering and K-Nearest Neighbor Classifier for Categorization of Diabetic Patients , 2012 .

[3]  C M Ashton,et al.  International Classification of Diseases, 9th Revision, Clinical Modification codes in discharge abstracts are poor measures of complication occurrence in medical inpatients. , 1997, Medical care.

[4]  Masaru Kitsuregawa,et al.  FP-tax: tree structure based generalized association rule mining , 2004, DMKD '04.

[5]  Nikolaj Tatti,et al.  Probably the best itemsets , 2010, KDD.

[6]  Elena Baralis,et al.  Early prediction of the highest workload in incremental cardiopulmonary tests , 2013, ACM Trans. Intell. Syst. Technol..

[7]  Fabrizio Riguzzi,et al.  Algorithms for Efficiently and Effectively Using Background Knowledge in Tertius , 2006 .

[8]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[9]  Elena Baralis,et al.  Analysis of Medical Pathways by Means of Frequent Closed Sequences , 2010, KES.

[10]  Tai-Hsi Wu,et al.  Using data mining techniques to predict hospitalization of hemodialysis patients , 2011, Decis. Support Syst..

[11]  Andrzej Bartke,et al.  Physiological Basis of Aging and Geriatrics, Fourth Edition, Paola S. Timiras (Ed.). Informa Healthcare USA, Inc. (2007) , 2008 .

[12]  Ramakrishnan Srikant,et al.  Mining generalized association rules , 1995, Future Gener. Comput. Syst..

[13]  Jun Wei Liu,et al.  Mining Association Rules in Spatio‐Temporal Data: An Analysis of Urban Socioeconomic and Land Cover Change , 2005, Trans. GIS.

[14]  Jiawei Han,et al.  Mining Multiple-Level Association Rules in Large Databases , 1999, IEEE Trans. Knowl. Data Eng..

[15]  Luca Cagliero,et al.  Itemset generalization with cardinality-based constraints , 2013, Inf. Sci..

[16]  Milos Hauskrecht,et al.  A temporal pattern mining approach for classifying electronic health record data , 2013, ACM Trans. Intell. Syst. Technol..

[17]  Jaideep Srivastava,et al.  Selecting the right interestingness measure for association patterns , 2002, KDD.

[18]  Luca Cagliero,et al.  Frequent weighted itemset mining from gene expression data , 2013, 13th IEEE International Conference on BioInformatics and BioEngineering.

[19]  Hai Yang,et al.  ACM Transactions on Intelligent Systems and Technology - Special Section on Urban Computing , 2014 .

[20]  Jilles Vreeken,et al.  Tell me what i need to know: succinctly summarizing data with itemsets , 2011, KDD.

[21]  Thanaruk Theeramunkong,et al.  A new method for finding generalized frequent itemsets in generalized association rule mining , 2002, Proceedings ISCC 2002 Seventh International Symposium on Computers and Communications.

[22]  W. Herman,et al.  Treatment of Type 1 Diabetes: Synopsis of the 2017 American Diabetes Association Standards of Medical Care in Diabetes , 2017, Annals of Internal Medicine.

[23]  Elena Baralis,et al.  Analysis of diabetic patients through their examination history , 2013, Expert Syst. Appl..

[24]  Luca Cagliero,et al.  Support driven opportunistic aggregation for generalized itemset extraction , 2010, 2010 5th IEEE International Conference Intelligent Systems.

[25]  Gyeong Ho Lee,et al.  Diagnostic Analysis of Patients with Essential Hypertension Using Association Rule Mining , 2010, Healthcare informatics research.

[26]  Luca Cagliero Discovering Temporal Change Patterns in the Presence of Taxonomies , 2013, IEEE Transactions on Knowledge and Data Engineering.

[27]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[28]  Ramakrishnan Srikant,et al.  Mining Association Rules with Item Constraints , 1997, KDD.

[29]  Rajeev Motwani,et al.  Beyond market baskets: generalizing association rules to correlations , 1997, SIGMOD '97.

[30]  Yukie Ikedaa,et al.  Knowledge Based and Intelligent Information and Engineering Systems An Evacuation Route Planning for Safety Route Guidance System after Natural Disaster Using Multi-Objective Genetic Algorithm , 2016 .

[31]  Tim Nolan,et al.  International Diabetes Federation. , 2013, Diabetes research and clinical practice.

[32]  Milos Hauskrecht,et al.  A Bayesian Scoring Technique for Mining Predictive and Non-Spurious Rules , 2012, ECML/PKDD.

[33]  Luca Cagliero,et al.  Misleading Generalized Itemset discovery , 2014, Expert Syst. Appl..

[34]  Corrado Loglisci,et al.  Mining Generalized Association Rules on Biomedical Literature , 2005, IEA/AIE.

[35]  Durga Toshniwal,et al.  Classification of type-2 diabetic patients by using Apriori and predictive Apriori , 2011, Int. J. Comput. Vis. Robotics.

[36]  P. Loy International Classification of Diseases--9th revision. , 1978, Medical record and health care information journal.

[37]  Xuehui Meng,et al.  Comparison of three data mining models for predicting diabetes or prediabetes by risk factors , 2013, The Kaohsiung journal of medical sciences.

[38]  Guisheng Yin,et al.  GMA: An Approach for Association Rules Mining on Medical Images , 2012, ICIC.

[39]  Maguelonne Teisseire,et al.  Healthcare Trajectory Mining by Combining Multidimensional Component and Itemsets , 2012, NFMCP.

[40]  Ramakrishnan Srikant,et al.  Fast Algorithms for Mining Association Rules in Large Databases , 1994, VLDB.

[41]  W. Heiser,et al.  The identification of Parkinson's disease subtypes using cluster analysis: A systematic review , 2010, Movement disorders : official journal of the Movement Disorder Society.

[42]  W. Cefalu Diabetes Care: “State of the Union” , 2012, Diabetes Care.

[43]  Pang-Ning Tan,et al.  Interestingness Measures for Association Patterns : A Perspective , 2000, KDD 2000.

[44]  Tobias Scheffer,et al.  Finding association rules that trade support optimally against confidence , 2001, Intell. Data Anal..

[45]  Elena Baralis,et al.  Analysis of diagnostic pathways for colon cancer , 2012 .

[46]  Carlo Combi,et al.  Extraction, Analysis, and Visualization of Temporal Association Rules from Interval-Based Clinical Data , 2013, AIME.

[47]  ICD-9-CM. International Classification of Diseases, 9th revision, Clinical Modification. 3d edition, volumes 1, 2 and 3. Official authorized addendum effective October 1, 1990--HCFA. , 1990, Journal.

[48]  Benjamin Littenberg,et al.  Exploring Generalized Association Rule Mining for Disease Co-Occurrences , 2012, AMIA.

[49]  Yi-Ping Phoebe Chen,et al.  Association rule mining to detect factors which contribute to heart disease in males and females , 2013, Expert Syst. Appl..

[50]  Nicolas Pasquier,et al.  Discovering Frequent Closed Itemsets for Association Rules , 1999, ICDT.

[51]  Mohammed J. Zaki Mining Non-Redundant Association Rules , 2004, Data Min. Knowl. Discov..