Analysis of diabetic patients through their examination history

The analysis of medical data is a challenging task for health care systems, since a huge amount of useful knowledge can be mined automatically to support both physicians and health care organizations. This paper proposes a data analysis framework based on a multiple-level clustering technique to identify the examination pathways commonly followed by patients with a given disease. This knowledge can help health care organizations evaluate the medical treatments usually adopted and, in turn, the costs incurred. The multiple-level strategy makes it possible to cluster patient examination datasets with a variable distribution. To measure the relevance of specific examinations for a given disease complication, patient examination data are represented in the Vector Space Model using the TF-IDF method. As a case study, the approach is applied to the diabetic care scenario. The experimental validation, performed on a real collection of diabetic patients, shows that the approach is effective in identifying groups of patients with a similar examination history and increasing severity of diabetes complications.
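To make the representation step concrete, the sketch below shows one way a patient's examination history could be mapped to a TF-IDF vector and compared with another patient's. It is a minimal illustration, not the paper's implementation: the patient identifiers, the examination names, and the specific weighting (relative frequency multiplied by log(N/df)) are assumptions, and the framework itself may use a different TF-IDF variant and similarity measure.

```python
import math
from collections import Counter


def tfidf_vectors(histories):
    """Turn each patient's list of examinations into a TF-IDF vector.

    `histories` maps a patient id to the list of examinations undergone
    (repetitions allowed). The weighting used here is an assumed, common
    variant: tf = relative frequency, idf = log(N / df).
    """
    n_patients = len(histories)
    # Document frequency: number of patients with at least one occurrence.
    df = Counter(exam for exams in histories.values() for exam in set(exams))
    vocabulary = sorted(df)
    vectors = {}
    for patient, exams in histories.items():
        counts = Counter(exams)
        total = len(exams)
        vectors[patient] = [
            (counts[exam] / total) * math.log(n_patients / df[exam]) if total else 0.0
            for exam in vocabulary
        ]
    return vocabulary, vectors


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Hypothetical toy data: three patients and their examination histories.
histories = {
    "p1": ["glucose", "hba1c", "glucose"],
    "p2": ["glucose", "hba1c", "eye_exam"],
    "p3": ["glucose", "eye_exam", "eye_exam", "retinal_laser"],
}
vocabulary, vectors = tfidf_vectors(histories)
```

In this toy data every patient has a `glucose` test, so its weight is zero for all of them, while the rarer `retinal_laser` examination carries the most weight for `p3`. This is the property that lets TF-IDF highlight examinations tied to a specific complication rather than routine ones.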
