Semi-supervised fuzzy co-clustering for hospital-cost analysis from electronic medical records

It has been widely recognized that decision-making is a crucial part of hospital management which goes through a process of medical behavior. Even though data mining and knowledge discovery techniques have been used to clinical medicine frequently, little research has been conducted on hospital decision-making especially hospital-cost analysis on treatment therapies among inpatients which is considered as an important aspect of annual hospital evaluation and accreditation. In this paper, we propose a novel semi-supervised fuzzy co-clustering method for hospital-cost analysis from electronic medical records. Fuzzy co-clustering is a well-known technique that performs simultaneous fuzzy clustering of objects and features which result in dynamic dimensionality reduction mechanism for categorizing high-dimensional data. However, in many real-world applications, prior knowledge of a dataset is actually available to the users; thus it is necessary to integrate this information in the clustering process. This approach is called semi-supervised fuzzy co-clustering which has been investigated and developed further within an application to hospital-cost analysis of Hanoi Medical University Hospital (HMUH), Vietnam. The findings of the paper suggest the most crucial factors for medical expense in HMUH which are significant to gradually reduce the cost of treatment meanwhile improve the quality of services.

[1]  Raghu Krishnapuram,et al.  Fuzzy co-clustering of documents and keywords , 2003, The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ '03..

[2]  Yang Yan,et al.  Fuzzy semi-supervised co-clustering for text documents , 2013, Fuzzy Sets Syst..

[3]  William-Chandra Tjhi,et al.  A partitioning based algorithm to fuzzy co-cluster documents and words , 2006, Pattern Recognit. Lett..

[4]  Hidetomo Ichihashi,et al.  Fuzzy clustering for categorical multivariate data , 2001, Proceedings Joint 9th IFSA World Congress and 20th NAFIPS International Conference (Cat. No. 01TH8569).

[5]  Hichem Frigui,et al.  Simultaneous Clustering and Dynamic Keyword Weighting for Text Documents , 2004 .

[6]  C. Green,et al.  Applied Methods of Cost–Benefit Analysis in Health Care , 2012 .

[7]  Ping He,et al.  Semi-supervised clustering via multi-level random walk , 2014, Pattern Recognit..

[8]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[9]  Ian Davidson,et al.  Visual Data Mining: Techniques and Tools for Data Visualization and Mining , 2002 .

[10]  Hsin-Yi Chen,et al.  Semi-supervised clustering with discriminative random fields , 2012, Pattern Recognit..

[11]  William-Chandra Tjhi,et al.  A heuristic-based fuzzy co-clustering algorithm for categorization of high-dimensional data , 2008, Fuzzy Sets Syst..

[12]  Ricardo J. G. B. Campello,et al.  Relative clustering validity criteria: A comparative overview , 2010 .

[13]  Yang Yan,et al.  Semi-supervised fuzzy co-clustering algorithm for document categorization , 2011, Knowledge and Information Systems.