Similarity Measure Between Patient Traces for Clinical Pathway Analysis: Problem, Method, and Applications

Clinical pathways leave traces, described as event sequences with regard to a mixture of various latent treatment behaviors. Measuring similarities between patient traces can profitably be exploited further as a basis for providing insights into the pathways, and complementing existing techniques of clinical pathway analysis (CPA), which mainly focus on looking at aggregated data seen from an external perspective. Most existing methods measure similarities between patient traces via computing the relative distance between their event sequences. However, clinical pathways, as typical human-centered processes, always take place in an unstructured fashion, i.e., clinical events occur arbitrarily without a particular order. Bringing order in the chaos of clinical pathways may decline the accuracy of similarity measure between patient traces, and may distort the efficiency of further analysis tasks. In this paper, we present a behavioral topic analysis approach to measure similarities between patient traces. More specifically, a probabilistic graphical model, i.e., latent Dirichlet allocation (LDA), is employed to discover latent treatment behaviors of patient traces for clinical pathways such that similarities of pairwise patient traces can be measured based on their underlying behavioral topical features. The presented method provides a basis for further applications in CPA. In particular, three possible applications are introduced in this paper, i.e., patient trace retrieval, clustering, and anomaly detection. The proposed approach and the presented applications are evaluated via a real-world dataset of several specific clinical pathways collected from a Chinese hospital.

[1]  Huilong Duan,et al.  Summarizing clinical pathways from event logs , 2013, J. Biomed. Informatics.

[2]  K Vanhaecht,et al.  Effects of clinical pathways in the joint replacement: a meta-analysis , 2009, BMC medicine.

[3]  Dan Gusfield,et al.  Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[4]  Huilong Duan,et al.  Similarity Measuring between Patient Traces for Clinical Pathway Analysis , 2013, AIME.

[5]  Wayne D. Gray,et al.  Basic objects in natural categories , 1976, Cognitive Psychology.

[6]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[7]  Laura Maruster,et al.  From data to knowledge: a method for modeling hospital logistic processes , 2005, IEEE Transactions on Information Technology in Biomedicine.

[8]  Yizong Cheng,et al.  Mean Shift, Mode Seeking, and Clustering , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Vipin Kumar,et al.  Finding Clusters of Different Sizes, Shapes, and Densities in Noisy, High Dimensional Data , 2003, SDM.

[10]  Fu-Ren Lin,et al.  Mining time dependency patterns in clinical pathways , 2001, Int. J. Medical Informatics.

[11]  Yildiray Kabak,et al.  Collaborative Business Process Support in eHealth: Integrating IHE Profiles Through ebXML Business Process Specification Language , 2008, IEEE Transactions on Information Technology in Biomedicine.

[12]  S. Kul,et al.  The use of survival analysis for clinical pathways , 2010 .

[13]  Roque Marín,et al.  T‐CARE: temporal case retrieval system , 2011, Expert Syst. J. Knowl. Eng..

[14]  Mu Qiao,et al.  Towards Efficient Business Process Clustering and Retrieval: Combining Language Modeling and Structure Matching , 2011, BPM.

[15]  John R. Kimberly,et al.  The globalization of managerial innovation in health care , 2008 .

[16]  D. Steed,et al.  Utility of clinical pathway and prospective case management to achieve cost and hospital stay reduction for aortic aneurysm surgery at a tertiary care hospital. , 1997, Journal of vascular surgery.

[17]  Huilong Duan,et al.  Latent Treatment Pattern Discovery for Clinical Processes , 2013, Journal of Medical Systems.

[18]  W. Bruce Croft,et al.  LDA-based document models for ad-hoc retrieval , 2006, SIGIR.

[19]  Jochen De Weerdt,et al.  Process discovery in event logs: An application in the telecom industry , 2011, Appl. Soft Comput..

[20]  Gregoris Mentzas,et al.  A Holistic Environment for the Design and Execution of Self-Adaptive Clinical Pathways , 2011, IEEE Transactions on Information Technology in Biomedicine.

[21]  Huilong Duan,et al.  On mining clinical pathway patterns from medical behaviors , 2012, Artif. Intell. Medicine.

[22]  R. Hotchkiss Integrated care pathways , 1997, BMJ.

[23]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[24]  Diogo R. Ferreira,et al.  Business process analysis in healthcare environments: A methodology based on process mining , 2012, Inf. Syst..

[25]  Dan Gusfield,et al.  Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[26]  Joris van de Klundert,et al.  Measuring clinical pathway adherence , 2010, J. Biomed. Informatics.

[27]  ChengYizong Mean Shift, Mode Seeking, and Clustering , 1995 .