Evaluation of Trace Alignment Quality and its Application in Medical Process Mining

Trace alignment algorithms have been used in process mining for discovering the consensus treatment procedures and process deviations. Different alignment algorithms, however, may produce very different results. No widely-adopted method exists for evaluating the results of trace alignment. Existing reference-free evaluation methods cannot adequately and comprehensively assess the alignment quality. We analyzed and compared the existing evaluation methods, identifying their limitations, and introduced improvements in two reference-free evaluation methods. Our approach assesses the alignment result globally instead of locally, and therefore helps the algorithm to optimize overall alignment quality. We also introduced a novel metric to measure the alignment complexity, which can be used as a constraint on alignment algorithm optimization. We tested our evaluation methods on a trauma resuscitation dataset and provided the medical explanation of the activities and patterns identified as deviations using our proposed evaluation methods.

[1]  Jorge Munoz-Gama,et al.  Process mining in healthcare: A literature review , 2016, J. Biomed. Informatics.

[2]  Cw Christian Günther,et al.  Towards an evaluation framework for process mining algorithms , 2007 .

[3]  Christus,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .

[4]  Wil M. P. van der Aalst,et al.  Trace Alignment in Process Mining: Opportunities for Process Diagnostics , 2010, BPM.

[5]  Iain M. Wallace,et al.  M-Coffee: combining multiple sequence alignment methods with T-Coffee , 2006, Nucleic acids research.

[6]  Bart Baesens,et al.  A multi-dimensional quality assessment of state-of-the-art process discovery algorithms using real-life event logs , 2012, Inf. Syst..

[7]  Jenny Dankelman,et al.  Workflow mining and outlier detection from clinical activity logs , 2012, J. Biomed. Informatics.

[8]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[9]  C. Sander,et al.  Database of homology‐derived protein structures and the structural meaning of sequence alignment , 1991, Proteins.

[10]  Olivier Poch,et al.  A comprehensive comparison of multiple sequence alignment programs , 1999, Nucleic Acids Res..

[11]  J. M. Sauder,et al.  Large‐scale comparison of protein sequence alignment algorithms with structure alignments , 2000, Proteins.

[12]  Boudewijn F. van Dongen,et al.  Alignment Based Precision Checking , 2012, Business Process Management Workshops.

[13]  Moe Thandar Wynn,et al.  Process Mining for Clinical Processes , 2015, ACM Trans. Manag. Inf. Syst..

[14]  Alessandro Sperduti,et al.  PLG: A Framework for the Generation of Business Process Models and Their Execution Logs , 2010, Business Process Management Workshops.

[15]  Kimmen Sjölander,et al.  A comparison of scoring functions for protein sequence profile alignment , 2004, Bioinform..

[16]  Olivier Poch,et al.  BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs , 1999, Bioinform..

[17]  Wil M. P. van der Aalst,et al.  The Need for a Process Mining Evaluation Framework in Research and Practice , 2007, Business Process Management Workshops.

[18]  Gaston H. Gonnet,et al.  Evaluation Measures of Multiple Sequence Alignments , 2000, J. Comput. Biol..

[19]  Wil M. P. van der Aalst,et al.  Process diagnostics using trace alignment: Opportunities, issues, and challenges , 2012, Inf. Syst..

[20]  P. Hogeweg,et al.  The alignment of sets of sequences and the construction of phyletic trees: An integrated method , 2005, Journal of Molecular Evolution.

[21]  Ivan Marsic,et al.  Duration-Aware Alignment of Process Traces , 2016, ICDM.

[22]  Lawrence L. Wu Some Comments on “Sequence Analysis and Optimal Matching Methods in Sociology: Review and Prospect” , 2000 .