WordNet based Semantic Similarity Measures for Process Model Matching

Process Model Matching (PMM) refers to the automatic identification of corresponding activities between a pair of process models. Due to the wider applicability of PMM techniques several semantic matching techniques have been proposed. However, these techniques focus on utilizing few word-to-word (word-level) similarity measures, without giving due consideration to activitylevel aggregation methods. The inadequate attention to the choice of activitylevel methods limit the effectiveness of the matching techniques. Furthermore, there are some WordNet-based semantic similarity measures that have shown promising results for various text matching tasks. However, the effectiveness of these measures has never been evaluated in the context of PMM. To that end, in this paper we have used five word-level semantic similarity measures and three sentence-level aggregation methods to experimentally evaluate the effectiveness of their 15 combinations for PMM. The experiments are performed on the three widely used PMMC’15 datasets. From the results we conclude that, a) Jiang similarity is more suitable than the mostly used Lin similarity, and b) QAP is the most suitable sentence-level aggregation method.

[1]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[2]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[3]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[4]  Christiane Fellbaum,et al.  Combining Local Context and Wordnet Similarity for Word Sense Identification , 1998 .

[5]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[6]  Heiner Stuckenschmidt,et al.  Probabilistic Evaluation of Process Model Matching Techniques , 2016, ER.

[7]  Janina Fengel,et al.  Semantic technologies for aligning heterogeneous business process models , 2014, Bus. Process. Manag. J..

[8]  Tao Jin,et al.  Efficient querying of large process model repositories , 2013, Comput. Ind..

[9]  Remco M. Dijkman,et al.  Measuring Similarity between Business Process Models , 2008, CAiSE.

[10]  Andrew McCallum,et al.  Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email , 2007, J. Artif. Intell. Res..

[11]  Horia Ciocarlie,et al.  Similarity of business process models in a modular design , 2016, 2016 IEEE 11th International Symposium on Applied Computational Intelligence and Informatics (SACI).

[12]  Wineke A. M. van Lent,et al.  Similarity of business process models : metrics and evaluation , 2009 .

[13]  Remco M. Dijkman,et al.  Probabilistic Optimization of Semantic Process Model Matching , 2012, BPM.

[14]  Peter Loos,et al.  The Process Model Matching Contest 2015 , 2013, EMISA.

[15]  Marc Ehrig,et al.  Measuring Similarity between Semantic Business Process Models , 2007, APCCM.

[16]  Ralf Laue,et al.  Similarity of Business Process Models—A State-of-the-Art Analysis , 2017, ACM Comput. Surv..

[17]  Jan Mendling,et al.  Increasing Recall of Process Model Matching by Improved Activity Label Matching , 2013, BPM.

[18]  Evgeniy Gabrilovich,et al.  Computing Semantic Relatedness Using Wikipedia-based Explicit Semantic Analysis , 2007, IJCAI.

[19]  Heiner Stuckenschmidt,et al.  Ranking-Based Evaluation of Process Model Matching - (Short Paper) , 2017, OTM Conferences.

[20]  Hanêne Ben-Abdallah,et al.  Business process model matching: An approach based on semantics and structure , 2015, 2015 12th International Joint Conference on e-Business and Telecommunications (ICETE).

[21]  Jan Mendling,et al.  Enabling Reuse of Process Models through the Detection of Similar Process Parts , 2012, Business Process Management Workshops.

[22]  Carole A. Goble,et al.  Investigating Semantic Similarity Measures Across the Gene Ontology: The Relationship Between Sequence and Annotation , 2003, Bioinform..

[23]  Vasile Rus,et al.  An Optimal Quadratic Approach to Monolingual Paraphrase Alignment , 2015, NODALIDA.

[24]  Janina Fengel,et al.  Semantics-Based Business Process Model Similarity , 2012, BIS.

[25]  Cyrus Rashtchian,et al.  Every Picture Tells a Story: Generating Sentences from Images , 2010, ECCV.

[26]  Horia Ciocarlie,et al.  Merging business processes for a common workflow in an organizational collaborative scenario , 2015, 2015 19th International Conference on System Theory, Control and Computing (ICSTCC).

[27]  Remco M. Dijkman,et al.  Report: The Process Model Matching Contest 2013 , 2013, Business Process Management Workshops.

[28]  Ion Androutsopoulos,et al.  A Survey of Paraphrasing and Textual Entailment Methods , 2009, J. Artif. Intell. Res..

[29]  Andreas Oberweis,et al.  Triple-S: A Matching Approach for Petri Nets on Syntactic, Semantic and Structural level , 2013, BPM 2013.

[30]  Heiner Stuckenschmidt,et al.  Overcoming individual process model matcher weaknesses using ensemble matching , 2017, Decis. Support Syst..

[31]  Bing Liu,et al.  Sentiment Analysis and Opinion Mining , 2012, Synthesis Lectures on Human Language Technologies.

[32]  Rada Mihalcea,et al.  Measuring the Semantic Similarity of Texts , 2005, EMSEE@ACL.

[33]  Andreas Oberweis,et al.  How to detect semantic business process model variants? , 2007, SAC '07.

[34]  Peter Loos,et al.  An Approach for Semantic Business Process Model Matching using Supervised Machine Learning , 2016, ECIS.

[35]  Mohammed Haddad,et al.  String Comparators Based Algorithms for Process Model Matchmaking , 2012, 2012 IEEE Ninth International Conference on Services Computing.

[36]  Chong Wang,et al.  Reading Tea Leaves: How Humans Interpret Topic Models , 2009, NIPS.

[37]  Hajo A. Reijers,et al.  How to Make Process Model Matching Work Better? An Analysis of Current Similarity Measures , 2017, BIS.

[38]  Daniela Grigori,et al.  BPEL Processes Matchmaking for Service Discovery , 2006, OTM Conferences.

[39]  Vasile Rus,et al.  A Comparison of Greedy and Optimal Assessment of Natural Language Student Input Using Word-to-Word Similarity Metrics , 2012, BEA@NAACL-HLT.

[40]  Jan Mendling,et al.  Listen to Me: Improving Process Model Matching through User Feedback , 2014, BPM.

[41]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[42]  Remco M. Dijkman,et al.  Similarity Search of Business Process Models , 2009, IEEE Data Eng. Bull..

[43]  Michael Niemann,et al.  Comparison and retrieval of process models using related cluster pairs , 2012, Comput. Ind..