A systematic mapping study of process mining

ABSTRACT This study systematically assesses the process mining scenario from 2005 to 2014. The analysis of 705 papers evidenced ‘discovery’ (71%) as the main type of process mining addressed and ‘categorical prediction’ (25%) as the main mining task solved. The most applied traditional technique is the ‘graph structure-based’ ones (38%). Specifically concerning computational intelligence and machine learning techniques, we concluded that little relevance has been given to them. The most applied are ‘evolutionary computation’ (9%) and ‘decision tree’ (6%), respectively. Process mining challenges, such as balancing among robustness, simplicity, accuracy and generalization, could benefit from a larger use of such techniques.

[1]  George Valença,et al.  Accepted Manuscript Requirements Engineering for Software Product Lines: a Systematic Literature Review Accepted Manuscript Requirements Engineering for Software Product Lines: a Systematic Literature Review Accepted Manuscript , 2022 .

[2]  Hao Wang,et al.  Semantic data mining: A survey of ontology-based approaches , 2015, Proceedings of the 2015 IEEE 9th International Conference on Semantic Computing (IEEE ICSC 2015).

[3]  Paula Gomes Mian,et al.  Systematic Review in Software Engineering , 2005 .

[4]  Boudewijn F. van Dongen,et al.  Workflow mining: A survey of issues and approaches , 2003, Data Knowl. Eng..

[5]  Larry Wasserman,et al.  All of Statistics: A Concise Course in Statistical Inference , 2004 .

[6]  Man Zhang,et al.  From Business Process Models to Web Services Orchestration: The Case of UML 2.0 Activity Diagram to BPEL , 2008, ICSOC.

[7]  Lawrence B. Holder,et al.  Mining Graph Data: Cook/Mining Graph Data , 2006 .

[8]  Boudewijn F. van Dongen,et al.  Process Mining: Overview and Outlook of Petri Net Discovery Algorithms , 2009, Trans. Petri Nets Other Model. Concurr..

[9]  Maria Beatriz Felgar de Toledo,et al.  A survey on reuse in the business process management domain , 2012, Int. J. Bus. Process. Integr. Manag..

[10]  Mathias Weske,et al.  Business Process Management: Concepts, Languages, Architectures , 2007 .

[11]  Frank Leymann,et al.  Web services and business process management , 2002, IBM Syst. J..

[12]  Mark von Rosing,et al.  Business Process Model and Notation - BPMN , 2015, The Complete Business Process Handbook, Vol. I.

[13]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[14]  Patrick Pantel,et al.  From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..

[15]  Susan Craw,et al.  Case-Based Reasoning , 2010, Encyclopedia of Machine Learning.

[16]  Padhraic Smyth,et al.  From Data Mining to Knowledge Discovery in Databases , 1996, AI Mag..

[17]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[18]  Bongsik Shin,et al.  Data Mining: New Arsenal for Strategic Decision Making , 1999, J. Database Manag..

[19]  Wil M. P. van der Aalst,et al.  The Application of Petri Nets to Workflow Management , 1998, J. Circuits Syst. Comput..

[20]  Francisco Curbera,et al.  Web Services Business Process Execution Language Version 2.0 , 2007 .

[21]  Haiyan Wang,et al.  A review of process mining algorithms , 2011, 2011 International Conference on Business Management and Electronic Information.

[22]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[23]  Inderjit S. Dhillon,et al.  Concept Decompositions for Large Sparse Text Data Using Clustering , 2004, Machine Learning.

[24]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[25]  Elena Deza,et al.  Encyclopedia of Distances , 2014 .

[26]  Ben He,et al.  Document Length Normalization , 2009, Encyclopedia of Database Systems.

[27]  Mathias Weske,et al.  Business Process Management: A Survey , 2003, Business Process Management.

[28]  W.M.P. van der Aalst,et al.  Business Process Management: A Comprehensive Survey , 2013 .

[29]  Pearl Brereton,et al.  Performing systematic literature reviews in software engineering , 2006, ICSE.

[30]  Claes Wohlin,et al.  Experimentation in Software Engineering , 2000, The Kluwer International Series in Software Engineering.

[31]  Maria Beatriz Felgar de Toledo,et al.  Product Line in the Business Process Management Domain , 2009 .

[32]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[33]  Efstratios Gallopoulos,et al.  TMG: A MATLAB Toolbox for Generating Term-Document Matrices from Text Collections , 2006, Grouping Multidimensional Data.

[34]  Lianping Chen,et al.  A systematic review of evaluation of variability management approaches in software product lines , 2011, Inf. Softw. Technol..

[35]  Hans-Ulrich Prokosch,et al.  Process Mining for Clinical Workflows: Challenges and Current Limitations , 2008, MIE.

[36]  Wil M. P. van der Aalst,et al.  On the suitability of UML 2.0 activity diagrams for business process modelling , 2006, APCCM.

[37]  János Abonyi,et al.  Computational Intelligence in Data Mining , 2005, Informatica.

[38]  Dr. Zbigniew Michalewicz,et al.  How to Solve It: Modern Heuristics , 2004 .

[39]  Lipo Wang,et al.  Data Mining With Computational Intelligence , 2006, IEEE Transactions on Neural Networks.

[40]  Adam A. Porter,et al.  Empirical studies of software engineering: a roadmap , 2000, ICSE '00.

[41]  Ivan Jordanov,et al.  An overview of the use of neural networks for data mining tasks , 2012, Wiley Interdiscip. Rev. Data Min. Knowl. Discov..

[42]  Mathias Weske,et al.  Advances in business process management , 2004, Data Knowl. Eng..

[43]  Lawrence B. Holder,et al.  Mining Graph Data , 2006 .

[44]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[45]  Daniel Amyot,et al.  Process mining in healthcare: a systematised literature review , 2016, Int. J. Electron. Heal..

[46]  Owen Johnson,et al.  Process mining in oncology: A literature review , 2016, 2016 6th International Conference on Information Communication and Management (ICICM).

[47]  Kumaravel Appavoo,et al.  A Review on Software Process Mining Using Petri Nets , 2016 .

[48]  Paulo Cortez,et al.  Data Mining with Neural Networks and Support Vector Machines Using the R/rminer Tool , 2010, ICDM.

[49]  Evangelos Triantaphyllou Data Mining and Knowledge Discovery via Logic-Based Methods: Theory, Algorithms, and Applications , 2010 .

[50]  Jef Wijsen,et al.  Logical Languages for Data Mining , 2003, Logics for Emerging Applications of Databases.

[51]  Robert Wrembel,et al.  Data Warehouses And Olap: Concepts, Architectures And Solutions , 2006 .

[52]  Marcelo Fantinato,et al.  Process mining through artificial neural networks and support vector machines: A systematic literature review , 2015, Bus. Process. Manag. J..

[53]  C. Humby,et al.  Process Mining: Data science in Action , 2014 .

[54]  Ken Lunn,et al.  Business processes--attempts to find a definition , 2003, Inf. Softw. Technol..

[55]  Wil M. P. van der Aalst,et al.  Workflow mining: discovering process models from event logs , 2004, IEEE Transactions on Knowledge and Data Engineering.

[56]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[57]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[58]  Fei Liu,et al.  Survey of business process management: challenges and solutions , 2016, Enterp. Inf. Syst..

[59]  Jorge Munoz-Gama,et al.  Process mining in healthcare: A literature review , 2016, J. Biomed. Informatics.

[60]  Robert Feldt,et al.  Validity Threats in Empirical Software Engineering Research - An Initial Survey , 2010, SEKE.

[61]  Evangelos Triantaphyllou,et al.  Data Mining and Knowledge Discovery via Logic-Based Methods , 2010 .

[62]  Janet L. Kolodner,et al.  Case-Based Reasoning , 1989, IJCAI 1989.

[63]  Longbing Cao Data Mining and Multi-agent Integration , 2009 .

[64]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[65]  Ashutosh Tiwari,et al.  A review of business process mining: state-of-the-art and future trends , 2008, Bus. Process. Manag. J..