Discovering Interacting Artifacts from ERP Systems

Enterprise Resource Planning (ERP) systems are widely used to manage business documents along a business processes and allow very detailed recording of event data of past process executions and involved documents. This recorded event data is the basis for auditing and detecting unusual flows. Process mining techniques can analyze event data of processes stored in linear event logs to discover a process model that reveals unusual executions. Existing approaches to obtain linear event logs from ERP data require a single case identifier to which all behavior can be related. However, in ERP systems processes such as Order to Cash operate on multiple interrelated business objects, each having their own case identifier, their own behavior, and interact with each other. Forcing these into a single case creates ambiguous dependencies caused by data convergence and divergence which obscures unusual flows in the resulting process model. In this paper, we present a new semi-automatic, end-to-end approach for analyzing event data in a plain database of an ERP system for unusual executions. More precisely, we identify an artifact-centric process model describing the business objects, their life-cycles, and how the various objects interact along their life-cycles. This way, we prevent data divergence and convergence. We report on two case studies where our approach allowed to successfully analyze processes of ERP systems and reliably revealed unusual flows later confirmed by domain experts.

[1]  Sander J. J. Leemans,et al.  PM ^2 : A Process Mining Project Methodology , 2015, CAiSE.

[2]  Andreas Zeller Specifications for Free , 2011, NASA Formal Methods.

[3]  Achim D. Brucker,et al.  Secure and Compliant Implementation of Business Process-Driven Systems , 2012, Business Process Management Workshops.

[4]  Amit P. Sheth,et al.  Semantic (Web) Technology In Action: Ontology Driven Information Systems for Search, Integration and Analysis , 2003, IEEE Data Eng. Bull..

[5]  Andrea Mocci,et al.  Detecting component changes at run time with behavior models , 2012, Computing.

[6]  Zhendong Su,et al.  Javert: fully automatic mining of general temporal properties from dynamic traces , 2008, SIGSOFT '08/FSE-16.

[7]  Bart Baesens,et al.  A multi-dimensional quality assessment of state-of-the-art process discovery algorithms using real-life event logs , 2012, Inf. Syst..

[8]  Wil M. P. van der Aalst,et al.  Workflow mining: discovering process models from event logs , 2004, IEEE Transactions on Knowledge and Data Engineering.

[9]  Thomas R. Gross,et al.  Automatic Generation of Object Usage Specifications from Large Method Traces , 2009, 2009 IEEE/ACM International Conference on Automated Software Engineering.

[10]  Siau-Cheng Khoo,et al.  SMArTIC: towards building an accurate, robust and scalable specification miner , 2006, SIGSOFT '06/FSE-14.

[11]  Reda Alhajj Extracting the extended entity-relationship model from a legacy relational database , 2003, Inf. Syst..

[12]  X Xixi Lu,et al.  Artifact-centric log extraction and process discovery , 2013 .

[13]  Keisuke Yano,et al.  A Practical Approach to Automated Business Process Discovery , 2013, 2013 17th IEEE International Enterprise Distributed Object Computing Conference Workshops.

[14]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[15]  Majed A. Al-Mashari,et al.  Supply‐chain re‐engineering using enterprise resource planning (ERP) systems: an analysis of a SAP R/3 implementation case , 2000 .

[16]  Andreas Zeller,et al.  Detecting object usage anomalies , 2007, ESEC-FSE '07.

[17]  Dirk Fahland,et al.  Discovering interacting artifacts from ERP systems (extended version) , 2015 .

[18]  Qiang Fu,et al.  Mining program workflow from interleaved traces , 2010, KDD.

[19]  Cong Yu,et al.  Schema summarization , 2006, VLDB.

[20]  Veda C. Storey,et al.  Reverse Engineering of Relational Databases: Extraction of an EER Model from a Relational Database , 1994, Data Knowl. Eng..

[21]  Richard Hull,et al.  Business artifacts with guard-stage-milestone lifecycles: managing artifact interactions with conditions and events , 2011, DEBS '11.

[22]  Wil M. P. van der Aalst,et al.  Fuzzy Mining - Adaptive Process Simplification Based on Multi-perspective Metrics , 2007, BPM.

[23]  Abraham Silberschatz,et al.  Database System Concepts , 1980 .

[24]  Dirk Fahland,et al.  Behavioral Conformance of Artifact-Centric Process Models , 2011, BIS.

[25]  Marlon Dumas,et al.  Beyond Tasks and Gateways: Discovering BPMN Models with Subprocesses, Boundary Events and Activity Markers , 2014, BPM.

[26]  Sander J. J. Leemans,et al.  Discovering Block-Structured Process Models from Event Logs - A Constructive Approach , 2013, Petri Nets.

[27]  Bernhard Thalheim,et al.  Handbook of Conceptual Modeling - Theory, Practice, and Research Challenges , 2011 .

[28]  Michele Lanza,et al.  Visualizing Developer Interactions , 2014, 2014 Second IEEE Working Conference on Software Visualization.

[29]  Richard Hull,et al.  Business Artifacts: A Data-centric Approach to Modeling Business Operations and Processes , 2009, IEEE Data Eng. Bull..

[30]  Boudewijn F. van Dongen,et al.  Process Discovery using Integer Linear Programming , 2009, Fundam. Informaticae.

[31]  Alicia Ageno,et al.  Adaptive information extraction , 2006, CSUR.

[32]  Moe Thandar Wynn,et al.  Understanding Process Behaviours in a Large Insurance Company in Australia: A Case Study , 2013, CAiSE.

[33]  Anil Nigam,et al.  Business artifacts: An approach to operational specification , 2003, IBM Syst. J..

[34]  Johann A. Makowsky,et al.  Identifying Extended Entity-Relationship Object Structures in Relational Schemas , 1990, IEEE Trans. Software Eng..

[35]  Andreas Zeller,et al.  Mining behavior models from enterprise web applications , 2013, ESEC/FSE 2013.

[36]  Jan Mendling,et al.  Styles in business process modeling: an exploration and a model , 2015, Software & Systems Modeling.

[37]  Dirk Fahland,et al.  Conformance Checking of Interacting Processes with Overlapping Instances , 2011, BPM.

[38]  R. M. van Giessel,et al.  Process mining in SAP R/3 , 2004 .

[39]  A. J. M. M. Weijters,et al.  Flexible Heuristics Miner (FHM) , 2011, 2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[40]  Arie van Deursen,et al.  A Systematic Survey of Program Comprehension through Dynamic Analysis , 2008, IEEE Transactions on Software Engineering.

[41]  Viara Popova,et al.  Artifact Lifecycle Discovery , 2013, Int. J. Cooperative Inf. Syst..

[42]  Dirk Fahland,et al.  Automatic Discovery of Data-Centric and Artifact-Centric Processes , 2012, Business Process Management Workshops.

[43]  Boudewijn F. van Dongen,et al.  Process mining: a two-step approach to balance between underfitting and overfitting , 2008, Software & Systems Modeling.

[44]  Erhard Rahm,et al.  BIIIG: Enabling business intelligence with integrated instance graphs , 2014, 2014 IEEE 30th International Conference on Data Engineering Workshops.

[45]  Maria T. Pazienza,et al.  Information Extraction , 1997 .

[46]  Jon Espen Ingvaldsen,et al.  Preprocessing Support for Large Scale Process Mining of SAP Transactions , 2007, Business Process Management Workshops.

[47]  Wil M. P. van der Aalst,et al.  Genetic process mining: an experimental evaluation , 2007, Data Mining and Knowledge Discovery.

[48]  Renée J. Miller,et al.  Schema Discovery , 2003, IEEE Data Eng. Bull..

[49]  Dirk Fahland,et al.  Mining branching-time scenarios , 2013, 2013 28th IEEE/ACM International Conference on Automated Software Engineering (ASE).